New algorithm could help diagnose depression by analyzing the tone of your voice

The concept of an AI psychoanalyst has been in circulation for decades, tracing all the way back to Joseph Weizenbaum’s ELIZA chatterbot in the 1960s. But now researchers from the University of Southern California are taking the idea to the next level, courtesy of a machine learning algorithm designed to analyze a person’s speech patterns and help diagnose possible depression.

The tool is part of an ongoing research project called SimSensei, a Kinect-powered virtual therapist able to “read” patients’ body language for signs of anxiety, nervousness, contemplation and other emotional attributes.

More recently, however, the project has increasingly focused on not just understanding the responses given (like Apple’s Siri does, for instance), but also the manner in which they are spoken. “I’m not so interested in what people say, as how they say it,” Stefan Scherer, one of the researchers involved with the work, tells Digital Trends. “We’re focusing on aspects of speech like voice quality — from the timbre to the color of the voice: whether it’s a tense voice, a harsh voice, or a breathy voice. We want to pick up these changes and contextualize them.”

Scherer calls his work “behavioral analytics” and says that it’s all part of creating a more fully realized tool that can be used to augment the abilities of a real therapist or physician. “It provides a different set of eyes and ears that they would not normally have available,” he says.

In a recent paper, the authors of the study explain that “depressed patients often display flattened or negative affect, reduced speech variability and monotonicity in loudness and pitch, reduced speech, reduced articulation rate, increased pause duration, and varied switching pause duration. Further, depressed speech was found to show increased tension in the vocal tract and the vocal folds.” A human listener may not immediately pick up on such vocal cues.
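
To make two of those markers concrete, here is a minimal Python sketch of how pitch variability and pause duration could be measured. It is an illustration only, not the researchers' method: the function name `speech_markers`, the 10-millisecond frame length, the 200-millisecond pause threshold, and the convention that a pitch value of 0 marks a silent frame are all assumptions made for this example.

```python
from statistics import mean, pstdev


def speech_markers(pitch_hz, frame_ms=10, min_pause_ms=200):
    """Summarize pitch variability and pauses from per-frame pitch values.

    pitch_hz: one pitch estimate per frame, in Hz. A value of 0 marks a
    silent or unvoiced frame (an assumed convention for this sketch).
    """
    # Pitch variability: spread of pitch across the voiced frames only.
    voiced = [p for p in pitch_hz if p > 0]
    pitch_sd = pstdev(voiced) if len(voiced) > 1 else 0.0

    # Pauses: runs of silent frames lasting at least min_pause_ms.
    pauses_ms = []
    run = 0
    for p in pitch_hz:
        if p > 0:
            if run * frame_ms >= min_pause_ms:
                pauses_ms.append(run * frame_ms)
            run = 0
        else:
            run += 1
    # A silent run at the very end of the recording also counts.
    if run * frame_ms >= min_pause_ms:
        pauses_ms.append(run * frame_ms)

    return {
        "pitch_sd_hz": pitch_sd,
        "pause_count": len(pauses_ms),
        "mean_pause_sec": mean(pauses_ms) / 1000 if pauses_ms else 0.0,
    }


# Hypothetical input: a monotone voice with one 300 ms pause in the middle.
flat = [120] * 50 + [0] * 30 + [120] * 50
print(speech_markers(flat))
```

A monotone recording like the hypothetical one above yields a pitch spread of zero and one long pause, the pattern the paper associates with depressed speech. A real system would first need a pitch tracker to produce the per-frame values, and would combine many more features than these two.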

Looking forward, Scherer says he could see technology such as this being installed in smartphone apps, so that people can more objectively measure moods in a similar way to how the “Quantified Self” movement currently does health-tracking. “You could imagine people asking if they’ve done their 1,000 smiles in a day, or whether or not they are getting excited about things,” he says. “It could be used for both people suffering from depression but also for the general population.”

Luke Dormehl
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…