Why Computerized Voices Don't Sound Human

Will computerized voices ever sound human?

By Lulu Chang February 14, 2016

Photo: Kārlis Dambrāns/Flickr

Sounding robotic has never been a compliment, but with the right amount of tinkering, computer scientists and engineers are hoping that this may soon change. Computerized voices haven’t quite hit the mark yet on sounding … human, and closing that gap is one of the tech industry’s latest major efforts. As an increasing number of devices begin speaking to us, from Apple’s Siri to Amazon’s Alexa to our GPS systems, it’s becoming more important for machines to have voices we actually want to listen to.

As the New York Times reports, the relatively new focus area of “conversational agents,” part of the little-understood field of human-computer interaction design, seeks to build programs that understand language and are also able to respond to commands. Today, a computer’s voice cannot be made indistinguishable from a human’s, at least not for anything more complex than short bits of information: whether it’ll rain, for example, or when to turn left.

Part of the issue lies in “prosody,” the rhythm, stress, and intonation of speech: the capacity to stress the right syllables and say words the way an actual human would. And of course, there’s also the uniquely human ability to add emotion to pronunciation. After all, we don’t always say “good” or even “left” in the same way. Machines have yet to master that nuance.

“The problem is we don’t have good controls over how we say to these synthesizers, ‘Say this with feeling,’” Scottish computer scientist and Carnegie Mellon professor Alan Black told the Times. And it may still be some time before we’re actually able to do this at all.

But that might not be a bad thing, some say. “Jarring is the way I would put it,” Brian Langner, senior speech scientist at digital speech company ToyTalk, said about having machines sound too much like humans. “When the machine gets some of those things correct, people tend to expect that it will get everything correct.”

So no, you probably won’t be able to get Siri to sound like your mom anytime soon. But you may want to enjoy that inability while you can.
