Alexa is already one smart cookie. The perennially popular smart assistant from Amazon has quickly become one of the most popular helpers on the market, and is capable of helping its users with everything from controlling their smart homes to answering pressing questions about life to making announcements to the household. But according to Rohit Prasad, the vice president and head scientist of the Alexa division at Amazon, we ain’t seen nothin’ yet. In fact, according to an interview with Prasad published in the Amazon blog, his team has only “scratched the surface of what’s possible.”
Prasad leads Alexa’s research and development in speech recognition, natural language understanding, and machine learning technologies, all in hopes of bettering users’ experiences with Echo devices. Since November 2014, Prasad and his team have shown that far-field speech recognition, even in loud environments, is possible with a high degree of accuracy. The reason for this, the executive says, is that Amazon has managed to develop a series of machine learning algorithms, data, and “immense computing power.” While conversation artificial intelligence (A.I.) has been a topic of interest among researchers for nearly five decades, it has historically been difficult for machines to not only understand, but also communicate in human language. As a result, Alexa’s ability to comprehend and respond to a “wide array of intents” makes her particularly impressive, Prasad noted.
So how exactly does Alexa work? First and foremost, your Echo device listens for a spoken audio cue, which is then converted to text by far-field automatic speech recognition (ASR) in the Amazon Web Services (AWS) cloud. Then,
Alexa then responds using text-to-speech synthesis (TTS), which helps to translate strings of words into intelligible audio. Of course, the challenge here is to ensure that
So what is it that makes Alexa more capable than other smart assistants? Apparently, it has a lot to do with the fact that
And as far as what we can look forward to from not only Alexa, but smart assistants as a whole, Prasad has a few ideas. “A.I. will have deep societal impact and will help humans learn new skills that we can’t even imagine today,” he said. “In the next five years, we will see conversational A.I. get smarter on multiple dimensions as we make further advances with machine learning and reasoning. With these advances, we will see