Sensory, a Santa Clara, California-based software company, doesn’t often find itself in the limelight. It’s typically content to let its solutions speak for themselves. The firm, which has been developing computer-driven voice technologies like speech recognition, speech and music synthesis, and speaker verification since the late ’90s, has its technology embedded in more than a billion devices around the world. The vast majority are smartphones, but Sensory has designed solutions from voice-triggered alarm clocks to the microphone used on NASA’s Mars Polar Lander. And now, starting on Wednesday, it’s making a move for markets beyond: In a partnership with retail giant Amazon, Sensory is launching a suite of tools to make Alexa Voice Service activation possible on non-Amazon-branded products.
In more concrete terms, Sensory is enabling what it calls “wakeup words” — that is, terms and phrases that trigger the always-on listening that is a hallmark of assistants like Google Now and Siri — on devices that integrate Amazon’s Alexa. “We pioneered the concept that devices don’t have to use buttons to use speech recognizers,” Todd Mozer, Sensory’s chief executive officer, told Digital Trends. “Sensory’s been around for a long time. Speech recognition and voice synthesis is very trendy right now, but we’ve been using machine learning techniques for more than a decade.”
Sensory’s solution, TrulyHandsFree, takes the form of an “assortment” of speech recognition models tailor-made for a spectrum of electronics. The voice engines, of which there are more than 20, range from no-frills, single-word models to powerful, “highly noise robust” algorithms capable of deciphering speech in loud environments. One, an ultra-low-power model designed for smartphones, smartwatches, and other battery-operated mobile devices that lack a dedicated power source, draws “less than one milliamp” of current under load, Mozer said.
Sensory is of course not new to mobile. The company has worked with Motorola, Samsung, and other smartphone makers to enable hotwords like “Hey Galaxy” and “OK Google” on flagships from the Galaxy series to the Moto X. Crucially, Sensory’s solutions interpret voice locally, on silicon within the devices themselves. Mozer said that helps to shave both processing time and excessive power draw. “The internet can do a lot, but you still need hardware to perform the listening on the device,” Mozer said.
The launch of Sensory’s Alexa wake word suite comes at a pivotal moment for speech recognition, and for Amazon’s Alexa platform in particular. The tech is becoming ubiquitous: the CoWatch became the first smartwatch to tap Alexa earlier this year, and Pebble followed suit with its clip-on Core. BMW, meanwhile, has promised to integrate Alexa into its Connected Car platform later this year, and smart home company Control4 announced it would launch Alexa on its existing line of products, also later this year.
“We can help get them on platforms that might be difficult to run on their own,” said Mozer. “We’re very focused on the area of speech for consumer electronics.”
To that end, Sensory partnered with home intercom startup Nucleus to support wake words on its hardware. And it’s extending support of the new Alexa wake word platform to AVS for Raspberry Pi, an open source project that imbues the affordable Raspberry Pi computing board with Alexa’s voice-driven intelligence. Previously, activating Alexa on the Pi required pressing a physical trigger, but starting tomorrow, project contributors will be able to tap Sensory’s low-power engine: saying “Alexa” will trigger Alexa’s listening mode.
As for the future, Sensory intends to pursue an element of increasing importance in the voice-activated assistant field: speaker identification. Mozer envisions a “layer of authentication,” or means of personalizing voice recognition to individual voices. “We have voice imprint platforms that are secure enough to do transactions,” he said. The potential is nearly endless: imagine a Google Home that automatically tailored your song requests to your musical preferences, for instance, or an Amazon Echo that could keep the kids from ordering stuff from Amazon. “Anybody can talk to a home assistant,” he said. “We can deliver that security.”