Smartphone speech recognition can text 3 times faster than you can type

Computer dictation is a whole lot better than it was a decade ago, but exactly how much better? That was a challenge computer scientists from Stanford University, the University of Washington, and Chinese tech giant Baidu recently took on in an experiment pitting humans against the latest cutting-edge speech recognition software in both speed and accuracy.

Stanford computer science professor James Landay said the study began as a “coffee shop conversation” between himself and Stanford adjunct professor Andrew Ng, currently chief scientist at Baidu. “Andrew said that Baidu’s speech recognition tools were getting really great, but that they didn’t know the right experiment to quantify it,” Landay told Digital Trends.

Baidu’s Deep Speech 2 cloud-based speech recognition software is based on a deep learning neural network: an impressive machine learning tool that is able to train itself by analyzing enormous datasets of real speech.

“Previously, we didn’t have the data and computational ability to build these models, so that a computer could understand different accents and patterns of speech,” Landay continued.

In the end, the casual conversation between Landay and Ng turned into a full-blown experiment, involving 32 participants speaking either Chinese or English. All participants had grown up text messaging, and both were using the standard keyboards which come with the iPhone.

For the English speakers this meant the regular iOS QWERTY keyboard, while the Mandarin speakers used Apple’s Pinyin keyboard. In both cases, speech recognition was around three times faster than users were able to type — while the error rate was 20.4 percent lower for the English speech recognition, and 63.4 percent lower for the Mandarin equivalent.

“My expectation was that speech would be faster than text,” Landay said. “We know this, because you can talk faster than you can type. The problem in the past was that you got a lot of errors with speech recognition, and this slowed you down. I thought speech would prove faster. What I didn’t expect was that it would wind up being three times faster. I figured maybe we would get 50 percent faster. Instead it was much more than that.”

The test isn’t 100 percent comprehensive, of course. Currently the world’s fastest mobile keyboard (at least in English) is the third-party Fleksy keyboard. In a 2014 Guinness World Record for fastest texting, a user was able to type a 126-letter sentence in just 18.44 seconds. However, Landay noted that this study chose a regular iPhone keyboard because it gives a good indication of the typical typist. “Most people don’t take the time to learn alternative keyboards,” he said.

As to what the study means, Landay suggests it represents an important benchmark for speech recognition. “There’s still room to improve, but we think some kind of inflection point has been passed,” he said. “Further improvements will come in recognizing names, performing better in noisy environments, etc.”

This, he said, opens up more possibilities for developers to think more seriously about incorporating speech recognition into their systems without worry. “What will increasingly make sense is relying on speech,” he said. “For example, multimodal interfaces combining speech with other elements to help people navigate. The biggest challenge, though, is going to be understanding the meaning of words and sentences. That part still has a way to go.”

Emerging Tech

Inflating smart pills could be a painless alternative to injections

Could an inflating pill containing hidden microneedles replace painful injections? The creators of the RaniPill robotic capsule think so — and they have the human trials to prove it.
Gaming

Your PlayStation 4 game library isn't complete without these games

Looking for the best PS4 games out there? Out of the massive crop of titles available, we selected the best you should buy. No matter what your genre of choice may be, there's something here for you.
Smart Home

The five best teeth-whitening kits you can buy on Amazon

Teeth whitening can have a major impact on a person’s smile and overall appearance. You don't necessarily have to go to the dentist to get your teeth whitened though. Here are the best teeth-whitening kits you can buy.
Emerging Tech

It’s not time travel, but scientists can turn back clock on a quantum computer

Physicists have demonstrated that they can wind back the clock on a quantum computer a fraction of a second. Don't get too excited about the prospect of human time travel any time soon, though.
Computing

At $99, Nvidia’s Jetson Nano minicomputer seeks to bring robotics to the masses

Nvidia announced a new A.I. computer, the Jetson Nano. This computer comes with an 128-core GPU that Nvidia claims can handle pretty much any A.I. framework you could imagine. At $99, it's an affordable way for A.I. newbies to get involved.
Computing

Nvidia’s A.I. Playground lets you edit photos, experience deep learning research

Nvidia is making it easier to access information on deep learning research. It has launched an online space with three demos for image editing, styling, as well as photorealistic image synthesis. 
Emerging Tech

The U.S. Army is building a giant VR battlefield to train soldiers virtually

Imagine if the U.S. Army was able to rehearse battlezone scenarios dozens, or even hundreds, or times before settling foot on actual terrain. Thanks to virtual reality, that's now a possibility.
Business

British Airways’ new Club Suite for business class comes with a door

British Airways is going after a bigger slice of the business class market with the imminent launch of the Club Suite. The plush seating option offers a more private space as well as an easier route to the bathroom.
Smart Home

Sony’s Aibo robot dog can now patrol your home for persons of interest

Sony released the all-new Aibo in the U.S. around nine months ago, and since then the robot dog has (hopefully) been melting owners' hearts with its cute looks and clever tricks. Now it has a new one up its sleeve.
Emerging Tech

A silver bullet is being aimed at the drug-resistant superbugs on the ISS

A bacteria which is benign here on Earth can mutate into a drug-resistant superbug once it enters space. Now this problem is being tackled by a team of microbiologists who have found a way to inhibit the spread of bacteria in the ISS.
Emerging Tech

Tombot is the hyper-realistic dog robot that puts Spot to shame

Forget Boston Dynamics’ Spot! When it comes to robot dogs, the folks behind a new Kickstarter campaign have plans to stake their claim as makers of man’s (and woman’s) newest best friend.
Emerging Tech

Researchers gave alligators headphones and ketamine, and all for a good cause

Researchers in Germany and the United States recently gave ketamine and earphones to alligators to monitor how they process sounds. Here's what it reveals about alligator evolution.
Emerging Tech

Cheese tastes different when it listens to Led Zeppelin, Swiss study finds

A funky new study says that exposing cheese to music changes its aroma and flavor. What’s more, the genre of music matters. Researchers from the Bern University of Arts played music to nine, 22-pound wheels of Emmental cheese.
Emerging Tech

Astronomers plan to beam Earth’s greatest hits into deep space, and you can help

A new project from the SETI Institute (search for extraterrestrial intelligence) will give the public the chance to submit compositions to be beamed into space, with the aim of connecting people around the world through music.