Smartphone speech recognition can text 3 times faster than you can type

By Luke Dormehl Published August 24, 2016

Computer dictation is a whole lot better than it was a decade ago, but exactly how much better? That was a challenge computer scientists from Stanford University, the University of Washington, and Chinese tech giant Baidu recently took on in an experiment pitting humans against the latest cutting-edge speech recognition software in both speed and accuracy.

Stanford computer science professor James Landay said the study began as a “coffee shop conversation” between himself and Stanford adjunct professor Andrew Ng, currently chief scientist at Baidu. “Andrew said that Baidu’s speech recognition tools were getting really great, but that they didn’t know the right experiment to quantify it,” Landay told Digital Trends.


Baidu’s Deep Speech 2 cloud-based speech recognition software is based on a deep learning neural network: an impressive machine learning tool that is able to train itself by analyzing enormous datasets of real speech.

“Previously, we didn’t have the data and computational ability to build these models, so that a computer could understand different accents and patterns of speech,” Landay continued.

In the end, the casual conversation between Landay and Ng turned into a full-blown experiment, involving 32 participants speaking either Chinese or English. All participants had grown up text messaging, and both groups used the standard keyboards that come with the iPhone.

For the English speakers this meant the regular iOS QWERTY keyboard, while the Mandarin speakers used Apple’s Pinyin keyboard. In both cases, speech recognition was around three times faster than typing, while the error rate was 20.4 percent lower for English speech recognition and 63.4 percent lower for the Mandarin equivalent.

“My expectation was that speech would be faster than text,” Landay said. “We know this, because you can talk faster than you can type. The problem in the past was that you got a lot of errors with speech recognition, and this slowed you down. I thought speech would prove faster. What I didn’t expect was that it would wind up being three times faster. I figured maybe we would get 50 percent faster. Instead it was much more than that.”

The test isn’t 100 percent comprehensive, of course. Currently the world’s fastest mobile keyboard (at least in English) is the third-party Fleksy keyboard. In setting the 2014 Guinness World Record for fastest texting, a user typed a 126-letter sentence in just 18.44 seconds. However, Landay noted that this study chose a regular iPhone keyboard because it gives a good indication of the typical typist. “Most people don’t take the time to learn alternative keyboards,” he said.
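For a rough sense of scale, the record quoted above can be converted into a typing speed. The sketch below is a back-of-the-envelope calculation, not a figure from the study. It assumes the common typing-test convention of five characters per word, which the article does not state, and it counts only the 126 letters mentioned.

```python
# Back-of-the-envelope conversion of the quoted record into a typing speed.
# Assumption: one "word" = 5 characters, a common typing-test convention
# that is not stated in the article.

LETTERS = 126        # sentence length, from the article
SECONDS = 18.44      # record time, from the article
CHARS_PER_WORD = 5   # assumed convention, not from the article


def typing_speed(chars, seconds, chars_per_word=CHARS_PER_WORD):
    """Return (characters per second, words per minute)."""
    cps = chars / seconds
    wpm = cps * 60 / chars_per_word
    return cps, wpm


cps, wpm = typing_speed(LETTERS, SECONDS)
print(f"{cps:.2f} characters per second, about {wpm:.0f} words per minute")
# prints "6.83 characters per second, about 82 words per minute"
```

Under that assumption the record works out to roughly 82 words per minute, a pace set by a record holder rather than the typical typist the study aimed to measure.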

As to what the study means, Landay suggests it represents an important benchmark for speech recognition. “There’s still room to improve, but we think some kind of inflection point has been passed,” he said. “Further improvements will come in recognizing names, performing better in noisy environments, etc.”

This, he said, lets developers think more seriously about incorporating speech recognition into their systems without worry. “What will increasingly make sense is relying on speech,” he said. “For example, multimodal interfaces combining speech with other elements to help people navigate. The biggest challenge, though, is going to be understanding the meaning of words and sentences. That part still has a way to go.”

