Skip to main content

Smartphone speech recognition can text 3 times faster than you can type

Stanford experiment shows speech recognition writes texts more quickly than thumbs
Computer dictation is a whole lot better than it was a decade ago, but exactly how much better? That was a challenge computer scientists from Stanford University, the University of Washington, and Chinese tech giant Baidu recently took on in an experiment pitting humans against the latest cutting-edge speech recognition software in both speed and accuracy.

Stanford computer science professor James Landay said the study began as a “coffee shop conversation” between himself and Stanford adjunct professor Andrew Ng, currently chief scientist at Baidu. “Andrew said that Baidu’s speech recognition tools were getting really great, but that they didn’t know the right experiment to quantify it,” Landay told Digital Trends.

Recommended Videos

Baidu’s Deep Speech 2 cloud-based speech recognition software is based on a deep learning neural network: an impressive machine learning tool that is able to train itself by analyzing enormous datasets of real speech.

Please enable Javascript to view this content

“Previously, we didn’t have the data and computational ability to build these models, so that a computer could understand different accents and patterns of speech,” Landay continued.

In the end, the casual conversation between Landay and Ng turned into a full-blown experiment, involving 32 participants speaking either Chinese or English. All participants had grown up text messaging, and both were using the standard keyboards which come with the iPhone.

For the English speakers this meant the regular iOS QWERTY keyboard, while the Mandarin speakers used Apple’s Pinyin keyboard. In both cases, speech recognition was around three times faster than users were able to type — while the error rate was 20.4 percent lower for the English speech recognition, and 63.4 percent lower for the Mandarin equivalent.

“My expectation was that speech would be faster than text,” Landay said. “We know this, because you can talk faster than you can type. The problem in the past was that you got a lot of errors with speech recognition, and this slowed you down. I thought speech would prove faster. What I didn’t expect was that it would wind up being three times faster. I figured maybe we would get 50 percent faster. Instead it was much more than that.”

The test isn’t 100 percent comprehensive, of course. Currently the world’s fastest mobile keyboard (at least in English) is the third-party Fleksy keyboard. In a 2014 Guinness World Record for fastest texting, a user was able to type a 126-letter sentence in just 18.44 seconds. However, Landay noted that this study chose a regular iPhone keyboard because it gives a good indication of the typical typist. “Most people don’t take the time to learn alternative keyboards,” he said.

As to what the study means, Landay suggests it represents an important benchmark for speech recognition. “There’s still room to improve, but we think some kind of inflection point has been passed,” he said. “Further improvements will come in recognizing names, performing better in noisy environments, etc.”

This, he said, opens up more possibilities for developers to think more seriously about incorporating speech recognition into their systems without worry. “What will increasingly make sense is relying on speech,” he said. “For example, multimodal interfaces combining speech with other elements to help people navigate. The biggest challenge, though, is going to be understanding the meaning of words and sentences. That part still has a way to go.”

Luke Dormehl
Former Digital Trends Contributor
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…
Hyundai Ioniq 5 sets world record for greatest altitude change
hyundai ioniq 5 world record altitude change mk02 detail kv

When the Guinness World Records (GWR) book was launched in 1955, the idea was to compile facts and figures that could finally settle often endless arguments in the U.K.’s many pubs.

It quickly evolved into a yearly compilation of world records, big and small, including last year's largest grilled cheese sandwich in the world.

Read more
Global EV sales expected to rise 30% in 2025, S&P Global says
ev sales up 30 percent 2025 byd sealion 7 1stbanner l

While trade wars, tariffs, and wavering subsidies are very much in the cards for the auto industry in 2025, global sales of electric vehicles (EVs) are still expected to rise substantially next year, according to S&P Global Mobility.

"2025 is shaping up to be ultra-challenging for the auto industry, as key regional demand factors limit demand potential and the new U.S. administration adds fresh uncertainty from day one," says Colin Couchman, executive director of global light vehicle forecasting for S&P Global Mobility.

Read more
Faraday Future could unveil lowest-priced EV yet at CES 2025
Faraday Future FF 91

Given existing tariffs and what’s in store from the Trump administration, you’d be forgiven for thinking the global race toward lower electric vehicle (EV) prices will not reach U.S. shores in 2025.

After all, Chinese manufacturers, who sell the least expensive EVs globally, have shelved plans to enter the U.S. market after 100% tariffs were imposed on China-made EVs in September.

Read more