Smartphone speech recognition can text 3 times faster than you can type

Computer dictation is a whole lot better than it was a decade ago, but exactly how much better? That was a challenge computer scientists from Stanford University, the University of Washington, and Chinese tech giant Baidu recently took on in an experiment pitting humans against the latest cutting-edge speech recognition software in both speed and accuracy.

Stanford computer science professor James Landay said the study began as a “coffee shop conversation” between himself and Stanford adjunct professor Andrew Ng, currently chief scientist at Baidu. “Andrew said that Baidu’s speech recognition tools were getting really great, but that they didn’t know the right experiment to quantify it,” Landay told Digital Trends.

Baidu’s Deep Speech 2 cloud-based speech recognition software is based on a deep learning neural network: an impressive machine learning tool that is able to train itself by analyzing enormous datasets of real speech.

“Previously, we didn’t have the data and computational ability to build these models, so that a computer could understand different accents and patterns of speech,” Landay continued.

In the end, the casual conversation between Landay and Ng turned into a full-blown experiment, involving 32 participants speaking either Chinese or English. All participants had grown up text messaging, and both were using the standard keyboards which come with the iPhone.

For the English speakers this meant the regular iOS QWERTY keyboard, while the Mandarin speakers used Apple’s Pinyin keyboard. In both cases, speech recognition was around three times faster than users were able to type — while the error rate was 20.4 percent lower for the English speech recognition, and 63.4 percent lower for the Mandarin equivalent.

“My expectation was that speech would be faster than text,” Landay said. “We know this, because you can talk faster than you can type. The problem in the past was that you got a lot of errors with speech recognition, and this slowed you down. I thought speech would prove faster. What I didn’t expect was that it would wind up being three times faster. I figured maybe we would get 50 percent faster. Instead it was much more than that.”

The test isn’t 100 percent comprehensive, of course. Currently the world’s fastest mobile keyboard (at least in English) is the third-party Fleksy keyboard. In a 2014 Guinness World Record for fastest texting, a user was able to type a 126-letter sentence in just 18.44 seconds. However, Landay noted that this study chose a regular iPhone keyboard because it gives a good indication of the typical typist. “Most people don’t take the time to learn alternative keyboards,” he said.

As to what the study means, Landay suggests it represents an important benchmark for speech recognition. “There’s still room to improve, but we think some kind of inflection point has been passed,” he said. “Further improvements will come in recognizing names, performing better in noisy environments, etc.”

This, he said, opens up more possibilities for developers to think more seriously about incorporating speech recognition into their systems without worry. “What will increasingly make sense is relying on speech,” he said. “For example, multimodal interfaces combining speech with other elements to help people navigate. The biggest challenge, though, is going to be understanding the meaning of words and sentences. That part still has a way to go.”


You can now listen to Google Podcasts on your desktop without the app

The Google Podcasts app is no longer entirely necessary to listen to the podcasts it offers. With a simple tweak of the sharing URL, you can listen to a Google Podcasts podcast on your desktop or laptop without the app.
Emerging Tech

Astronomers plan to beam Earth’s greatest hits into deep space, and you can help

A new project from the SETI Institute (search for extraterrestrial intelligence) will give the public the chance to submit compositions to be beamed into space, with the aim of connecting people around the world through music.
Emerging Tech

Inflating smart pills could be a painless alternative to injections

Could an inflating pill containing hidden microneedles replace painful injections? The creators of the RaniPill robotic capsule think so — and they have the human trials to prove it.
Smart Home

Sony’s Aibo robot dog can now patrol your home for persons of interest

Sony released the all-new Aibo in the U.S. around nine months ago, and since then the robot dog has (hopefully) been melting owners' hearts with its cute looks and clever tricks. Now it has a new one up its sleeve.
Emerging Tech

A 3D printer the size of a small barn will produce entire homes in Saudi Arabia

If you’re looking for a 3D printer that can comfortably fit on the side of your desk… well, Danish company Cobod International’s enormous new 3D house printer probably isn’t for you.

Need a ride? Amazon is slashing prices on popular electric scooters

If you’re not much of a cyclist or if you’re looking for a lazier way to zip about town, an electric scooter should be right up your alley. Two of our favorites, the foldable Glion Dolly and the eco-friendly Razor scooter, are on sale…
Emerging Tech

Unexpected particle plumes discovered jetting out of asteroid Bennu

The OSIRIS-REx craft traveled to asteroid Bennu last year and won't return until 2023. But the mission is already throwing up unexpected findings, like plumes of particles which are being ejected from the surface of the asteroid.
Emerging Tech

Trip to Neptune’s moon, Triton, could inform search for extraterrestrial life

NASA has proposed sending a craft to Neptune to study its largest moon, Triton. Studying Triton could offer clues to how liquid water is maintained on planets, which may indicate what to look for when searching for life beyond our planet.
Emerging Tech

NASA’s Mars 2020 rover passes its tests with flying colors

The Mars 2020 rover team has been undertaking a series of tests to see if the craft will be able to launch, navigate, and land on the Red Planet. Called Systems Test 1, or ST1, these tests represent the first test drive of the new rover.

Light up the night! Here are the five best headlamps money can buy

Headlamps make all the difference when camping or walking the dog at night, especially when you're in need of both hands. From Petzl to Tikkid, here are some of the best headlamps on the market.
Emerging Tech

A hive of activity: Using honeybees to measure urban pollution

According to a new study from Vancouver, bees could help us understand urban pollution. Scientists have found an innovative way to measure the level of source of pollution in urban environments: by analyzing honey.
Emerging Tech

Spacewalk a success as astronauts upgrade batteries on the ISS

The International Space Station was treated to some new batteries on Friday, thanks to two NASA astronauts who took a spacewalk for nearly seven hours in order to complete the upgrades.
Emerging Tech

Awesome Tech You Can’t Buy Yet: Robotic companions and computer-aided karaoke

Check out our roundup of the best new crowdfunding projects and product announcements that hit the web this week. You may not be able to buy this stuff yet, but it's fun to gawk!
Emerging Tech

Asteroid Ryugu is porous, shaped like a spinning top, and is formed of rubble

The Japanese Space Agency has been exploring a distant asteroid named Ryugu with its probe, Hayabusa 2. Now the first results from study of the asteroid are in, with three new papers published.