Google is recruiting Reddit users to improve speech recognition

Google Now, search giant Google’s eponymous voice assistant, has a surprisingly good grasp on the nuances of human speech. Thanks to a killer combination of machine learning and crowdsourced data, it can parse mumbles, murmurs, and even the most garbled of phrases. In August of last year, as an example, Google said it cut voice transcription errors by up to 49 percent.

But if there’s one element of linguistic diversity that’s tended to trip it up, it’s accents — only recently did Now gain official support for Indian and Australian dialects. Reportedly, though, Google has a plan to improve things: recruiting users of Reddit.

Reddit, a social network perhaps as well known for its internet activism as its controversial upper management, is reportedly serving as a recruitment pool for Google voice volunteers. The Mountain View, California-based company has retained the services of a third-party firm, Appen, that has begun hiring Reddit users — or Redditors, as they’re colloquially known — with specific accents for the purpose of improving Google’s voice recognition engine.

Gig listings by Appen began appearing this week on a number of subreddits, Reddit’s term for the individual communities that live under the broader network’s umbrella. The ads are directed equally at users searching for part-time work (Redditors of /r/slavelabour, /r/WorkOnline, and /r/beermoney, for example) and at those who live in cities with high concentrations of distinctive inflections, like /r/Edinburgh. They’re all seeking the same thing: users with particular linguistic cadences who will submit to “the [collection of] speech data.”

“I’m currently recruiting to collect … data for Google,” read one request, since removed, on /r/slavelabour. “It requires you to use an Android to complete the task. The task is recording voice prompts like ‘Indy now,’ [and] ‘Google what’s the time.’ Each phrase takes around 3-5 seconds.”

The work as a whole is fairly involved, apparently: participants are required to recite 2,000 individual phrases over the course of three hours, but they are rewarded generously in cold, hard cash. Adults earn 27 pounds ($36). Kids under 16 earn slightly less, 20 pounds ($26), but they read from a shorter, 45-minute script of 500 phrases.

Google appears to be focusing on one accent in particular: that of the Scottish variety. It’s a relatively tough inflection to nail, according to Quartz — its peculiar cadence frequently trips up voice assistants from Now to Apple’s Siri on the iPhone and iPad.

The training sessions are relatively straightforward. Participants who spoke to The Verge — a diverse bunch with accents from “the U.K.” and “America” in addition to more exotic dialects, including “Indian” and “Chinese-accented English” — reported being directed to a mobile onboarding webpage. After tapping a “record” icon on that page, phrases appeared in sequence.

Some snippets referenced Google, apparently — “OK Google,” and “Hey, Google” — while others included brand names, toys, video games, movie titles, and YouTube channel names. And still others ran the gamut: queries from Google searches like “How to make a birthday cake”; idioms like “Hey Google, get cold feet,” and even trivia questions (“Presidents in order”).

Samples, once collected, are processed by Appen’s in-house team. Company chief Mark Brayan, who spoke to The Verge, broke down the workflow: employees analyze recordings from “around the world” in 130 languages, distilling sentences down into their grammatical fundamentals. In a subsequent process Appen calls “decoration,” the linguists make contextual annotations, noting such details as the environment in which the recordings were made (outdoors, for instance, or in a crowded hallway) and the device used to make them.

It’s an arduous undertaking, according to Brayan. Minor improvements require massive quantities of data and analysis. “To go from understanding 95 percent of words to 99 percent, the recognizer has to digest infrequently used words, of which there are millions,” Brayan told The Verge. And “unusual” terms like esoteric product names are even more problematic — Appen must account not only for familiar pronunciations of such words, but unique pronunciations of them, too. “One of the big challenges is what we call named entity recognition,” Brayan said. “That’s brand names, product names, individual names, and so on. So if you’re launching in Canada, for example, you need not only the French language but also French-accented Canadian English.”

The ideal end result? Leaps and bounds in voice recognition. Marsal Gavalda, head of machine intelligence at Yik Yak, said that historically, the capabilities of speech recognition systems have been limited by the homogeneity of the data ingested. “[Such systems] have been trained from data collected mostly in universities, and mostly from the student population,” he told The Verge. He has a term for it: electronic imperialism. “The [diversity of voices] reflect the student population 30 years ago,” Gavalda said.

Already, the situation is improving… albeit marginally. Google misinterprets words in “tier 2” languages (the less popular languages to which companies like Google and Apple devote less attention) much less frequently than it once did. Over the past two years alone, the word error rate for Indonesian has decreased from 40 percent to 18 percent, Google’s chief of speech recognition Johan Schalkwyk told Fusion. But companies like Google have a long way to go: Schalkwyk said the company’s voice recognition engine needs at least 5,000 hours of voice data to understand a language “well.”
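
For readers unfamiliar with the metric, word error rate is commonly computed as the number of word-level substitutions, deletions, and insertions needed to turn a transcript into the reference text, divided by the number of words in the reference. The Python sketch below illustrates that standard definition only; it is not Google’s or Appen’s scoring code, and the function name and sample phrases are illustrative assumptions.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Illustrative sketch of the textbook definition; real evaluation
    pipelines usually normalize punctuation and numbers first.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from transcript
            insertion = curr[j - 1] + 1  # extra word in transcript
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# One wrong word out of six in the reference phrase.
print(word_error_rate("how to make a birthday cake",
                      "how to bake a birthday cake"))
```

By this measure, a drop from 40 percent to 18 percent means the recognizer goes from roughly two errors in every five reference words to fewer than one.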

Google, it seems, is going to need a lot more accented Redditors.

Kyle Wiggers
Former Digital Trends Contributor
Kyle Wiggers is a writer, Web designer, and podcaster with an acute interest in all things tech. When not reviewing gadgets…