Skip to main content

AI headphones driven by Apple M2 can translate multiple speakers at once

Sony WH-1000XM4
Riley Young / Digital Trends

Google’s Pixel Buds wireless earbuds have offered a fantastic real-time translation facility for a while now. Over the past few years, brands such as Timkettle have offered similar earbuds for business customers. However, all these solutions can only handle one audio stream at once for translation. 

The folks over at the University of Washington (UW) have developed something truly remarkable in the form of AI-driven headphones that can translate the voice of multiple speakers at once. Think of it as a polyglot in a crowded bar, able to understand the speech of people around him, speaking in different languages, all at once. 

Recommended Videos

The team is referring to their innovation as a Spatial Speech Translation, and it comes to life courtesy of binaural headphones. For the unaware, binaural audio tries to simulate sound effects just the way human ears perceive them naturally. To record them, mics are placed on a dummy head, apart at the same distance as human ears on each side. 

The approach is crucial because our ears don’t only hear sound, but they also help us gauge the direction of its origin. The overarching goal is to produce a natural soundstage with a stereo effect that can provide a live concert-like feel. Or, in the modern context, spatial listening

The work comes courtesy of a team led by Professor Shyam Gollakota, whose prolific repertoire includes apps that can put underwater GPS on smartwatches, turning beetles into photographers, brain implants that can interact with electronics, a mobile app that can hear infection, and more. 

How does multi-speaker translation work?

“For the first time, we’ve preserved the sound of each person’s voice and the direction it’s coming from,” explains Gollakota, currently a professor at the institute’s Paul G. Allen School of Computer Science & Engineering.

The team likens their stack to a radar, as it kicks into action by identifying the number of speakers in the surroundings, and updating that number in real-time as people move in and out of the listening range. The whole approach works on-device and doesn’t involve sending user voice streams to a cloud server for translation. Yay, privacy!

In addition to speech translation, the kit also “maintains the expressive qualities and volume of each speaker’s voice.” Morever, directional and audio intensity adjustments are made as the speaker moves across the room. Interestingly, Apple is also said to be developing a system that allows the AirPods to translate audio in real-time.

How does it all come to life?

The UW team tested the AI headphones’ translation capabilities in nearly a dozen outdoor and indoor settings. As far as performance goes, the system can take, process, and produce translated audio within 2-4 seconds. Test participants appeared to prefer a delay worth 3-4 seconds, but the team is working to speed up the translation pipeline.

So far, the team has only tested Spanish, German, and French language translations, but they’re hopeful of adding more to the pool. Technically, they condensed blind source separation, localization, real-time expressive translation, and binaural rendering into a single flow, which is quite an impressive feat.

As far as the system goes, the team developed a speech translation model capable of running in real-time on an Apple M2 silicon, achieving real-time inference. Audio duties were handled by a pair of Sony’s noise-cancelling WH-1000XM4 headphones and a Sonic Presence SP15C binaural USB mic.

And here’s the best part. “The code for the proof-of-concept device is available for others to build on,” says the institution’s press release. That means the scientific and open-source tinkering community can learn and base more advanced projects on the foundations laid out by the UW team. 

Nadeem Sarwar
Nadeem is a tech and science journalist who started reading about cool smartphone tech out of curiosity and soon started…
Walmart reveals the price for its Chromecast replacement, and it’s a bargain
The Onn 4K Pro remote (left) and Google TV Streamer remote.

With the retirement of the Google Chromecast line last year, users who want an easy way to get streaming features on their non-smart TV have had to look elsewhere. Thankfully alternatives like the Onn 4K streaming device from Walmart have been available to meet that need, functioning as a worthy replacement for the Chromecast.

Now, a product page on Walmart's website confirms that a new version of the streaming device, the Onn 4K Plus, is on its way. Spotted by 9to5Google, the listing reveals that the device will sell for just $30, which is $20 cheaper than the typical price of a 4K Chromecast, and like the Chromecast it has a small footprint and comes with a remote.

Read more
I saw the future of AI on Netflix. It skips hype and finds a purpose
AI features in Netflix on TV.

When you think of AI, names like Google, Microsoft, and OpenAI pop up in your mind. Netflix, the world’s biggest streaming platform, doesn’t quite sound like the right platform where you would expect something like a generative AI chatbot — having gobbled up the entire world’s knowledge — to show up. 

After all, you log on to Netflix for watching films and TV shows. Maybe, a few short clips. Or play games, even. Yet, Netflix has made a historically deep bet on tools such as machine learning in a variety of ways, and especially to fine-tune its recommendation algorithm. 

Read more
Save 22% on the UE Wonderboom 4 speaker when you purchase today
A man and woman using the UE Wonderboom 4.

Now that summer is just around the corner, it’s time to pack that surfboard and hit the beach with all your best pals! And what better way to reign in the good vibes than with your favorite playlist? Sounds like you should be looking at one of the best Bluetooth speakers on the market!

Fortunately, the Ultimate Ears Wonderboom 4 Bluetooth Speaker is on sale this week for only $78, a $22 discount from its $100 retail price. Purchase at Amazon, Best Buy, and Target to take advantage of this offer. 

Read more