Baidu’s new A.I. can mimic your voice after listening to it for just one minute


We’re not in the business of writing regularly about “fake” news, but it’s hard not to be concerned about the kind of mimicry that technology is making possible. First, researchers developed deep learning-based artificial intelligence (A.I.) that can superimpose one person’s face onto another person’s body. Now, researchers at Chinese search giant Baidu have created an A.I. that they claim can learn to accurately mimic your voice after listening to it for less than a minute.

“From a technical perspective, this is an important breakthrough showing that a complicated generative modeling problem, namely speech synthesis, can be adapted to new cases by efficiently learning only from a few examples,” Leo Zou, a member of Baidu’s communications team, told Digital Trends. “Previously, it would take numerous examples for a model to learn. Now, it takes a fraction of what it used to.”

Baidu Research isn’t the first to try to create voice-replicating A.I. Last year, we covered a project called Lyrebird, which used neural networks to replicate voices, including those of President Donald Trump and former President Barack Obama, from a relatively small number of samples. Like Lyrebird’s work, Baidu’s speech synthesis technology doesn’t sound completely convincing, but it’s an impressive step forward, and it is way ahead of a lot of the robotic A.I. voice assistants that existed just a few years ago.

The work is built on Baidu’s text-to-speech synthesis system Deep Voice, which was trained on upwards of 800 hours of audio from a total of 2,400 speakers. It needs just 100 five-second clips of vocal training data to sound its best, but a version trained on only 10 five-second samples (about 50 seconds of audio) was able to trick a voice-recognition system more than 95 percent of the time.

“We see many great use cases or applications for this technology,” Zou said. “For example, voice cloning could help patients who lost their voices. This is also an important breakthrough in the direction of personalized human-machine interfaces. For example, a mom can easily configure an audiobook reader with her own voice. The method [additionally] allows creation of original digital content. Hundreds of characters in a video game would be able to have unique voices because of this technology. Another interesting application is speech-to-speech language translation, as the synthesizer can learn to mimic the speaker identity in another language.”

For a deeper dive into this subject, you can listen to a sample of the voices or read a paper describing the work.
