Skip to main content
  1. Home
  2. Computing
  3. News

OpenAI needs just 15 seconds of audio for its AI to clone a voice

Add as a preferred source on Google

In recent years, the listening time required by a piece of AI to clone someone’s voice has been getting shorter and shorter.

It used to be minutes, now it’s just seconds.

Recommended Videos

OpenAI, the Microsoft-backed company behind the viral generative AI chatbot ChatGPT, recently revealed that its own voice-cloning technology requires just 15 seconds of audio material to reproduce someone’s voice.

In a post on its website, OpenAI shared a small-scale preview of a model called Voice Engine, which it’s been developing since late 2022.

Voice Engine works by feeding it a minimum of 15 seconds of spoken material. The user can then input text to create what OpenAI describes as “emotive and realistic” speech that “closely resembles the original speaker.”

OpenAI insists it is taking a “cautious and informed approach to a broader release due to the potential for synthetic voice misuse,” adding that it wants to “start a dialogue on the responsible deployment of synthetic voices, and how society can adapt to these new capabilities.”

It added: “Based on these conversations and the results of these small scale tests, we will make a more informed decision about whether and how to deploy this technology at scale.”

One of the misuses that OpenAI refers to is a scam that some criminals are already carrying out using similar technology that’s been publicly available for some time. It involves cloning a voice and then calling a friend or relative of that person to trick them into handing over cash via a bank transfer. There are also fears about how such technology might be used in the upcoming presidential election, an issue highlighted by a recent high-profile incident in which a robocall using a clone of President Joe Biden’s voice told people not to vote in January’s New Hampshire primary.

Another concern is how the rapidly improving technology will impact the livelihoods of voice actors who fear that they’ll be increasingly asked to sign over the rights to their voice so that AI can be used to create a synthetic version, with compensation for such a contract likely to be much lower than if the actor was asked to perform the job in person.

Looking at more positive deployments of the technology, OpenAI suggests that it could be used to provide reading assistance to non-readers and children using natural-sounding, emotive voices “representing a wider range of speakers than what’s possible with preset voices,” as well as instant translation of videos and podcasts, something that Spotify is already trialing.

It could also be used to help patients who are gradually losing their voice through illness to continue communicating using what sounds like their own voice.

OpenAI has some examples of the AI-generated audio and the reference audio on its website, and we’re sure you’ll agree that they’re pretty extraordinary.

Trevor Mogg
Contributing Editor
Not so many moons ago, Trevor moved from one tea-loving island nation that drives on the left (Britain) to another (Japan)…
A YouTuber 3D printed an entire outfit, but the comfort and cost are more complicated than you’d think
The 3D-printed outfit is real. Whether it's practical is a different conversation entirely.
Adult, Male, Man

YouTuber Matthew Trahan has made a career out of 3D printing increasingly unusual things. He has printed musical instruments, bedroom furniture, and, in one particularly memorable video, himself.

His latest project is a full outfit, from shirt to shoes, belt to glasses, because apparently nobody told him 3D printers are for creating engineering prototypes or structures that aren’t otherwise feasible, not for fashion week.

Read more
The memory crisis isn’t going to ease, and you will pay the price for it, says a research firm
Forty to 50% higher this quarter, 30 to 40% more next quarter, and no real relief until 2028. Plan accordingly.
RAM memory chips

If you were hoping the memory crisis was about to ease up, I have some bad news for you. It comes directly from Wall Street.

Your next smartphone, laptop, or tablet could cost even more, regardless of whether it has recently been subject to a price hike.

Read more
Apple’s next Mac Studio could get a new M5 Ultra chip and a cooler upgrade
The desktop workstation is tipped to receive an M5 Ultra this year, an M7 Ultra later, and a redesigned heat sink.
Apple Mac Studio Featured

Apple's Mac Studio may not be getting a fresh new look anytime soon, but it could be getting a meaningful upgrade where it matters most. According to Mark Gurman in the latest edition of his Power On newsletter, Apple is preparing an M5 Ultra-powered Mac Studio as early as this year, while an even more powerful M7 Ultra version is already on the company's roadmap for 2028. Interestingly, the report also claims Apple is redesigning one component most users will never see: the heat sink.

More power is coming, and Apple wants to keep it cool

Read more