Skip to main content

Meta is building a space-age ‘universal language translator’

When you think of tools infused with artificial intelligence (AI) these days, it’s natural for ChatGPT and Bing Chat to spring to mind. But Facebook owner Meta wants to change that with SeamlessM4T, an AI-powered “universal language translator” that could instantly convert any language in the world into whatever output you want.

Meta describes SeamlessM4T as “the first all-in-one multilingual multimodal AI translation and transcription model.” That’s quite a mouthful, but in simple terms, it means it can convert languages in a range of different ways, such as taking speech audio and switching it into text in a different tongue.

A silhouetted person holds a smartphone displaying the Facebook logo. They are standing in front of a sign showing the Meta logo.
SOPA Images / Getty Images

According to Meta, the tool’s speech recognition and translation features can work in a few different ways:

• Speech recognition for nearly 100 languages
• Speech-to-text translation for nearly 100 input and output languages
• Speech-to-speech translation for nearly 100 input languages and 36 output languages (including English)
• Text-to-text translation for nearly 100 languages
• Text-to-speech translation for nearly 100 input languages and 35 output languages (including English)

Meta says this will “allow people to communicate effortlessly through speech and text across different languages.”

Coming soon to Facebook?

Meta's AI translation tool SeamlessM4T converts Spanish-language input into Vietnamese, both in text and audio forms.

SeamlessM4T is being released under a research license, and Meta states it’s doing this to “allow researchers and developers to build on this work.” As well as that, the metadata of the dataset that was used to train the translation model, called SeamlessAlign, is also being publicly released. This consists of “270,000 hours of mined speech and text alignments,” Meta claims.

However, Meta did not make it clear where these 270,000 hours of “mined speech” have been sourced. Concerns have been raised over the privacy implications of Meta’s work on AI chatbots, while other AI tools have already been caught stealing protected work. There will no doubt be fears that Meta could have done something similar when it trained SeamlessM4T.

There are already other translation tools like Google Translate that can convert text to text and speech to text, but Meta says its own efforts are superior. SeamlessM4T, Meta argues, “reduces errors and delays, increasing the efficiency and quality of the translation process.”

Meta has not said whether the new tool will be integrated into its apps like Facebook and Instagram, but the company did reveal that it aimed to “explore how this foundational model can enable new communication capabilities” in the future. We’ll have to see what that entails.

Editors' Recommendations

Alex Blake
In ancient times, people like Alex would have been shunned for their nerdy ways and strange opinions on cheese. Today, he…
OpenAI building new team to stop superintelligent AI going rogue
A digital brain on a computer interface.

If the individuals who are at the very forefront of artificial intelligence technology are commenting about the potentially catastrophic effects of highly intelligent AI systems, then it's probably wise to sit up and take notice.

Just a couple of months ago, Geoffrey Hinton, a man considered one of the “godfathers” of AI for his pioneering work in the field, said that the technology's rapid pace of development meant that it was “not inconceivable” that superintelligent AI -- considered as being superior to the human mind -- could end up wiping out humanity.

Read more
Google tells workers to be wary of AI chatbots
ChatGPT versus Google on smartphones.

Alphabet has told its employees not to enter confidential information into Bard, the generative AI chatbot created and operated by Google, which Alphabet owns.

The company’s warning also extends to other chatbots, such as Microsoft-backed ChatGPT from OpenAI, Reuters reported on Thursday.

Read more
What is MusicLM? Check out Google’s text-to-music AI
MusicLM prompt.

MusicLM is one of Google's experimental artificial intelligence (AI) tools that uses natural language models to interpret your instructions. But instead of chatting to you like ChatGPT, or helping you search, like Bing Chat, MusicLM is an AI that takes what you tell it and creates music based on it.

You'll need to join the waitlist to get access, but once you're in, you can start making music with Google's latest AI tool.

Read more