Skip to main content
  1. Home
  2. Audio / Video
  3. News

Meta’s new open-source AI tool helps you clean up noisy recordings just by typing

It supports text, visual, and time-based prompts for precise sound separation.

Add as a preferred source on Google
meta-ai-sam-audio
Meta

Cleaning up audio usually means scrubbing timelines and tweaking filters, but Meta thinks it should be as easy as describing the sound you want. The company has released a new open-source AI model called SAM Audio that can isolate almost any sound from a complex recording using simple text prompts.

Users can pull out specific noises like voices, instruments, or background sounds without digging through complicated editing software. The model is now available through Meta’s Segment Anything Playground that houses other prompt-based image and video editing tools.

🔉 Introducing SAM Audio, the first unified model that isolates any sound from complex audio mixtures using text, visual, or span prompts.

We’re sharing SAM Audio with the community, along with a perception encoder model, benchmarks and research papers, to empower others to… pic.twitter.com/FuMJyULmJR

— AI at Meta (@AIatMeta) December 16, 2025

Broadly speaking, SAM Audio is designed to understand what sound you want to work with and separate it cleanly from everything else. Meta says this opens the door to faster audio editing for use cases like music production, podcasting, film and television, accessibility tools, and research.

For example, a creator could isolate vocals from a band recording, remove traffic noise from a podcast, or delete a barking dog from an otherwise perfect recording, all by describing what they want the model to target.

How SAM Audio works

SAM Audio is a multimodal model that supports three different types of prompts. Users can describe a sound using text, click on a person or object in a video to visually identify the sound they want to isolate, or mark a time span where the sound first appears. These prompts can be used alone or combined, giving users fine-grained control over what gets separated.

Under the hood, the system relies on Meta’s Perception Encoder Audiovisual engine. It acts as the model’s ability to recognize and understand sounds before slicing them out of the mix.

Recommended Videos

To improve audio separation evaluation, Meta has also introduced SAM Audio-Bench, a benchmark for measuring how well models handle speech, music, and sound effects. It is accompanied by SAM Audio Judge, which evaluates how natural and accurate the separated audio sounds to human listeners, even without reference tracks to compare against.

Meta claims these evaluations show SAM Audio performs best when different prompt types are combined and can handle audio faster than real-time, even at scale.

That said, the model has clear limitations. It does not support audio-based prompts, cannot perform full separation without any prompting, and struggles with similar overlapping sounds, such as isolating a single voice from a choir.

Meta says it plans to improve these areas and is already exploring real-world applications, including accessibility work with hearing-aid makers and organizations supporting people with disabilities.

The launch of SAM Audio ties into Meta’s broader AI push. The company is improving voice clarity on its AI glasses for noisy environments, working toward next-generation mixed reality glasses expected to arrive in 2027, and developing a conversational AI that could rival ChatGPT, signaling a wider focus on AI models that understands sound, context, and interaction.

Manisha Priyadarshini
Manisha Priyadarshini is a tech and entertainment writer with over nine years of editorial experience.
LG C6H OLED Evo AI Review: The First Meaningful C-Series Upgrade in Years?
This one stays true to its roots, while delivering upgrades that revive the C-series as a worthwhy investment.
Electronics, Screen, Computer Hardware

Buy from Best Buy

The LG C-Series has long occupied a unique position in the TV market. For years, it has been the default recommendation for anyone looking for a premium OLED experience without stepping into flagship pricing territory. It consistently delivered the picture quality, gaming performance, and overall reliability that made it one of the safest OLED recommendations available.

Read more
Tidal lays down the rules for AI music. I wish Spotify and everyone else would follow
Tidal app showing on iPhone 15 Pro.

Every week, the AI music problem is getting increasingly hard to ignore, especially for streaming platforms. Deezer reported that 44% of all new music uploaded to its platform daily is now AI-generated; that's almost half the songs.

Spotify relabeled and tightened its AI policies last September, while Apple Music announced a tagging approach in March. However, the subscription-based artist-first music platform Tidal has done something none of them did. 

Read more
Netflix just got a whole lot more irritating if you share a screen in a household
Every profile will soon need its own email address, adding another hurdle for households that share a TV.
Netflix on TV couple watching

Netflix's password-sharing crackdown isn't over just yet. The streaming giant is now rolling out another change that could make shared household accounts a little more cumbersome, this time by asking every profile on an account to have its own email address. While the move isn't designed to stop families from sharing a subscription, it does add another layer of identity verification that many users probably weren't asking for.

Netflix wants every profile to have its own identity

Read more