Skip to main content
  1. Home
  2. Computing
  3. News

Google’s Gemini AI can now process and talk about audio files

Hey Gemini, give me a one-page summary of this two-hour lecture. Thanks!

Add as a preferred source on Google
A person using Google Gemini on the Google Pixel 9a.
Andy Boxall / Digital Trends

Google’s Gemini AI is multi-modal, which means it can process and generate files in various formats, ranging from text and images to videos. Though it can generate audio, so far, it has lacked the ability to process audio files uploaded by users. That finally changes, as Gemini now lets you feed audio files and talk about them.

What’s the big change?

The ability to upload audio files is now live in the Gemini mobile app and the web version, too. In the Gemini chat bubble, tap on the “+” icon and upload the audio clip by selecting the clip-shaped file upload icon. Oh, by the way, this feature is free for all Gemini users.

Recommended Videos

According to Google’s support page, you can upload audio clips of up to ten minutes duration. But if you pay for the Gemini AI Pro or Ultra bundles, you can upload audio files with a run time of up to 3 hours.

Now that you can upload audio to Gemini, it’s easy to create a podcast episode infographic summary with Canvas.https://t.co/QcZqWS8dkr pic.twitter.com/PB9gKPfVbR

— Steve Chipman (@SteveChipman) September 9, 2025

In case you’re curious about what other file formats you can feed to Gemini, here’s a quick rundown:

  • Up to 10 files in one go, including ZIP files.
  • Video of up to 2GB in size. 5 minutes in length for free users, and 1 hour for paying customers.
  • One code folder, or one GitHub repository (up to 5,000 files / 100MB size)

A boon for the bibliophiles

Not everyone loves digging into an audiobook, podcast, or lecture recording. Sometimes, walls of text are where the real magic happens, or it’s where the cognitive comfort zone lies. If you count yourself among the folks who seek some aural liberation, this Gemini feature update is nothing short of a godsend. And yeah, audio support goes beyond the English language, as you can see in the post below.

You can now upload audio files into Gemini as well which means Gemini now supports all media types. pic.twitter.com/gxJxfnQ4kZ

— Saadh Jawwadh (@SaadhJawwadh) September 8, 2025

Now, whether it’s the summarization of a long lecture, or the need to extract only a few specific talking points from a podcast, Gemini will handle the audio and give you just what you want. You can ask it to write long reports, short briefs, or even convert it into the form of knowledge slides that you can export as images.

On the other end of the rope, we have the fantastic NotebookLM tool. It can turn your long text files into an engaging two-person audio podcast. If you prefer video overviews, it can do that, as well. And while at it, go and avail the free Gemini AI Pro offer that Google is offering to students in numerous countries, including the US.

Nadeem Sarwar
Nadeem is the Managing Editor at Digital Trends.
Apple’s historically high tax for RAM upgrades on Macs has now become absurd
Mac RAM upgrade prices have doubled amid the global memory crunch
MacBook Pro.

Apple’s Mac RAM upgrades were already expensive enough to raise eyebrows. After the company’s latest round of price hikes, some of them now look ridiculous.

Apple recently raised prices across its Mac and iPad lineup, along with other products, citing rising memory and storage costs. The supply crunch is real, but Mac buyers were paying steep premiums for RAM and SSD upgrades long before this jump. Recent MacBook Pro configuration screenshots shared by 9to5Mac show how much worse the upgrade path has become.

Read more
Windows 11 is getting a new Screen Tint mode, and your eyes might thank Microsoft
Users can apply custom color overlays to reduce screen intensity and visual fatigue.
Windows 11 on a laptop

Microsoft is testing a new accessibility feature for Windows 11 called Screen Tint, and it could be one of those small additions that make a surprisingly big difference. Instead of changing your display's color temperature like Night Light, Screen Tint applies a customizable color overlay across the entire screen, making bright displays easier on the eyes during long work or gaming sessions.

A softer screen for tired eyes

Read more
Apple’s looking at a politically radioactive fix for the memory crisis, and the US government isn’t happy about it
Apple blamed memory costs for your price hike. Its proposed solution involves a Pentagon blacklist.
Apple Mac Mini on a Desk

A few days ago, Apple announced an ugly mid-cycle price hike, blaming the worsening-by-the-day memory crisis. According to the Financial Times, the company is now lobbying the government for approval to buy memory chips from a Chinese company. 

The company in question is CXMT, a Chinese chipmaker that the Pentagon added to its Chinese Military Company blacklist for alleged ties to the Chinese army.

Read more