No human touch needed: Computers learn new way to recognize natural sound

With speech recognition getting better every day, it’s remarkable how well Siri, Alexa, and Cortana can parse human speech. But what about cheering crowds or crashing waves? Can our AI personal assistants tell the difference between those? Probably not. Recognizing sounds, particularly natural sounds, is actually very difficult for computers.

From smartphones to our most advanced supercomputers, computers recognize images and speech fairly well across the board. Natural sounds have been an exception, but that may be about to change. Scientists at the Massachusetts Institute of Technology might have found a solution.

According to Phys.Org, a group of researchers at MIT’s Computer Science and Artificial Intelligence Laboratory, or CSAIL, has pioneered a new way to teach computers to recognize sound: by cutting out the middleman.

Normally, humans need to annotate vast databases of sounds by hand to teach computers how to recognize and identify particular sounds. This new method, however, circumvents the human element by using video.

“Computer vision has gotten so good that we can transfer it to other domains. We’re capitalizing on the natural synchronization between vision and sound. We scale up with tons of unlabeled video to understand sound,” Carl Vondrick, an MIT graduate student in electrical engineering, told Phys.Org.

The new system essentially leverages a computer’s ability to recognize visual information and ties that recognition to the sounds the videos contain. Think of it this way: the computers recognize objects in the video and look for correlations between the appearance of those objects and the sound information they’re processing.
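
To make the idea concrete, here is a rough sketch of how labels from a vision model could supervise a sound model. It is an illustration only, not the CSAIL team's code: the article does not describe the training objective, so matching the two models' class probabilities with a KL-divergence loss is an assumption, and the feature values, class names, and function names below are all hypothetical.

```python
import math

def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(teacher, student, eps=1e-12):
    """How far the student's guess is from the teacher's (0 means identical)."""
    return sum(t * math.log((t + eps) / (s + eps)) for t, s in zip(teacher, student))

def predict(weights, audio):
    """Linear sound classifier: one weight row per class."""
    return softmax([sum(w * a for w, a in zip(row, audio)) for row in weights])

def train_student(audio_clips, teacher_probs, num_classes, lr=0.5, epochs=200):
    """Fit the sound model to the vision model's outputs. No human labels are used."""
    dim = len(audio_clips[0])
    weights = [[0.0] * dim for _ in range(num_classes)]
    for _ in range(epochs):
        for audio, target in zip(audio_clips, teacher_probs):
            guess = predict(weights, audio)
            # Gradient of KL(target || guess) with respect to each class score
            # is simply (guess - target).
            for k in range(num_classes):
                grad = guess[k] - target[k]
                for j in range(dim):
                    weights[k][j] -= lr * grad * audio[j]
    return weights

# Hypothetical toy data: two audio features per clip, two classes
# (index 0 = "ocean", index 1 = "crowd"). The teacher probabilities stand in
# for what an image recognizer might say about each clip's video frames.
audio_clips = [[1.0, 0.0], [0.0, 1.0]]
teacher_probs = [[0.9, 0.1], [0.2, 0.8]]

weights = train_student(audio_clips, teacher_probs, num_classes=2)
for audio, target in zip(audio_clips, teacher_probs):
    guess = predict(weights, audio)
    print([round(p, 3) for p in guess], "KL:", round(kl_divergence(target, guess), 6))
```

The sound model here never sees a hand-written label. Its only training signal is what the vision model reported for the matching video, which is the "natural synchronization between vision and sound" Vondrick describes.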

It’s a quicker, easier, and more accurate way to train computers to recognize sounds. According to a research paper, it’s between 13 and 15 percent more accurate than the previous method of hand-annotating massive libraries of sounds and feeding that information into a computer.

The CSAIL research team’s full conclusions will be presented at the Neural Information Processing Systems conference in early December.

Jayce Wagner
Former Digital Trends Contributor
A staff writer for the Computing section, Jayce covers a little bit of everything -- hardware, gaming, and occasionally VR.