Skip to main content

Microsoft hits another milestone in speech-recognition software accuracy

If you’re fed up with chatbots mishearing you, Microsoft is making machine ears a little more attentive. Researchers from the tech giant have achieved an impressively low error rate for speech-recognition software — just 6.3 percent, according to a paper published last week. The company hopes this milestone will help refine and personalize its AI assistant, Cortana, and features like Skype Translator.

The newest error rate of Microsoft’s conversational speech-recognition system is regarded as the lowest in the industry, according to Xuedong Huang, Microsoft’s chief speech scientist. IBM meanwhile recently announced an error rate of 6.6 percent, bettering its 6.9 percent error rate from April and the 8 percent milestone that the company achieved last year. Two decades ago, the lowest error rate of a published system was more than 43 percent, Microsoft notes in a blog post.

Recommended Videos

In artificial intelligence development, researchers often model machines of off humans by equipping the systems with the abilities to speak, see, and hear. Although Microsoft’s achievement is just 0.3 percent below IBM’s, incremental advancements like these bring machines closer to human-like capabilities. In speech recognition, the human error rate is around 4 percent, according to IBM.

“This new milestone benefited from a wide range of new technologies developed by the AI community from many different organizations over the past 20 years,” Microsoft’s Huang said.

A few of these technologies include biologically inspired systems called neural networks, a training technique known as deep learning, and the adoption of graphic processing units (GPUs) to process algorithms. Over the past two years, neural networks and deep learning have enabled AI researchers to develop and train systems in advanced speech recognition, image recognition, and natural language processing. Just last year, Microsoft created image-recognition software that outperformed humans.

Although initially designed for computer graphics, GPUs are now regularly used to process sophisticated algorithms. Cortana can process up to 10 times more data using GPUs than previous methods, according to Microsoft.

With steady advances like these, repeating your question to a chatbot may be a thing of the past.

Dyllan Furness
Former Digital Trends Contributor
Dyllan Furness is a freelance writer from Florida. He covers strange science and emerging tech for Digital Trends, focusing…
A new test shows Microsoft Recall’s continued security problems
Recall screenshot.

Microsoft is currently previewing its latest version of Recall to Windows Insiders on Snapdragon-, Intel-, and AMD-based Copilot+ PCs -- and the topic on most users' minds is security. The company updated its security and privacy architecture for the feature in September, but, according to tests run by Tom's Hardware, it still might not be good enough.

The new version of Recall includes a sensitive information filter that's supposed to detect when there's information like credit card numbers and Social Security numbers on the screen. If it detects them, it will avoid taking a screenshot. When Tom's Hardware put this filter to the test, however, it failed in a number of situations.

Read more
Microsoft’s Copilot Vision arrives to surf the web with select users
The Copilot logo

Microsoft's new Copilot Vision feature that can “see what you see, and hear what you hear” while you navigate the internet is finally being made available, though only to a limited number of Copilot Pro subscribers in the U.S.

"Starting today, we are introducing an experience where – with your permission – Copilot can now understand the full context of what you’re doing online," according to a Microsoft blog post. "When you choose to enable Copilot Vision, it sees the page you're on, it reads along with you, and you can talk through the problem you're facing together."

Read more
With Copilot Actions, Microsoft brings AI agents to Outlook, Teams, and more
microsoft expanding ai agents 365 copilot early 2025 actions2

Microsoft plans to roll out a slew of new features for its business-facing 365 Copilot products starting early next year, the company announced during its Microsoft Ignite 2024 event on Tuesday.

365 Copilot, which was rebranded from just Copilot in September, enables businesses to incorporate Microsoft Copilot generative AI into its Microsoft 365 family of apps (as well as in Teams) for a $30/employee/month subscription.

Read more