Skip to main content

TiVo takes personalization to the next level with voice ID

The ability to use your voice to interact with smart TVs and streaming media devices is rapidly becoming standard. What was once an exotic and expensive feature, microphone buttons are appearing on remote controls for devices that cost as little as $30. Some of the newest smart TVs, with built-in far-field mics, don’t require a button at all. But as convenient as voice commands are, they’re also kind of dumb. All systems attempt to understand what was said, but few — if any — try to understand who said it, and that creates a big opportunity.

Today, TiVo and Pindrop, a voice authentication company, are taking the first step toward voice commands that understand who is doing the talking, with a new partnership that will see Pindrop’s voice ID technology added to TiVo’s voice-enabled devices.  Pindrop is also opening up its voice authentication platform so that any third-party developer can take advantage of the same capability.

Recommended Videos

But what exactly does this new technology do, and how does it work? Pindrop CEO, Vijay Balasubramaniyan, gave Digital Trends an overview.

Being able to ID someone using their voice has a lot of advantages (some of which we’ll discuss later) but in the context of a streaming media platform like TiVo, the biggest benefit is helping users get to the movies, TV shows, and other content that they’re most likely interested in seeing.

Many platforms, TiVo included, already do a pretty good job of diving into the catalogs of your subscribed services like Netflix, Amazon Prime Video, or Disney+, and showing you recommended content. Some of these platforms may even create a “continue watching” section that lets you resume a paused show or move on to the next episode in a season. But these options and recommendations are, in a sense, generic. They’re based on the activity that has taken place on one specific device, by all users of that device. Depending on the size of your household, that could be a lot of people.

Individual services have already recognized this as a roadblock to accurate personalization, which is why so many now include the ability to create multiple user profiles. That system works well enough when you’re navigating using the remote’s keypad, but it leaves voice-driven interaction without any ability to declare who’s watching.

This is where Pindrop comes into the picture. Pindrop’s expertise comes from developing interactive voice response (IVR) services for Fortune 500 companies like banks, insurance companies, and shipping companies. Its technology analyzes more than 250 specific biological and behavioral voice characteristics, like the frequency and harmonics of speech as well as the patterns of intonation, rhythm, and style, which it then uses to create the equivalent of a voice fingerprint.

It’s similar to voice profile systems used by Google and Amazon for their respective voice assistants, but unlike these platforms, Pindrop’s technology can work with any device.

When using a Pindrop-enabled device like a TiVo, voice commands are no longer just verbal replacements for button-presses, they’re also a way to understand who’s using the device. The question, “What should I watch?” can trigger a set of content suggestions tailored to the speaker, not the household. If another member of the household says the exact same thing, they’ll get completely different results — no profile switching required.

Pindrop’s Voice ID system is sophisticated enough that its accuracy isn’t hampered by factors that might otherwise confuse a voice-recognition system like background noises, changes in a speaker’s voice caused by sickness, aging, or even mask-wearing.

There’s even a section of Pindrop’s algorithm that can identify a speaker’s tone and emotion. When answering an open-ended question like, “What should I watch?” a person’s mood could easily affect the content that’s offered in the results.

Amazingly — and somewhat frighteningly — Pindrop can also identify multiple voices at once. If one person in the room asks for content suggestions and the system hears other voices in the background that it recognizes, it can pass that info along to the TiVo platform, letting TiVo make recommendations based on the youngest person in the room (if it chose to do so).

All of this raises several security and privacy questions, but Pindrop claims that the way its technology works should alleviate any concerns. First, the system is opt-in. Before a platform like TiVo uses Pindrop to voice ID users, those users would have to specifically agree to participate. Second, Pindrop says that its voice IDs aren’t associated with any personally identifiable information and that the voice ID data doesn’t actually contain samples of someone’s voice.

Whether or not this voice ID system proves popular with users of smart TVs and streaming media devices, Pindrop sees this use of its technology as a very early step in a much bigger vision for voice authentication.

Its ambition is to become the voice authentication system for all voice-enabled products, from smartphones to driverless cars. Ultimately, it wants to give people a centrally-managed permission-based platform, where you can grant and revoke access to devices and services in much the same way that Google currently lets you use your Google account to sign-in on phones, computers, and streaming devices.

Once you realize the potential of such a wide-ranging voice authentication system, it’s amazing to think that Google, Amazon, and Apple — with their many years of both identity management and voice recognition services — haven’t yet planted their respective flags on this territory.

For now, the TiVo implementation of Pindrop’s technology will serve as a useful test. How well does it work, and how seamless can it make voice-based interactions? We’ll let you know as soon as we get a chance to try it out. TiVo is expected to make Pindrop personalization a feature of its platform in the second quarter of 2021.

Simon Cohen
Simon Cohen is a contributing editor to Digital Trends' Audio/Video section, where he obsesses over the latest wireless…
YouTube starts using AI to make ads annoyingly difficult to avoid
YouTube app in iOS app gallery.

YouTube is relying on AI in its latest crusade against seekers of an ad-free video-watching experience. The company recently announced plans to use AI models to make ads more persuasive by strategically placing them within the video.

At its Brandcast 2025 event in New York, YouTube revealed it will deploy Google's Gemini AI to analyze videos to optimize placement of ads. The AI will be used to identify key moments or "Peak Points" in the video where viewers are most likely to be engaged and too invested to stop watching it in order to avoid the ad.

Read more
Qobuz Connect launches with Denon, Marantz, and more than 50 other hi-fi brands
Qobuz Connect.

Fans of Qobuz, the France-based subscription music service that specializes in lossless, hi-res audio, now have a new way of streaming their favorite tracks to their favorite devices. Qobuz Connect has been added to the company's iOS, Android, macOS, and Windows apps, letting them control compatible streaming speakers and components from a big list of hi-fi brands.

Most folks will recognize names like Denon and Marantz -- every device made by these brands that work with the HEOS streaming software are now Qobuz Connect compatible -- but the list also includes niche hi-fi players, such as Rotel, Nagra, HiFi Rose, Lindemann, Wiim, and Volumio. Here's the entire list.

Read more
Sony’s WH-1000XM6 debut with better ANC, a folding hinge, and a higher price
Sony WH-1000XM6.

After multiple leaks, there wasn't much left for Sony to announce, but nonetheless, today it's official: Sony's WH-1000XM6 are here and as expected, they feature a generous number of upgrades from the WH-1000XM5, including a new metal folding hinge that's designed to be both more durable and more flexible. The new XM6 comes in three colors: black, midnight blue, and Sony's strangely named platinum silver (which is actually an off-white, sandy color seen here). They're priced at $450 in the U.S., a $50 increase over the XM5 that appears partially tariff-driven, given their $599 Canadian dollar price (about $428 U.S.). They're available starting today at major online retailers and sony.com.

Sony took a bit of criticism for it fold-flat design of the WH-1000XM5, which some viewed as less travel friendly. The XM6 is a clever response to those concerns -- Sony has kept the XM5's sleek lines while adding in that missing second degree of motion in the hinge. It has also reduced the size of the travel case and given it a quick release magnetic closure instead of the usual zipper.

Read more