Skip to main content

Trint text-to-speech web app takes the pain out of transcription

trint groundbreaking text to speech 68710765 l
Antonio Guillem/123RF
Sometimes stories about breakthrough artificial intelligence systems make us worry about the prospect of robots snapping up good jobs. Other times they’re automating a job so time-consuming and mind-numbingly dull that we really couldn’t be happier about the imminent machine takeover.

Guess which one of these categories “transcribing lengthy passages of audio” falls into?

The groundbreaking tool in question is a web app called Trint, a portmanteau of “transcription” and “interview,” which promises to listen to long blocks of text and transcribe it almost flawlessly. It can even do neat things like distinguishing between multiple people in a recording, or letting you assign time code to your transcription for later reference.

“We use the best automated speech-to-text you will find,” CEO and co-founder Jeffrey Kofman told Digital Trends. “With reasonably clear audio we can return transcripts that are 95-98 percent accurate. When you take very clear speakers like Trump or Obama, our automated transcripts are often 99 percent accurate. People tell us they think what we’ve built is magic.”

Image used with permission by copyright holder

As a former journalist, Kofman appreciates that professionals such as researchers, lawyers, and others need to know that they can trust their transcripts. As a result, Trint marries two pieces of software to allow for a toolset that not only carries out automated speech-to-text, but also provides a simple, intuitive way to quickly search, verify and if necessary correct the output.

As such, the software comprises both a text editor and audio/video player, which lets users check the finished product like a karaoke track, with both video and text on screen at the same time. If they spot an error, it’s incredibly easy to correct it.

Trint can transcribe in North American, British and Australian English, along with 12 other languages including French, German, Spanish, Italian, Portuguese, and Russian. Kofman notes that it’s no miracle worker — so you’ll need to provide decent audio to get a good output — but it’s amazing what it can do.

The service is currently available, priced at $10-15 per hour depending on the type of audio. If you want to check it out as a comparison to existing dictation tools, Trint offers a free 30-minute trial.

“In the coming months, you will see Trint begin to release a series of publishing tools and social media integrations that for the first time will make it easy to quickly and cost-effectively transcribe and share recorded content, and instantly make it searchable on Google,” Kofman said.

Editors' Recommendations

Luke Dormehl
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…
This AI cloned my voice using just three minutes of audio
acapela group voice cloning ad

There's a scene in Mission Impossible 3 that you might recall. In it, our hero Ethan Hunt (Tom Cruise) tackles the movie's villain, holds him at gunpoint, and forces him to read a bizarre series of sentences aloud.

"The pleasure of Busby's company is what I most enjoy," he reluctantly reads. "He put a tack on Miss Yancy's chair, and she called him a horrible boy. At the end of the month, he was flinging two kittens across the width of the room ..."

Read more
Digital Trends’ Top Tech of CES 2023 Awards
Best of CES 2023 Awards Our Top Tech from the Show Feature

Let there be no doubt: CES isn’t just alive in 2023; it’s thriving. Take one glance at the taxi gridlock outside the Las Vegas Convention Center and it’s evident that two quiet COVID years didn’t kill the world’s desire for an overcrowded in-person tech extravaganza -- they just built up a ravenous demand.

From VR to AI, eVTOLs and QD-OLED, the acronyms were flying and fresh technologies populated every corner of the show floor, and even the parking lot. So naturally, we poked, prodded, and tried on everything we could. They weren’t all revolutionary. But they didn’t have to be. We’ve watched enough waves of “game-changing” technologies that never quite arrive to know that sometimes it’s the little tweaks that really count.

Read more
Digital Trends’ Tech For Change CES 2023 Awards
Digital Trends CES 2023 Tech For Change Award Winners Feature

CES is more than just a neon-drenched show-and-tell session for the world’s biggest tech manufacturers. More and more, it’s also a place where companies showcase innovations that could truly make the world a better place — and at CES 2023, this type of tech was on full display. We saw everything from accessibility-minded PS5 controllers to pedal-powered smart desks. But of all the amazing innovations on display this year, these three impressed us the most:

Samsung's Relumino Mode
Across the globe, roughly 300 million people suffer from moderate to severe vision loss, and generally speaking, most TVs don’t take that into account. So in an effort to make television more accessible and enjoyable for those millions of people suffering from impaired vision, Samsung is adding a new picture mode to many of its new TVs.
[CES 2023] Relumino Mode: Innovation for every need | Samsung
Relumino Mode, as it’s called, works by adding a bunch of different visual filters to the picture simultaneously. Outlines of people and objects on screen are highlighted, the contrast and brightness of the overall picture are cranked up, and extra sharpness is applied to everything. The resulting video would likely look strange to people with normal vision, but for folks with low vision, it should look clearer and closer to "normal" than it otherwise would.
Excitingly, since Relumino Mode is ultimately just a clever software trick, this technology could theoretically be pushed out via a software update and installed on millions of existing Samsung TVs -- not just new and recently purchased ones.

Read more