
Descriptive Camera concept defies the idea that a picture is worth a thousand words

Image used with permission by copyright holder

The world of concept gadgets is a weird one – and that’s why we love it. But every once in a while, this strange and complicated place features a few ideas that leave us scratching our heads. The Descriptive Camera, created by Matt Richardson, does precisely that.

Instead of taking photos and logging metadata, this camera actually produces that metadata and translates it into human descriptions of the scene. For example, if you take a picture of an old chair, the result might read something like “This is a chair that looks worn. It seems to be old. It needs to be fixed.”

Now the impetus for this product might not be clear, but the technology that powers it is certainly remarkable. The camera uses Amazon’s Mechanical Turk API, which allows developers to outsource this metadata to actual people (this being called a Human Intelligence Task) who are paid to read it and translate it into language readable by humans (interested? You can sign up to be a Mechanical Turk Worker here). There’s also an option called “accomplice mode,” where the camera instant messages a person with the picture and then receives the description. This is faster and cheaper, but the results aren’t quite as high quality.

As it currently exists, this means you snap a picture, the photo is sent in for translation, and then you wait. You have to pay for the Human Intelligence Task service, which costs $1.25 each time and returns the results within 3-6 minutes.
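For the curious, here is a minimal Python sketch of that snap, send, and wait loop. It is not Richardson's code and makes no real Mechanical Turk calls: `describe_photo`, `estimate_session`, and the `ask_human` callback are hypothetical names invented for illustration, and the only real figures are the $1.25 fee and the 3-6 minute turnaround quoted above.

```python
from dataclasses import dataclass
from typing import Callable

HIT_PRICE_USD = 1.25      # per-photo fee quoted in the article
WAIT_MINUTES = (3, 6)     # turnaround range quoted in the article


@dataclass
class Description:
    text: str
    cost_usd: float


def describe_photo(
    photo: bytes,
    ask_human: Callable[[bytes], str],
    fee_usd: float = HIT_PRICE_USD,
) -> Description:
    """Hand one photo to a human describer and return the text plus the fee.

    `ask_human` is a hypothetical stand-in for the real submission step,
    whether that is a Human Intelligence Task or an instant message in
    "accomplice mode". It is an assumption, not a real API.
    """
    return Description(text=ask_human(photo), cost_usd=fee_usd)


def estimate_session(num_photos: int) -> dict:
    """Estimate total fee and wait range if photos are described one at a time."""
    return {
        "cost_usd": num_photos * HIT_PRICE_USD,
        "min_wait_minutes": num_photos * WAIT_MINUTES[0],
        "max_wait_minutes": num_photos * WAIT_MINUTES[1],
    }


if __name__ == "__main__":
    # A fake worker that always returns the same description.
    fake_worker = lambda photo: "This is a chair that looks worn."
    result = describe_photo(b"raw image bytes", fake_worker)
    print(result.text, result.cost_usd)
    print(estimate_session(10))
```

By this rough math, a ten-shot outing would run $12.50 and somewhere between half an hour and an hour of waiting, assuming each photo is handled in turn.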


The Descriptive Camera is connected to the Internet via Ethernet and is powered by an external 5-volt source, but Richardson says his long-term vision is to create something that looks and functions like an actual digital camera and is wireless.

Still can’t get beyond why? Neither can we – the need for a written description of a scene instead of the image just doesn’t connect. But the use of the Mechanical Turk API is fairly interesting and a little bit magic, and many photo junkies would probably agree that a more reader-friendly format for our metadata would be a welcome change. It’d be great to see this product integrated with images to print or log image info (including ISO, aperture, shutter speed, etc.) that’s better written for humans.

For now, it exists as a concept without a ton of real-world application. That said, there’s something undeniably tempting about taking pictures and having them interpreted by someone else. Check out examples of how the Descriptive Camera works below. 

Image used with permission by copyright holder
Molly McHugh
Former Digital Trends Contributor
Before coming to Digital Trends, Molly worked as a freelance writer, occasional photographer, and general technical lackey…