Skip to main content

Carnegie Mellon taught a computer to read human body language in real time

Multi-Person Face/Body/Hand Keypoint Detection
Technology like the Microsoft Kinect camera can already carry out simple motion sensing and use this as a way of interfacing with software. However, researchers from Carnegie Mellon University have gone much further with the development of a new computer system that is capable of recognizing the body poses and movements of multiple people in real time — right down to someone’s facial expression, or the pose of an individual person’s fingers.

“The technology has the potential to unlock new ways for us to interact with machines, to play games, play musical instruments, or produce content,” Yaser Sheikh, an associate professor of robotics at Carnegie Mellon, told Digital Trends. “It will help us diagnose and treat behavioral conditions such as autism, depression, and dyslexia. It has the ability to create new monitoring systems for physical therapy and rehabilitation. It will allow us to build safer systems, such as self-driving cars and home robotics. Perhaps its most exciting potential — and the one that motivates me — is that machines would be able to enter our social spaces and become collaborative partners in our daily lives, instead of passive tools.”

While the technology was developed with the aid of a two-story dome embedded with 500 video cameras, called the Panoptic Studio, the technology can be used by anyone with a single camera and a laptop computer. With this setup, they can potentially carry out some extraordinary recognition tasks — such as monitoring every member of a sports team during a game or building self-driving car technology that might offer early warnings about pedestrians based on their body language.

The work is being presented at CVPR 2017, the Computer Vision and Pattern Recognition Conference, scheduled to take place later in July in Honolulu, Hawaii. However, the Carnegie Mellon team is not keeping it locked up as simply a research project. The team open-sourced it on Github, allowing anyone who wants to take advantage of the technology the means by which to access the code. Already, it is being used by various research groups and interest has been expressed from major commercial research and development labs — including in the automotive industry.

Sheikh does warn against misuse, though. “[As exciting as this kind of technology is, it also] tremendous negative potential in enabling broad-based surveillance and monitoring for specific behaviors,” he said. “There are still ways to go before that happens, of course, but our human community needs to start thinking about its implications.”

Luke Dormehl
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…
Digital Trends’ Tech For Change CES 2023 Awards
Digital Trends CES 2023 Tech For Change Award Winners Feature

CES is more than just a neon-drenched show-and-tell session for the world’s biggest tech manufacturers. More and more, it’s also a place where companies showcase innovations that could truly make the world a better place — and at CES 2023, this type of tech was on full display. We saw everything from accessibility-minded PS5 controllers to pedal-powered smart desks. But of all the amazing innovations on display this year, these three impressed us the most:

Samsung's Relumino Mode
Across the globe, roughly 300 million people suffer from moderate to severe vision loss, and generally speaking, most TVs don’t take that into account. So in an effort to make television more accessible and enjoyable for those millions of people suffering from impaired vision, Samsung is adding a new picture mode to many of its new TVs.
[CES 2023] Relumino Mode: Innovation for every need | Samsung
Relumino Mode, as it’s called, works by adding a bunch of different visual filters to the picture simultaneously. Outlines of people and objects on screen are highlighted, the contrast and brightness of the overall picture are cranked up, and extra sharpness is applied to everything. The resulting video would likely look strange to people with normal vision, but for folks with low vision, it should look clearer and closer to "normal" than it otherwise would.
Excitingly, since Relumino Mode is ultimately just a clever software trick, this technology could theoretically be pushed out via a software update and installed on millions of existing Samsung TVs -- not just new and recently purchased ones.

Read more
AI turned Breaking Bad into an anime — and it’s terrifying
Split image of Breaking Bad anime characters.

These days, it seems like there's nothing AI programs can't do. Thanks to advancements in artificial intelligence, deepfakes have done digital "face-offs" with Hollywood celebrities in films and TV shows, VFX artists can de-age actors almost instantly, and ChatGPT has learned how to write big-budget screenplays in the blink of an eye. Pretty soon, AI will probably decide who wins at the Oscars.

Within the past year, AI has also been used to generate beautiful works of art in seconds, creating a viral new trend and causing a boon for fan artists everywhere. TikTok user @cyborgism recently broke the internet by posting a clip featuring many AI-generated pictures of Breaking Bad. The theme here is that the characters are depicted as anime characters straight out of the 1980s, and the result is concerning to say the least. Depending on your viewpoint, Breaking Bad AI (my unofficial name for it) shows how technology can either threaten the integrity of original works of art or nurture artistic expression.
What if AI created Breaking Bad as a 1980s anime?
Playing over Metro Boomin's rap remix of the famous "I am the one who knocks" monologue, the video features images of the cast that range from shockingly realistic to full-on exaggerated. The clip currently has over 65,000 likes on TikTok alone, and many other users have shared their thoughts on the art. One user wrote, "Regardless of the repercussions on the entertainment industry, I can't wait for AI to be advanced enough to animate the whole show like this."

Read more
4 simple pieces of tech that helped me run my first marathon
Garmin Forerunner 955 Solar displaying pace information.

The fitness world is littered with opportunities to buy tech aimed at enhancing your physical performance. No matter your sport of choice or personal goals, there's a deep rabbit hole you can go down. It'll cost plenty of money, but the gains can be marginal -- and can honestly just be a distraction from what you should actually be focused on. Running is certainly susceptible to this.

A few months ago, I ran my first-ever marathon. It was an incredible accomplishment I had no idea I'd ever be able to reach, and it's now going to be the first of many I run in my lifetime. And despite my deep-rooted history in tech, and the endless opportunities for being baited into gearing myself up with every last product to help me get through the marathon, I went with a rather simple approach.

Read more