Skip to main content

Google's video AI was tricked into thinking a video about apes was about spaghetti

researchers trick google cloud video intelligence into misclassifying videos data center header
Google
While artificial intelligence is an incredibly important field that’s growing by leaps and bounds, perhaps its most interesting lessons concerns just how incredible the human brain is at performing certain functions. While computers might be better at performing math and looking dozens of chess moves into the future, they can’t yet compete with the human brain at figuring out things like a video’s topic.

A recent research project demonstrated just that fact by feeding videos to Google’s Cloud Video Intelligence API and seeing if it could determine exactly what a given video was about. Apparently, this seemingly simple task is a challenge for Google’s AI and points out the difficulty of creating automatic systems to categorize video, as Motherboard reports.

Recommended Videos

The research team in question works at the University of Washington, and the team used some trickery to see how smart the Google API really is. Currently in beta, the Google Cloud Video Intelligence API has one job, which was to “make video searchable” and to annotate video to make it easier for humans to search through them.

In their tests, the researchers injected extraneous, and subliminal, images of a pasta bowl into a video featuring primatologist Jane Goodall and gorillas. The result was that the Google AI concluded that the video was actually about spaghetti and not the apes. Another example involved placing a picture of an Audi into a video about tigers, which caused the AI to conclude that the video was about cars.

University of Washington

Although it might sound somewhat comical, these mistakes point out a serious issue with the AI. As the researchers noted in their conclusion:

“However, we showed that the API has certain security weaknesses. Specifically, an adversary can insert an image, periodically and at a very low rate, into the video in a way that all the generated shot labels are about the inserted image. Such vulnerability seriously undermines the applicability of the API in adversarial environments.”

Even worse, according to the researchers, “Furthermore, an adversary can bypass a video filtering system by inserting a benign image into a video with illegal contents.” The fact that the process of doing so requires no specialized knowledge about the AI’s machine learning algorithms or about video annotation in general was particularly disturbing.

Ultimately, what the research points out is that AI has a long way to go before it can match the human brain in determining things like a video’s topic. Inserting subliminal messages into video has been known for a long time to affect the human psyche, but at least a human wouldn’t think that a video about apes is actually about spaghetti — the human would probably just start craving pasta instead.

Mark Coppock
Mark Coppock is a Freelance Writer at Digital Trends covering primarily laptop and other computing technologies. He has…
Runway brings precise camera controls to AI videos
Gen-3 alpha advanced camera controls

Content creators will have more control over the look and feel of their AI-generated videos thanks to a new feature set coming to Runway's Gen-3 Alpha model.

Advanced Camera Control is rolling out on Gen-3 Alpha Turbo starting today, the company announced via a post on X (formerly Twitter).

Read more
ChatGPT Search is here to battle both Google and Perplexity
The ChatGPT Search icon on the prompt window

ChatGPT is receiving its second new search feature of the week, the company announced on Thursday. Dubbed ChatGPT Search, this tool will deliver real-time data from the internet in response to your chat prompts.

ChatGPT Search appears to be both OpenAI's answer to Perplexity and a shot across Google's bow.

Read more
Google expands AI Overviews to over 100 more countries
AI Overviews being shown in Google Search.

Google's AI Overview is coming to a search results page near you, whether you want it to or not. The company announced on Monday that it is expanding the AI feature to more than 100 countries around the world.

Google debuted AI Overview, which uses generative AI to summarize the key points of your search topic and display that information at the top of the results page, to mixed reviews in May before subsequently expanding the program in August. Monday's roll-out sees the feature made available in seven languages — English, Hindi, Indonesian, Japanese, Korean, Portuguese, and Spanish — to users in more than 100 nations (you can find a full list of covered countries here)

Read more