
Google's video AI was tricked into thinking a video about apes was about spaghetti

While artificial intelligence is an incredibly important field that’s growing by leaps and bounds, perhaps its most interesting lesson concerns just how incredible the human brain is at performing certain functions. While computers might be better at performing math and looking dozens of chess moves into the future, they can’t yet compete with the human brain at figuring out things like a video’s topic.

A recent research project demonstrated just that by feeding videos to Google’s Cloud Video Intelligence API and seeing whether it could determine what a given video was about. Apparently, this seemingly simple task is a challenge for Google’s AI, which highlights the difficulty of creating automatic systems to categorize video, as Motherboard reports.

The research team in question works at the University of Washington, and it used some trickery to see how smart the Google API really is. Currently in beta, the Google Cloud Video Intelligence API has one job, which is to “make video searchable” and to annotate videos so that humans can search through them more easily.

In their tests, the researchers injected extraneous, and subliminal, images of a pasta bowl into a video featuring primatologist Jane Goodall and gorillas. The result was that the Google AI concluded that the video was actually about spaghetti and not the apes. Another example involved placing a picture of an Audi into a video about tigers, which caused the AI to conclude that the video was about cars.

Although it might sound somewhat comical, these mistakes point out a serious issue with the AI. As the researchers noted in their conclusion:

“However, we showed that the API has certain security weaknesses. Specifically, an adversary can insert an image, periodically and at a very low rate, into the video in a way that all the generated shot labels are about the inserted image. Such vulnerability seriously undermines the applicability of the API in adversarial environments.”

Even worse, the researchers add that “an adversary can bypass a video filtering system by inserting a benign image into a video with illegal contents.” Particularly disturbing is the fact that doing so requires no specialized knowledge about the AI’s machine learning algorithms or about video annotation in general.
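
To make the mechanics concrete, the periodic, low-rate insertion the researchers describe can be sketched on a toy frame sequence. This is only an illustration under stated assumptions, not the researchers' code: frames are stood in for by plain strings, the function names are hypothetical, and the period of 50 frames is an arbitrary example value, since the quoted conclusion says only "periodically and at a very low rate."

```python
from typing import List, Sequence, TypeVar

Frame = TypeVar("Frame")


def insert_periodically(frames: Sequence[Frame], image: Frame, period: int) -> List[Frame]:
    """Return a new frame list with `image` added after every `period` original frames.

    `period` is a hypothetical parameter; the article does not state the rate used.
    """
    if period < 1:
        raise ValueError("period must be at least 1")
    altered: List[Frame] = []
    for index, frame in enumerate(frames, start=1):
        altered.append(frame)
        # After every `period`-th original frame, splice in the unrelated image.
        if index % period == 0:
            altered.append(image)
    return altered


def inserted_fraction(original_len: int, period: int) -> float:
    """Fraction of the altered sequence that consists of inserted frames."""
    inserted = original_len // period
    total = original_len + inserted
    return inserted / total if total else 0.0


if __name__ == "__main__":
    # Toy stand-in for a video: 100 labeled "frames" about apes.
    video = [f"ape_{i}" for i in range(100)]
    altered = insert_periodically(video, "pasta", period=50)
    print(len(altered), altered.count("pasta"), inserted_fraction(len(video), 50))
```

With these example values, only a small share of the altered sequence is the inserted image, which is what makes the reported mislabeling notable: a human viewer would still see a video that is overwhelmingly about apes.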

Ultimately, what the research points out is that AI has a long way to go before it can match the human brain in determining things like a video’s topic. Inserting subliminal messages into video has been known for a long time to affect the human psyche, but at least a human wouldn’t think that a video about apes is actually about spaghetti — the human would probably just start craving pasta instead.

Mark Coppock