Searching for objects and locations inside video footage is getting much easier

During the Google Cloud Next conference in San Francisco, Google revealed a new machine learning application programming interface (API) called Cloud Video Intelligence. With this API, developers can create applications that detect objects within video and make them searchable and discoverable. The labels applied to those objects can be nouns or verbs, such as “dog” and “run.”

An API is essentially a bridge between a service and an application. In this case, the API connects to the Google Cloud Machine Learning platform for the compute work and stores annotated videos on Google Cloud Storage. Through this bridge, an application built on Google’s new API gains access to that functionality and can give end users a better way of searching through videos.

“You can now search every moment of every video file in your catalog and find every occurrence as well as its significance,” Google states. “It helps you identify key nouns entities of your video, and when they occur within the video. Separate signal from noise, by retrieving relevant information at the video, shot or per frame.”

In a demo, users can search for animals in an MP4 video file lasting just over a minute and a half. The labels generated by Cloud Video Intelligence, each with a confidence score, are Animal (99 percent), Wildlife (94 percent), Zoo (91 percent), Terrestrial Animal (54 percent), Nature (51 percent), Tourism (47 percent), and Tourist Destination (43 percent). The sample video features the Los Angeles Zoo and is presented by Disney’s CGI-animated movie Zootopia.
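To make the confidence scores concrete, here is a minimal Python sketch that filters the demo's labels by a cutoff. It uses only the label names and percentages reported above; the dictionary layout, the `confident_labels` helper, and the threshold values are assumptions made for illustration and are not part of Google's API.

```python
# Labels and confidence scores as reported for the demo video.
# The dictionary layout is an assumption, not the API's response format.
DEMO_LABELS = {
    "Animal": 0.99,
    "Wildlife": 0.94,
    "Zoo": 0.91,
    "Terrestrial Animal": 0.54,
    "Nature": 0.51,
    "Tourism": 0.47,
    "Tourist Destination": 0.43,
}


def confident_labels(labels, threshold=0.5):
    """Return label names scoring at or above threshold, highest score first.

    Hypothetical helper: the cutoff is an application-level choice.
    """
    kept = [(name, score) for name, score in labels.items() if score >= threshold]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return [name for name, _ in kept]


if __name__ == "__main__":
    # Keep only the labels the system is very sure about.
    print(confident_labels(DEMO_LABELS, threshold=0.9))
```

With a cutoff of 0.9, only Animal, Wildlife, and Zoo survive. An application would likely tune that cutoff depending on whether it prefers fewer, more reliable search hits or broader coverage.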

However, what’s really neat about the new API is how it can detect scene changes in a video. In the same clip, Cloud Video Intelligence detects 48 scene changes and labels objects in real time as the scenes change. For instance, in one scene that displays just Nick the fox, the API generates seven labels. In another scene focusing on the zoo’s sign, the system generates only two labels … again, all in real time.

What Google has done is create a tool that enables users to search through a video catalog just like they would with text documents. According to the company, this will be highly useful for businesses to separate signals that are buried under noise. It can also “detect features of a signal providing only relevant entities at video, shot or frame level.”
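One way to picture text-style search over video is an inverted index that maps each label to the shots where it appears. The Python sketch below is an illustration only: the file names, timestamps, and labels in `SAMPLE` are invented, the `build_label_index` and `search` helpers are hypothetical, and none of it reflects the actual response format of Cloud Video Intelligence.

```python
from collections import defaultdict


def build_label_index(annotations):
    """Map each lowercased label to the (video, start_s, end_s) shots it appears in."""
    index = defaultdict(list)
    for video, shots in annotations.items():
        for start_s, end_s, labels in shots:
            for label in labels:
                index[label.lower()].append((video, start_s, end_s))
    return index


def search(index, term):
    """Return every shot tagged with the term, case-insensitively."""
    return index.get(term.lower(), [])


# Hypothetical annotations: videos, shot boundaries (in seconds), and labels
# are made up for this example and are not real API output.
SAMPLE = {
    "zoo_tour.mp4": [
        (0.0, 4.5, ["Zoo", "Sign"]),
        (4.5, 9.0, ["Animal", "Fox"]),
    ],
    "city_walk.mp4": [
        (0.0, 12.0, ["Tourism", "Animal"]),
    ],
}

if __name__ == "__main__":
    index = build_label_index(SAMPLE)
    for video, start_s, end_s in search(index, "animal"):
        print(f"{video}: {start_s}s to {end_s}s")
```

Searching for "animal" returns one shot from each of the two made-up videos, which is the kind of moment-level result the company describes.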

“Google has a long history working with the largest media companies in the world, and we help them find value from unstructured data like video,” said Fei-Fei Li, Chief Scientist of Google Cloud AI and Machine Learning. “This API is for large media organizations and consumer technology companies, who want to build their media catalogs or find easy ways to manage crowd-sourced content.”

The new API is now in a private beta and will also be offered to Google’s partners such as Cantemo, which will use the API to connect its video management software to the Google Cloud Machine Learning platform.

Kevin Parrish
Former Digital Trends Contributor