Searching for objects and locations inside video footage is getting much easier

During the Google Cloud Next conference in San Francisco, Google revealed a new machine learning application programming interface (API) called Cloud Video Intelligence. With this API, developers can create applications capable of detecting objects within video and making them searchable and discoverable. Both nouns and verbs can be applied to those objects, such as “dog” and “run.”

An API is essentially a bridge between a service and an application. In this case, the API connects to the Google Cloud Machine Learning platform for the compute aspect and stores annotated videos on Google Cloud Storage. Through this bridge, an application built on Google’s new API gains access to that functionality and can give end users a better way of searching through videos.

“You can now search every moment of every video file in your catalog and find every occurrence as well as its significance,” Google states. “It helps you identify key nouns entities of your video, and when they occur within the video. Separate signal from noise, by retrieving relevant information at the video, shot or per frame.”

In a demo, users can search for animals in an MP4 video file lasting just over a minute and a half. The labels generated by Cloud Video Intelligence consist of Animal (99 percent), Wildlife (94 percent), Zoo (91 percent), Terrestrial Animal (54 percent), Nature (51 percent), Tourism (47 percent), and Tourist Destination (43 percent). The sample video features the Los Angeles Zoo as presented by Disney’s CGI-animated movie Zootopia.
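
To make those confidence figures concrete, here is a minimal Python sketch that filters the demo’s labels by a confidence threshold. The label names and percentages come from the demo described above. The `Label` type, the `filter_labels` helper, and the 0.5 default threshold are assumptions made for illustration and are not part of Google’s API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Label:
    """A video-level label and its confidence (hypothetical type, not Google's)."""
    description: str
    confidence: float  # 0.0 to 1.0


# Labels and confidences reported for the demo clip.
DEMO_LABELS = [
    Label("Animal", 0.99),
    Label("Wildlife", 0.94),
    Label("Zoo", 0.91),
    Label("Terrestrial Animal", 0.54),
    Label("Nature", 0.51),
    Label("Tourism", 0.47),
    Label("Tourist Destination", 0.43),
]


def filter_labels(labels, min_confidence=0.5):
    """Return labels at or above min_confidence, highest confidence first."""
    kept = [label for label in labels if label.confidence >= min_confidence]
    return sorted(kept, key=lambda label: label.confidence, reverse=True)


if __name__ == "__main__":
    for label in filter_labels(DEMO_LABELS):
        print(f"{label.description}: {label.confidence:.0%}")
```

An application could apply a cutoff like this to hide weaker guesses such as Tourism and Tourist Destination from search results. Where to set the threshold is a product decision.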

However, what’s really neat about the new API is how it can detect a scene in a video. In the same clip, Cloud Video Intelligence detects 48 scene changes and labels objects in real time as the scenes change. For instance, in one scene that displays just Nick the fox, the API generates seven labels. In another scene focusing on the zoo’s sign, the system generates only two labels … again, all in real time.
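
One way an application might make per-scene labels searchable is to build an index from each label to the time ranges of the scenes that carry it. The Python sketch below illustrates that idea only. The shot boundaries and label names in `SAMPLE_SHOTS` are made up for the example, and `build_label_index` and `search` are hypothetical helpers, not functions from Google’s API.

```python
from collections import defaultdict


def build_label_index(shots):
    """Map each lowercased label to the (start, end) times of shots that carry it."""
    index = defaultdict(list)
    for start, end, labels in shots:
        for label in labels:
            index[label.lower()].append((start, end))
    return dict(index)


def search(index, query):
    """Return the time ranges whose labels match the query, ignoring case."""
    return index.get(query.strip().lower(), [])


# Hypothetical shots: (start_seconds, end_seconds, labels).
# These values are invented for illustration and are not taken from the demo.
SAMPLE_SHOTS = [
    (0.0, 4.2, ["Fox", "Animal"]),
    (4.2, 7.5, ["Sign", "Zoo"]),
    (7.5, 12.0, ["Animal", "Zoo"]),
]

if __name__ == "__main__":
    index = build_label_index(SAMPLE_SHOTS)
    for start, end in search(index, "zoo"):
        print(f"'zoo' appears from {start:.1f}s to {end:.1f}s")
```

With an index like this, a query returns the moments in a video where a label occurs rather than just the video as a whole.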

What Google has done is create a tool that enables users to search through a video catalog just like they would with text documents. According to the company, this will be highly useful for businesses to separate signals that are buried under noise. It can also “detect features of a signal providing only relevant entities at video, shot or frame level.”

“Google has a long history working with the largest media companies in the world, and we help them find value from unstructured data like video,” said Fei-Fei Li, Chief Scientist of Google Cloud AI and Machine Learning. “This API is for large media organizations and consumer technology companies, who want to build their media catalogs or find easy ways to manage crowd-sourced content.”

The new API is now in a private beta and will also be offered to Google’s partners such as Cantemo, which will use the API to connect its video management software to the Google Cloud Machine Learning platform.

Kevin Parrish
Former Digital Trends Contributor