Skip to main content

Nvidia reportedly caught scraping AI data from Netflix and YouTube (again)

Nvidia CEO Jensen in front of a background.
Nvidia

According to a damning report from 404 Media, backed with internal Slack chats, emails, and documents obtained by the outlet, Nvidia helped itself to “a human lifetime visual experience worth of training data per day,” Ming-Yu Liu, vice president of Research at Nvidia and a Cosmos project leader, admitted in a May email.

Recommended Videos

Unnamed former Nvidia employees told 404 that they had been asked to scrape video content from Netflix, YouTube, and other online sources in order to obtain training data for use with the company’s various AI products. Those include Nvidia’s Omniverse 3D world generator, self-driving car systems, and “digital human.”

When those employees asked about the legality of the project, internally named Cosmos, they were assured by management that they had been given clearance by the highest levels of the company to use that content.

The project sought to build a foundation model, akin to Gemini 1.5, GPT-4, or Llama 3.1, “that encapsulates simulation of light transport, physics, and intelligence in one place to unlock various downstream applications critical to Nvidia.”

To do this, project Cosmos allegedly used an open-source video downloader and employed machine learning to IP hop, thereby avoiding YouTube’s attempts to block it. According to emails viewed by 404, project managers discussed using as many as 30 virtual machines running on Amazon Web Services to download 80 years’ worth of full-length and clip-length videos every day.

For its part, Nvidia claims no wrongdoing. “We respect the rights of all content creators and are confident that our models and our research efforts are in full compliance with the letter and the spirit of copyright law,” an Nvidia spokesperson told 404 Media via email. “Copyright law protects particular expressions but not facts, ideas, data, or information. Anyone is free to learn facts, ideas, data, or information from another source and use it to make their own expressions. Fair use also protects the ability to use a work for a transformative purpose, such as model training.”

This is far from the first time that Nvidia (not to mention a vast majority of the rest of the AI field) has taken a “scrape first and maybe ask forgiveness later” approach to its AI training efforts. In July, Nvidia was named in another report on illegal scraping of copyrighted videos alongside Anthropic and Salesforce.

At CES 2024, the company set off an internet firestorm with its ambiguous answers as to how its new generative AI for gaming engine was trained. In response, Nvidia reiterated that its tools were “commercially safe.”

Andrew Tarantola
Former Digital Trends Contributor
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
Nvidia is taking generative AI into a whole new dimension
A visualization of Nvidia's Omniverse platform.

We've seen plenty of generative AI tools that can spit out images, from Microsoft Designer to Stable Diffusion, but Nvidia is taking generative AI into a new dimension -- literally. The company announced a partnership with Shutterstock at Siggraph 2024 that will allow users to generate 3D models using generative AI.

There are handful of AI tools that can generate 3D models, but Shutterstock's take is definitely the most official tool we've seen so far. The models will live in Shutterstock's TurboSquid library, which currently hosts the company's library of 3D assets. To generate new assets, Shutterstock says users will be able to provide text and images as prompts. From there, designers can take the assets and edit them in separate apps, with exports available "in a variety of popular file formats," according to Nvidia.

Read more
Nvidia’s G-Assist AI assistant actually sounds incredible
Destiny 2 running on the Asus ROG PG42UQ.

Well, no one saw this coming. Nvidia announced an AI game assistant at Computex 2024 that will help you do everything from allocate stat points in a game to set up your PC for the best gaming experience. It may sound like another AI ploy from the AI giant that is Nvidia, but Project G-Assist actually could be really useful.

The headline feature that Nvidia is promoting is that Project G-Assist can help you when you're stuck in a game. Developers can train the model on their game, and then that model can access your prompt, what's visible on screen, and even data from the game. From there, you'd be able to ask questions you'd normally need to look up online. For example, where you should put a stat point, or how to craft a particular item. It's not dissimilar to what we saw Microsoft show off with its Copilot demo at Build earlier this month, where a voice assistant walked a player through Minecraft in real time.

Read more
Nvidia just hinted at AI that can play games for you
A screenshot from Nvidia's G-Assist April Fool's video.

"The future is never far away." That's the comment Nvidia made yesterday on social media about an old April Fool's joke about AI in games.

Here's the context: In 2017, Nvidia made a silly April Fool's joke video about something called GTX G-Assist, a dongle that could add some outlandish AI features to gaming. Then again, what was outlandish in 2017 feels possible seven years later.

Read more