Skip to main content

Meta faces lawsuit for training AI with pirated books

A silhouetted person holds a smartphone displaying the Facebook logo. They are standing in front of a sign showing the Meta logo.
SOPA Images / Getty Images

In a recent lawsuit, Meta has been accused of using pirated books to train its AI models, with CEO Mark Zuckerberg’s approval. As per Ars Technica, the lawsuit filed by authors including Ta-Nehisi Coates and Sarah Silverman in a California federal court, cite internal Meta communications indicating that the company utilized the Library Genesis (LibGen) dataset—a vast online repository known for hosting pirated books—despite internal concerns about the legality of using such material.

The authors argue that Meta’s actions infringe upon their copyrights and could undermine the company’s position with regulators. They claim that Meta’s AI models, including Llama, were trained using their works without permission, potentially harming their livelihoods. Meta has defended its practices by invoking the “fair use” doctrine, asserting that using publicly available materials to train AI tools is legal in certain cases, such as “using text to statistically model language and generate original expression.”

Recommended Videos

Unsealed court documents from February 5th, 2024, in Kadrey v. Meta show Meta (formerly Facebook) illegally torrented 81.7TB of data from "shadow libraries" such as Anna's Archive, Z-Library, and LibGen to train Meta artificial intelligence.

Highlights include:
– A senior AI… pic.twitter.com/Bqf60Hhbb6

— vx-underground (@vxunderground) February 8, 2025

One internal message highlighted in the lawsuit quotes an employee expressing discomfort, stating, “Torrenting from a corporate laptop doesn’t feel right.”

In response to the lawsuit, U.S. District Judge Vince Chhabria dismissed some claims but allowed the authors to amend their complaint to include new allegations, including those related to the removal of copyright management information. This case is part of a broader wave of legal challenges against tech companies like Meta, OpenAI, and Anthropic, where authors and creators are seeking to protect their intellectual property rights in the face of rapidly advancing AI technologies.

The outcome of this lawsuit could have significant implications for the tech industry, particularly concerning the use of copyrighted materials in AI training. It raises important questions about the balance between technological innovation and the protection of creators’ rights.

Kunal Khullar
Kunal Khullar is a computing writer at Digital Trends who contributes to various topics, including CPUs, GPUs, monitors, and…
Elon Musk says the world is running out of data for AI training
Grok app on an iPhone.

Tesla/X CEO Elon Musk seems to believe that training AI models with solely human-made data is becoming impossible. Musk claims that there's a growing lack of real-world data with which to train AI models, including his Grok AI chatbot.

“We’ve now exhausted basically the cumulative sum of human knowledge … in AI training,” Musk said during an X live-stream interview conducted by Stagwell chairman Mark Penn. “That happened basically last year.”

Read more
Generative-AI-powered video editing is coming to Instagram
Instagram on iPhone against a colorful background.

Editing your Instagram videos will soon be as simple as typing out a text prompt, thanks to a new generative AI tool the company hopes to release in 2025, CEO Adam Mosseri announced Thursday.

The upcoming tool, which leverages Meta's Movie Gen model, will enable users to "change nearly any aspect of your videos," Mosseri said during his preview demonstration. Those changes range from subtle modifications, like adding a gold chain to his existing outfit or a hippo in the background, to wholesale alterations including swapping his wardrobe or giving himself a felt, Muppet-like appearance.

Read more
Ray-Ban Meta Smart Glasses get real-time visual AI and translation
Tracey Truly shows multi-reflective options with Ray-Ban Meta Smart Glasses.

Meta is rolling out two long-awaited features to its popular Ray-Ban Smart Glasses: real-time visual AI and translation. While it's just being rolled out for testing right now, the plan is that, eventually, anyone that owns Ray-Ban Meta Smart Glasses will get a live assistant that can see, hear, and translate Spanish, French, and Italian.

It's part of the v11 update that cover the upgrades Meta described at its Connect 2024 event, which also include Shazam integration for music recognition. This all happens via the camera, speakers, and microphones built into the Ray-Ban Meta glasses, so you don’t need to hold up your phone.

Read more