Meta accused of downloading torrents of 81.7TB of pirated books to train its Llama AI models

Meta torrented at least 81.7TB of data across multiple shadow libraries to train its Llama AI: Mark Zuckerberg is in trouble yet again.

Gaming Editor
TL;DR: Meta is accused of illegally torrenting 81.7TB of pirated books to train its Llama AI models, according to a lawsuit in California. Authors allege Meta used sources like Z-Library and LibGen, with internal emails showing awareness of illegal actions. Plaintiffs seek to reopen depositions and access Meta's torrenting logs.

Meta has been accused of torrenting an astonishing 81.7TB of pirated books to train its Llama AI models, according to a new court filing in a lawsuit before the US District Court for the Northern District of California.

The social networking giant has been accused of illegally torrenting copyrighted materials from sources including Z-Library and LibGen. The plaintiffs, led by author Richard Kadrey and others representing a proposed class, have filed a motion objecting to a pre-trial discovery ruling that the authors argue limits their ability to gather critical evidence against Meta.

The authors claim that Meta's last-minute disclosure of over 2,000 documents on December 13, 2024, just hours before the close of fact discovery, revealed admissions from Meta employees about using pirated materials for its AI training. The authors, who filed the copyright lawsuit, say the newly unsealed emails are damning evidence that Meta unlawfully trained its AI models using pirated books downloaded over torrents.

The new evidence shows that Meta torrented "at least 81.7 terabytes of data across multiple shadow libraries through the site Anna's Archive, including at least 35.7 terabytes of data from Z-Library and LibGen", according to the authors' court filing, which adds that "Meta also previously torrented 80.6 terabytes of data from LibGen".

The authors' filing alleges that "the magnitude of Meta's unlawful torrenting scheme is astonishing", insisting that "vastly smaller acts of data piracy - just .008 percent of the amount of copyrighted works Meta pirated - have resulted in Judges referring the conduct to the US Attorneys' office for criminal investigation".

One Meta staffer reportedly said: "I feel that using pirated material should be beyond our ethical threshold", while another document alleges that Meta's decision to use LibGen was escalated to Meta CEO Mark Zuckerberg. The authors claim that internal emails about torrenting prove that Meta was well aware its actions were illegal, pointing to warnings from employees that they say were ignored.

The plaintiffs are challenging several aspects of a recent discovery ruling:

  • Reopening Depositions: They argue that the late-disclosed documents contradict prior testimony from key Meta witnesses and justify reopening depositions to question them about these revelations.
  • Torrenting Data: Plaintiffs are seeking access to Meta's torrenting logs and peer-sharing records to demonstrate how much pirated material was downloaded and redistributed.
  • Llama 4 and 5 Training Datasets: The plaintiffs claim that datasets used for upcoming versions of Llama are relevant to their case and should be produced.
  • Crime-Fraud Exception: They allege that Meta's attorneys were involved in decisions to use pirated materials despite knowing it was illegal, warranting an in-camera review of privileged communications under the crime-fraud exception.

Anthony joined the TweakTown team in 2010 and has since reviewed 100s of graphics cards. Anthony is a long-time PC enthusiast with a passion of hate for games built around consoles. An FPS gamer since the pre-Quake days, when you were insulted if you used a mouse to aim, he has been addicted to gaming and hardware ever since. Working in IT retail for 10 years gave him great experience with custom-built PCs. His addiction to GPU tech is unwavering, and he has recently taken a keen interest in artificial intelligence (AI) hardware.
