News: 0183141582

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

Zuckerberg 'Personally Authorized and Encouraged' Meta's Copyright Infringement (variety.com)

(Wednesday May 06, 2026 @05:00PM (BeauHD) from the move-fast-and-torrent-things dept.)


Five major publishers and author Scott Turow have sued Meta and Mark Zuckerberg, alleging that Zuckerberg " [1]personally authorized and actively encouraged" massive copyright infringement by using pirated books, journal articles, and web-scraped material to train Meta's Llama AI systems. Meta denies wrongdoing and says it will fight the case, arguing that courts have recognized AI training on copyrighted material as potentially fair use. Variety reports:

> "In their effort to win the AI 'arms race' and build a functional generative AI model, Defendants Meta and Zuckerberg followed their well-known motto: 'move fast and break things,'" the plaintiffs say in their lawsuit. "They first illegally torrented millions of copyrighted books and journal articles from notorious pirate sites and downloaded unauthorized web scrapes of virtually the entire internet. They then copied those stolen fruits many times over to train Meta's multibillion-dollar generative AI system called Llama. In doing so, Defendants engaged in one of the most massive infringements of copyrighted materials in history."

>

> The suit was filed Tuesday (May 5) in the U.S. District Court for the Southern District of New York by five publishers (Hachette, Macmillan, McGraw Hill, Elsevier and Cengage) and Turow individually. The proposed class-action suit seeks unspecific monetary damages for the alleged copyright infringement. A copy of the lawsuit is available at [2]this link (PDF). [...] the latest lawsuit alleges that Meta and Zuckerberg deliberately circumvented copyright-protection mechanisms -- and had considered paying to license the works before abandoning that strategy at "Zuckerberg's personal instruction." The suit essentially argues that the conduct described falls outside protections afforded by fair-use provisions of the U.S. copyright code.



[1] https://variety.com/2026/digital/news/meta-ai-mark-zuckerberg-copyright-infringement-lawsuit-publishers-scott-turow-1236738383/

[2] https://publishers.org/wp-content/uploads/2026/05/2026-05-05-Complaint.pdf



Goes to show how full of themselves they are (Score:1)

by HalAtWork ( 926717 )

It goes to show you how full of themselves someone is when they straight up flout the law and personally authorize it.

Platform engagement tactics also show his motives are not altruistic.

It is clear he is a greedy sociopath.

Re: (Score:1)

by Anonymous Coward

i despise Zuck as much as anyone, but you do realize this is just a law suit, nothing has been proven or even brought in front of the court.

Re: Goes to show how full of themselves they are (Score:4, Informative)

by toutankh ( 1544253 )

> Meta denies wrongdoing and says it will fight the case, arguing that courts have recognized AI training on copyrighted material as potentially fair use.

They did something and are now hoping for it to become legal.

Re: (Score:2)

by swillden ( 191260 )

>> Meta denies wrongdoing and says it will fight the case, arguing that courts have recognized AI training on copyrighted material as potentially fair use.

> They did something and are now hoping for it to become legal.

More precisely, they did something in an undefined gray area and are now hoping the courts decide their way.

Re: (Score:1)

by bill_mcgonigle ( 4333 ) *

> when they straight up flout the law and personally authorize it

That's the law for you, not the corporate oligarchs.

Worst case outcome for them is they pay a small fine, nobody is personally held liable.

Ruler of the perverse incentives? (Score:2)

by shanen ( 462549 )

Mostly agree with you and could cite several recent books to support the areas of agreement, but I still want to go for the joke. The root of the problem is perverse incentives, but I would argue the incentives rule and Zuck is just another fool trapped in a system that he thinks he built. (And I still think the most perverse incentives are over at YouTube and Amazon. Or maybe in the AI bubble somewhere? (We are so googled and zucked...))

Okay, I can't resist one citation for Chaos Monkeys because it is so

Re: (Score:2)

by rickb928 ( 945187 )

"flout the law"

They disregarded the rights of authors etc. to be compensated for their work. The Law just identified the transgressions.

You can be sure if I or you shared Meta's source code and built a site with it we would be pursued to the ends of the Earth.

Hypocrisy, and yes just one practitioner. Get caught, pay up.

I approach this the same way I approached the Martha Stewart insider trading scandal. It's not that she made a few bucks, or avoided losing some, but, but, some other schlub suffered the loss

training may be legal (Score:5, Insightful)

by k3v0 ( 592611 )

but unauthorized distribution via torrents is def not fair use

Training is not legal (Score:1)

by HalAtWork ( 926717 )

The training is not legal. They're using copyright materials for commercial purposes and may potentially be able to reproduce it for everyone who uses it even if it's in bits and pieces.

Re: (Score:1)

by Junta ( 36770 )

The problem is whether the AI "learning" is analagous to "learning" in the human sense.

For example, there was an existing library implementation of some function, but it didn't work the way I wanted and it was too deep to reasonably modify and the function wasn't *that* involved, so I decided to roll a new one.

Now I type a function name and LLM offered to tab complete the whole function body. The function body for a non-trivial function matched verbatim the very library that didn't work for my purposes, do

Re:Learning from books has always been legal (Score:4, Interesting)

by SoftwareArtist ( 1472499 )

Whether or not the training is fair use, stealing the works is not. The courts have already made that clear. Fair use is about what you're allowed to do with works that are legally in your possession. If you steal the works instead of acquiring them legally, that's not fair use.

Re: (Score:3)

by spacepimp ( 664856 )

If I steal a truckload of books so I can read them, it doesn't change the fact that I stole a truckload of books. Maybe the reading of the books wasn't the crime, but that is not what this lawsuit is about. This is about infringing copyright on millions of books, not about reading those books.

Re: (Score:2)

by Junta ( 36770 )

I agree with you, but the big tech companies *seem* to be winning their arguments, even when the plaintiff shows output that includes even the watermark of the plaintiff's stuff on something that looks like the plaintiff's assets.

So it's at least more pragmatic to show that the acquisition and likely redistribution of the works while torrenting were a problem without even bringing up the whole AI ingest argument.

Re: (Score:2)

by taustin ( 171655 )

> The training is not legal.

Until the Supreme Court rules on it (and they will, eventually), the question remains open.

After all, Google Won Authors Guild v. Google at trial. Many of the same arguments will apply here.

The issue to pursue here isn't what they did with the pirated material, it's how they pirated it.

(Commercial use can affect the penalties for infringement, but it's a factor in whether or not it is infringement.)

Re: (Score:3)

by bill_mcgonigle ( 4333 ) *

One might imagine that buying a million books would get a buyer a very good discount from the publisher.

They could have paid $5 or whatever for each book they trained on. $15M or so for 3 million books - they could totally afford that but "why pay when you can steal?"

Then at least they would be defending fair use rather than defending a 'theft' lawsuit.

If we had Constitutional Copyright they at least would have millions of 14-year-old books to train on. That would be quite sufficient to train and refine m

Re: (Score:2)

by swillden ( 191260 )

> One might imagine that buying a million books would get a buyer a very good discount from the publisher.

> They could have paid $5 or whatever for each book they trained on. $15M or so for 3 million books - they could totally afford that but "why pay when you can steal?"

True, though I'll bet the negotiations with dozens of publishers would have dragged on for years.

> If we had Constitutional Copyright they at least would have millions of 14-year-old books to train on. That would be quite sufficient to train and refine models.

Although I know what you mean, there is no such thing as "Constitutional Copyright". The Constitution authorizes Congress to set terms for protecting the work of authors and inventors, but it doesn't specify the 14-year term. That was just the first term that Congress chose to enact. It happened not long after the constitution was ratified, but it wasn't in any way part of the constitution.

Precedent (Score:5, Informative)

by Local ID10T ( 790134 )

Precedent holds that training with copyrighted material is transformative in nature, and thus is non-infringing.

Precedent further holds that pirating the material to train with is an incurable violation of copyright: That an AI trained using a dataset that includes pirated material is tainted to a degree that can only be cured by deletion of the AI and the training set data. Purchasing valid copies of the data after the fact are not sufficient; although a new dataset can be constructed from the newly purchased data and a new AI trained with this new dataset. This is in addition to the financial liability of the copyright violations.

Zuck is fucked.

Re: (Score:2)

by Pinky's Brain ( 1158667 )

The cure was to make some lawyers very rich while the copyright owners got peanuts, which also prevented those lawyers from taking it to a higher court. Even so, it ain't over till the supreme court geriatrics divine the penumbra of the law.

Cost of a re-do? (Score:1)

by Tablizer ( 95088 )

> Purchasing valid copies of the data after the fact are not sufficient; although a new dataset can be constructed from the newly purchased data and a new AI trained with this new dataset.

Any guestimates on how much it would cost META to actually purchase all or most those same works to re-train their bots legally?

(They still may have to pay a fine or refund for their original transgression.)

> Zuck is fucked.

The rich often skirt the law. They have a thousand flying monkey lawyers going through thousands of leg

Re:Cost of a re-do? (correction) (Score:1)

by Tablizer ( 95088 )

Re: "going through thousands of legal documents"

Make that millions, if not billions, being law search engines are used.

Re: (Score:2)

by swillden ( 191260 )

> Precedent holds that training with copyrighted material is transformative in nature, and thus is non-infringing.

> Precedent further holds that pirating the material to train with is an incurable violation of copyright

These two statements are mutually contradictory. Is it infringing or not?

Re: (Score:2)

by Local ID10T ( 790134 )

Training is not the same as pirating.

Training using material legally acquired is legal.

Training using illegally acquired material is illegal.

Not a contradiction.

Re: (Score:3)

by noshellswill ( 598066 )

Transistors ?

Re: Headline is wrong (Score:5, Informative)

by reanjr ( 588767 )

Citation greatly needed.

Re: (Score:2)

by MikeDataLink ( 536925 )

> Training an AI is exactly the same as training a human mind

Uh. No.

Re: (Score:2)

by swillden ( 191260 )

> Training an AI is exactly the same as training a human mind

I'm inclined to agree, except for one thing: The LLM can be duplicated infinitely, at near-zero cost, while the trained human cannot. That's a crucial difference. In the case of a textbook, for example, an LLM that has learned the whole contents can act as a full replacement for the book, to an arbitrary number of users, which might severely impact the commercial market for the original book. A human who learns it can write their own and sell it to many people, but that takes a lot of effort and signifi

all these AI systems are doing it (Score:4, Interesting)

by FudRucker ( 866063 )

ChatGPT (OpenAI) Google, Microsoft, and other unnamed AI systems too if they are taking other people's content to feed to their AI then they are guilty of copywrite infringement

Re: (Score:1)

by Tablizer ( 95088 )

I believe scraping the web would only be a problem if they skirted paywalls (hacked), or violated the terms of paid subscriptions, some of which forbid bot training use.

Scraping publicly available sites for AI training is generally not considered copyright violation, although I'm not an expert, so don't quote me.

At minimum (Score:2)

by wakeboarder ( 2695839 )

torrenting is not fair use. If an individual does this they will face jail time and fees, so should a business what facebook did is not fair.

Re: (Score:2)

by Pinky's Brain ( 1158667 )

Scanning a book and printing out a hundred copies for yourself is not fair use either.

Zuck, Dario and Altman should share a cell.

Re: (Score:2)

by wakeboarder ( 2695839 )

No it's not, but if it's ok for a human to read and retain information (like someone with a photographic memory) then wouldn't it be ok for AI to do the same? I'm not arguing for this but I suppose this is an argument that could be made. You should be able to determine if your work is searchable and readable by AI. AI should at least get citations right.

adjustable-rate mort-gauge (Score:2)

by Pseudonymous Powers ( 4097097 )

I heard Zuckerberg failed to pay back a loanshark operating in the Metaverse and they took his legs.

Meta argues AI training is fair use. (Score:2)

by Mirnotoriety ( 10462951 )

ClippyAI: 1. Direct Precedents: AI-Specific Rulings

As of 2026, several cases have directly addressed generative AI training, with mixed results that Meta uses to bolster its "fair use" defense.

Kadrey v. Meta (2025): In this case, Meta successfully argued for partial summary judgment on the grounds of fair use regarding the training process of its Llama models. Judge Vince Chhabria ruled in favor of Meta, though he expressed significant doubts about whether all forms of AI training would be considered

How is it ever wrong to see public websites? (Score:2)

by sabbede ( 2678435 )

Yes, they may be in real hot water with the torrents, but scraping publicly available websites cannot be wrong. It's public information in public view. That should probably be tossed out, and the proceedings should focus on the pirated material.

Re: (Score:2)

by Okind ( 556066 )

> [...] scraping publicly available websites cannot be wrong. It's public information in public view.

This depends entirely on the copyright laws where you live / read the stuff.

For example, newspapers generally publish some articles for free reading by the general public. They are in public view, and thus also public information (as in, available for the public for a specific, narrow purpose).

But the newspaper still holds the copyright. And the (implied) licence to read their news articles most certainly does not allow you to copy them, for example. So scraping is almost certainly a copyright violation (i

Re: (Score:1)

by innocent_white_lamb ( 151825 )

Your computer creates a copy of the article to show on your screen when you load the article.

Re: (Score:2)

by sabbede ( 2678435 )

Well, in that case its distribution that gets one in trouble. It's not illegal to photocopy a newspaper, but you can't then distribute those copies. I'm not even sure if it's technically legal for a professor to copy an article and pass it out in class, but I doubt any have gotten in trouble for it.

Someone posted a list of recent court decisions that say scraping for AI training is fair use due to it being transformative. If that's the case, and the jurisdictions line up, that portion of the suit will

Jail plz??? (Score:1)

by gloof ( 10503342 )

Sooooo, when is this guy going to go to prison for this?

Throw the book at 'em (Score:2)

by hwstar ( 35834 )

I despise the whole concept behind Facebook.

You're basically submitting yourself to a continuous session under the data harvesting fluoroscope.

Sorry, I never signed up and never will sign up.

Socializing live by going out to real physical events is much better for you.

Sooo...facebok is fair game ? (Score:2)

by sit1963nz ( 934837 )

Scraping facebook of all its data etc is fair game "to train an AI" ?

Why do I suspect that Zac would disagree....

Copyright: to promote the useful arts and sciences (Score:2)

by Larry_Dillon ( 20347 )

I have no love for Zuck, and believe under current precedents that the conduct was illegal.

But on the other hand, US copyright is broken and has strayed from the original purpose and social bargain. Thus I have some sympathy for the argument that training AI is the definition of promoting the useful arts and sciences.

I'm more upset about the way publishers license ebooks to libraries and restrict academic papers.

Marriage Ceremony: An incredible metaphysical sham of watching God and the
law being dragged into the affairs of your family.
-- O. C. Ogilvie