Penguin Random House Underscores Copyright Protection in AI Rebuff (thebookseller.com)
- Reference: 0175281497
- News link: https://tech.slashdot.org/story/24/10/19/0121240/penguin-random-house-underscores-copyright-protection-in-ai-rebuff
- Source link: https://www.thebookseller.com/news/penguin-random-house-underscores-copyright-protection-in-ai-rebuff
> Penguin Random House has amended its copyright wording across all imprints globally, confirming it will appear "in imprint pages across our markets." The new wording states: "No part of this book may be used or reproduced in any manner for the purpose of training artificial intelligence technologies or systems," and will be included in all new titles and any backlist titles that are reprinted.
>
> The statement also "expressly reserves [the titles] from the text and data mining exception," in accordance with a European Parliament directive. The move specifically to ban the use of its titles by AI firms for the development of chatbots and other digital tools comes amid a slew of copyright infringement cases in the US and reports that large tranches of pirated books have already been used by tech companies to train AI tools. In 2024, several academic publishers including Taylor & Francis, Wiley and Sage have announced partnerships to license content to AI firms.
[1] https://www.thebookseller.com/news/penguin-random-house-underscores-copyright-protection-in-ai-rebuff
Agree (Score:2)
The corpus of public domain books, freely available for AI training, much better written, would be good starting point for training AI models on literature.
Even the penny dreadfulls, dime novels, pulp magazines are better written than much of the modern books.
Reality: We should push the copyright office, Congress to
limit copyright for written works on paper or electronic to 50 years from the earliest of
- date of first publication - revisions, author's cut, etc. do not extend copyright on the original work
-
Re: (Score:1)
> - if unpublished, then 50 years from the youngest author's 35th birthday.
So if an 86 year old and an 87 year old write something together, they already lost copyright one year before it was written (since it was certainly unpublished when it was not yet written).
Re: (Score:2)
My take is 5 years after the death of the last passed creator.
That would be enough to close the books for the creator and ensure they'll get a decent closure.
That would mean that at least some of the works of Prince could be free to use.
Much simpler solution (Score:2)
Make it work like a patent with a very limited term but requiring a whole, complete implementation being deposited along with the key tools needed to reproduce it. For recorded stage plays, movies and shows, this would include not only the original source footage but also things like the script and designs or replicas of key props. For literature, it would include authors notes and early drafts, as well as references where applicable for any research materials used, and yes, for software, that would include
They can say that all they want (Score:3)
This is a legal question. Whether or not it's allowed depends on the law, not on the message on the copyight page.
If the law restricts AI training, saying it's not allowed doesn't change that.
If the law doesn't restrict it, saying it's not allowed doesn't somehow change the law.
Re: (Score:2)
They should reword it to be like a shrink-wrap license. If you don't agree, then return the book, etc.
Doesn't Mean Much (Score:3)
I doubt this will stop companies (or individuals) from doing it, though. They'll just do a better job at hiding it and making it harder to prove ever happened in the first place.
Re: (Score:2)
Add to it that there are loopholes in the copyright law protecting parodies.
Re: (Score:2)
> and making it harder to prove ever happened in the first place
its easy to prove when your "AI" models training data contains all the other companies watermarks in the requested test sample.