A federal judge has ruled that training AI on copyrighted works without specific permission to do so is “fair use”. There’s a “but”, though.
According to Fortune (A federal judge says training AI on copyrighted works is ‘fair use,’ but casts doubt on use of pirated materials, written by Sharon Goldman and available here), California District Judge William Alsup said that AI company Anthropic could assert a “fair use” defense against copyright claims for training its Claude AI models on copyrighted books. But the judge also ruled that it mattered exactly how those books were obtained.
According to the judge’s ruling, Anthropic’s use of the books to train Claude was “exceedingly transformative” and constituted “fair use under Section 107 of the Copyright Act.” Anthropic told the court that its AI training was not only permissible, but aligned with the spirit of U.S. copyright law, which it argued “not only allows, but encourages” such use because it promotes human creativity. The company said it copied the books to “study Plaintiffs’ writing, extract uncopyrightable information from it, and use what it learned to create revolutionary technology.”
While training AI models with copyrighted data may be considered fair use, Anthropic’s separate action of building and storing a searchable repository of pirated books is not, Alsup ruled. Alsup noted that the fact that Anthropic later bought a copy of a book it earlier stole off the internet “will not absolve it of liability for the theft, but it may affect the extent of statutory damages.”
Alsup ordered a separate trial on Anthropic’s storage of those pirated books, which could determine the company’s liability and any damages related to that potential infringement. The judge has also not yet ruled whether to grant the case class action status, which could dramatically increase the financial risks to Anthropic if it is found to have infringed on authors’ rights.
In finding that it was “fair use” for Anthropic to train its AI models on books written by three authors—Andrea Bartz, Charles Graeber, and Kirk Wallace Johnson—who had filed a lawsuit against the AI company for copyright violations, Alsup addressed a question that has simmered since before OpenAI’s ChatGPT kick-started the generative AI boom in 2022: Can copyrighted data be used to train generative AI models without the owner’s consent?
Dozens of AI-and-copyright-related lawsuits have been filed over the past three years, most of which hinge on the concept of fair use, a doctrine that allows the use of copyrighted material without permission if the use is sufficiently transformative—meaning it must serve a new purpose or add new meaning, rather than simply copying or substituting the original work.
Alsup’s ruling may set a precedent for these other copyright cases—although it is also likely that many of these rulings will be appealed, meaning it will take years until there is clarity around AI and copyright in the U.S.
So, what do you think? Do you think this ruling that that training AI on copyrighted works will impact other AI copyright cases? Please share any comments you might have or if you’d like to know more about a particular topic.
Image created using Microsoft Designer, using the term “two robot lawyers high fiving while another looks on sadly”.
Disclaimer: The views represented herein are exclusively the views of the authors and speakers themselves, and do not necessarily represent the views held by my employer, my partners or my clients. eDiscovery Today is made available solely for educational purposes to provide general information about general eDiscovery principles and not to provide specific legal advice applicable to any particular circumstance. eDiscovery Today should not be used as a substitute for competent legal advice from a lawyer you have retained and who has agreed to represent you.
Discover more from eDiscovery Today by Doug Austin
Subscribe to get the latest posts sent to your email.




[…] this week! A federal judge sides with Meta in a key AI copyright case the same week Anthropic got a similar ruling in their […]
[…] those books legally. The 7 million books they allegedly pirated (which were discussed in their copyright ruling this week) are another […]
[…] wave of copyright lawsuits brought by publishers, creatives, and Hollywood studios (including this case and this case which had important rulings last […]
[…] note: why have I never heard of Megatron AI?). Let’s see if they’ll learn any lessons from the Anthropic and Meta AI copyright rulings that occurred last […]
[…] court judge William Alsup issued a summary judgment in Bartz v. Anthropic (reported by us here) that largely sided with Anthropic, finding that the company’s usage of the books was “fair […]