A group of prominent intellectual property law professors has weighed in on the high-stakes AI copyright battle between several authors and Meta. In an amicus brief, the scholars argue that using copyrighted content as training data can be considered fair use under U.S. copyright law, if the goal is to create a new and ‘transformative’ tool. This suggests that fair use could potentially apply to Meta’s training process, even if the underlying data was obtained without permission.
This case has a clear piracy angle, as Meta used BitTorrent to download archives of pirated books to use as training material. Notably, the authors argue that, in addition to copying pirated books from Anna’s Archive and Z-Library, in the same process Meta also uploaded pirated books to third parties.
This week, a group of IP Law Professors submitted a “friend of the court” or amicus brief, backing Meta’s fair use defense. The professors, including scholars from Harvard, Emory, Boston University, and Santa Clara University, have different views on the impact of AI but are united in their copyright stance.
The brief stresses that Meta’s alleged use of pirated books as training data can be considered fair use. The source of the training data is not determinative, as long as it’s used to create a new and transformative product, they argue.
“The case law, including binding circuit precedent, holds that internal copying, made in the course of creating new knowledge, is a transformative use that is heavily favored by fair use doctrine,” the professors write.
The professors’ argument is centered around the concept of “transformative use.” They note that using books outside their original ‘reading’ purpose to create an AI model, transforms the purpose of the use. This internal copying, they argue, falls into a category courts have consistently recognized as fair use, also known as “non-expressive use”.
The amicus brief cites several cases to back up their line of reasoning. This includes the Perfect 10 v. Amazon lawsuit, where the Ninth Circuit found that it was fair use when Google created thumbnails using images copied from unauthorized “pirate” sites, because the resulting image search tool was transformative.
The authors cited conflicting cases, but the professors note that cases where fair use was denied typically involved copyright infringement related to personal consumption, rather than use of content to create something new.
The brief distinguishes this case from those cited by the plaintiffs, which involved unauthorized copying for direct consumptive use (e.g., downloading for personal enjoyment). In contrast, Meta’s internal copies were allegedly not perceived by humans but used to build a new tool.
“Fair use, like copyright as a whole, ‘is not a privilege reserved for the well behaved’,” the brief notes. “Fair use doctrine should focus on the consequences of a ruling for knowledge and expression. Other considerations should be left for other legal regimes.”
Other countries, including Japan, have reportedly crafted exceptions in their law to allow tech companies to train LLMs on copyrighted material, without permission.
The U.S. has no such exceptions, but the professors urge the court to consider fair use. As the VCR and other innovations showed, copyright shouldn’t stand in the way of new tools and developing technologies.
@01011@monero.town So if I’ve illegally downloaded every paper and book published by Matthew Sag, Zahr K. Said, Jessica Silbey and Rebecca Tushnet and used that content to create an app that would output legal briefs in their voice saying whatever someone was willing to pay me to make it say, that is now legal? Or would that not qualify as “transformative” as shilling for $$$ is what lawyers have been doing for centuries?
I download and read pirated books because I am in the process of writing a new and transformative novel.
Now we just need to classify human minds as AI, and so everything downloaded is to train them.
AI = Actual Intelligence
I mean, humans are certainly a form of intelligence. You can also argue that they’re artificially created through impregnation. So, humans can be said to be AI.
So it’s fair use when the AI reads something it didn’t pay for, but for humans it’s “piracy”.
Funny what money does.
“I was just downloading that to train my neural net”
Sounds like piracy is legal now, cool. Not that the legal status was going to deter me before, but this might embolden me more.
It’s always been legal to download, but not upload.
Not true, unpunished or hard to enforce doesn’t mean it is legal.
I didn’t say “unpunished or hard to enforce”.
No you didn’t but the main reason people get the wrong impression that downloading is legal is that it mostly goes unpunished and is very hard to enforce.
It goes unpunished because it’s not illegal.
If you have evidence otherwise, feel free to share.
In the USA.
These whores will do anything when a mega corpo pays…
Where Disney at?
So piracy is legal now. We can use piracy data to train our brain models. No one can say it’s not innovative nor transformative.
It’s way better at driving cars.
We are advancing humanity!
So piracy is legal now.
For corporations.
Rank-and-file citizens have no new (or restored) rights.
Piracy was always defined as sharing and not downloading iirc
Where I am you can download whatever as long as you don’t share
Isn’t the download a share?
Sharing is letting someone download stuff from you. And I am not even sure if here you aren’t actually allowed to share privately to few people
So the person you are downloading from is breaking the law but you are not. Odd law but I envy your situation.
I know you are all jelly
I don’t trust that the lawyers weren’t bribed.
Look at the Men At Work plagiarism case regarding Land Down Under. That was a transformative work yet the band lost the case to the estate of a nursery rhyme.
Isn’t this the argument for remixing? If they use pieces of work from other sources, but recombine them in novel ways, it is original? I would say this is a win, but I have a feeling a typical artist will not be afforded the same defense.
Unlike a cover, a remix differs in that it uses the original recording, and is not just a re-creation of it.
This means to release a remix legally, you must seek copyright permission from the original artist or band who created the song or the sample you want to remix or reuse.
you must seek copyright permission from the original artist or band who created the song or the sample you want to remix or reuse.
this is false. people do it out of politeness not because its required.
edit: I might be mistaken for sampling/reuse i was thinking of parodys
Sampling yes, remixing no. Remixes require permission, in sampling you could ask permission but tbh the OG artist probably stole the sample too.
I’m pretty sure remixes still need permission. I could be wrong but I know covers definitely need to seek legal permission from the copyright holder before they can use the song.