Meta's latest legal wheeze is to insist that pirating books is fair use, actually. And it might be working.

artifex@piefed.social · 7 hours ago


Archangel1313@lemmy.ca · 3 hours ago

I absolutely love the fact that all these companies are laying the legal groundwork to destroy intellectual property rights altogether. If they win enough of these cases, then every pirate on the open seas sails under a flag of amnesty.

artifex@piefed.social · 3 hours ago

No, I expect they’ll be more like “rules for thee but not for me”


TheObviousSolution@lemmy.ca · edit-2 2 hours ago

So we can pirate books too, as long as we aren’t able to reproduce them verbatim from memory?

Judge Vince Chhabria either accepts whatever bribes and offers he’s probably getting and sides with Meta, or it will eventually go on to the Supreme Court, where they most definitely will. That’s the part of this that will work the most under an administration of no accountability.

InternetCitizen2@lemmy.world · 7 minutes ago

Tell the judge you are training a neural network… it just happens to also be you.

melfie@lemy.lol · edit-2 4 hours ago

Looking forward to Jellyfin getting an LLM to train locally on movie preferences so everyone’s library is fair use. Wait, is this why LLMs are being shoehorned into everything? 🤔

PointyFluff@lemmy.ml · 2 hours ago

did they have a library card? if so, then fuck off.

Goodlucksil@lemmy.dbzer0.com · 7 hours ago

Classic “the end justifies the means” (bad) defense. If ISPs can send letters for torrenting, and Facebook torrented a lot, Facebook deserves a fair punishment.

Archer@lemmy.world · 43 minutes ago

lol it would be hilarious if they could order Facebook disconnected from the Internet like a pleb hit with a copyright complaint

GameOverFlow@lemmy.zip · 7 hours ago

Not deserves, needs.

☂️-@lemmy.ml · 5 hours ago

sure. thanks meta, anna’s archive will help me with my reading list, thanks.

rc__buggy@sh.itjust.works · 5 hours ago

We can train our NI (Natural Intelligence) models.

ClownStatue@piefed.social · 59 minutes ago

To demand shrubberies?

HaunchesTV@feddit.uk · 3 hours ago

Just spitballing…

If you were to train a model on just one book, as long as you don’t prompt it to create an exact copy (maybe just some indiscernible differences) then presumably that’s fair use.

Then, since we know AI generated work can’t be copyrighted, does that essentially create a copyright-free version of the text which can be freely distributed?

ArbitraryValue@sh.itjust.works · 6 hours ago

We’re going to end up in a situation where whatever is necessary to train AI is permitted, and the main question is whether that will be through (re)interpretation of existing law or the passage of a new law.

ctrl_alt_esc@lemmy.ml · 6 hours ago

Good thing I have a local model running that’s constantly learning, for precisely this reason

panda_abyss@lemmy.ca · 5 hours ago

I’m still collecting media before I can start the training process.

XLE@piefed.social · 3 hours ago

If anything, this is proof you should be next in line for a large venture capital infusion!

frustrated_phagocytosis@fedia.io · 5 hours ago

As long as they cannot copyright what they generate from using the pirated materials

ryathal@sh.itjust.works · 6 hours ago

Arguing that training models isn’t fair use is going to be a massive uphill battle; it’s basically reading the book but with a computer. It’s not actually a big deal to people, unless you hold the copyright to a ton of works and want to get a percentage of all the AI income these companies have made.

Torrenting the books is very likely copyright infringement, but that has a relatively low payout compared to the money these companies are getting for their models. The training being fair use means that rights holders can’t try to take any money from the model’s use. The statutory limits for infringement, even at per-work levels, aren’t significant compared to the legal cost of proving it happened.

OfCourseNot@fedia.io · 5 hours ago

There’s an argument to be made that it is, in fact, not ‘reading’. The training of the model could be considered a lossy compression of the data. And streaming movies in a lossy compression format is not fair use, is it?

ryathal@sh.itjust.works · 3 hours ago

The model doesn’t stream out anyone’s content though. The article mentions that the plaintiffs have provided no examples of a prompt that creates anything substantial.

Streaming a lossy compression would generally be infringement, but there is definitely a point where it becomes not infringement if it’s lossy enough.

What a model generally stores is factual information that isn’t copyrightable in the first place. It’s storing word counts, sentence lengths, sentiment analysis, and so on.

Fatal@piefed.social · 4 hours ago

It’s not the storage of the information that matters as much as the presentation. Google’s search index stores a huge amount of copyrighted material, even losslessly. But they only present small snippets at a time, which is not considered copyright infringement. The question really is whether or not the information being presented by the models is in a format which is considered copyright infringement. So far, courts have not found that it is.

Grimy@lemmy.world · edit-2 7 hours ago

They didn’t say seeding is fair use, just that it’s inherently part of torrenting. Good thing Sarah Silverman has PC Gamer there to pander for her.