Meta allegedly torrented over 81TB worth of pirated books to train its AI models

Current Events

Current Events » Meta allegedly torrented over 81TB worth of pirated books to train its AI models
https://arstechnica.com/tech-policy/2025/02/meta-torrented-over-81-7tb-of-pirated-books-to-train-ai-authors-say/

remember kids, piracy is bad
https://i.imgur.com/TGkNCva.gif https://i.imgur.com/8mWCvA4.gif
How do you know that Zuckerbeg didnt just get them a library card?
Moustache twirling villain
https://i.imgur.com/U3lt3H4.jpg- Kerbey
It is well known that all the LLMs are based on massive theft. Its why people laughed so hard about OpenAI crying about DeepSeek "copying" their data or whatever. Current AI/LLMs would not exist if the creators had been stopped from stealing mountains of data to train on. They could probably have done it legitimately with enough time but clearly they didn't want to take the time.
They're not music or movies, so nobody's going to care.
Meta also allegedly modified settings "so that the smallest amount of seeding possible could occur," a Meta executive in charge of project management, Michael Clark, said in a deposition.
And to think, they didn't even seed!
https://i.imgur.com/TGkNCva.gif https://i.imgur.com/8mWCvA4.gif
FL81 posted...
And to think, they didn't even seed!
Those selfish bastards
https://imgur.com/u2HR4nG
TMOG posted...
They're not music or movies, so nobody's going to care.

Textbook companies sure care when you pirate their mandatory $400 book thats slightly different from last years edition.
The difference between fiction and reality is that fiction has to make sense.
Hejiru posted...
Textbook companies sure care when you pirate their mandatory $400 book thats slightly different from last years edition.
*meanwhile Facebook downloads literal millions of textbooks*
https://i.imgur.com/TGkNCva.gif https://i.imgur.com/8mWCvA4.gif
Post #9 was unavailable or deleted.
81TB? Pfft. I have a system at work that generates 81TB of data in a single month. Git gud, Zuckyboy.
If you treat people as equals, they start to think they ARE your equals.
http://www.gamefaqs.com/boards/1005-warhammer-40k
Hejiru posted...
Textbook companies sure care when you pirate their mandatory $400 book thats slightly different from last years edition.

Lmao I remember people sharing pdfs of textbooks back when I was in college. I think even some professors looked the other way.
MabinogiFan posted...
Lmao I remember people sharing pdfs of textbooks back when I was in college. I think even some professors looked the other way.
I pirated my Ethics textbook and I shared the link with all in my class who asked.

Zero.

Regrets.
He/Him http://guidesmedia.ign.com/guides/9846/images/slowpoke.gif https://i.imgur.com/M8h2ATe.png
https://i.imgur.com/6ezFwG1.png
Storm_Shadow posted...
81TB? Pfft. I have a system at work that generates 81TB of data in a single month. Git gud, Zuckyboy.
For what it's worth, the average eBook is, like, 500 KB or something lol
https://i.imgur.com/TGkNCva.gif https://i.imgur.com/8mWCvA4.gif
Storm_Shadow posted...
81TB? Pfft. I have a system at work that generates 81TB of data in a single month. Git gud, Zuckyboy.
Yeah but is that data worth anything?
DrizztLink posted...
I pirated my Ethics textbook and I shared the link with all in my class who asked.

Zero.

Regrets.
Being able to ctrlF a textbook is a godly ability to have, saving money aside

did that every chance I could
You haven't set a signature for the message boards yet
Those are rookie numbers
Lancool II | Z690 Tomahawk | 12700K | Fuma 2 | RTX 3070Ti | 32GB 3600MHz | FireCuda 530 1TB | Inland NVMe 1TB | P3 Plus 4TB | RM750x
FL81 posted...
And to think, they didn't even seed!
Lol
"May the Father of Understanding guide you."
SomeUsername529 posted...
It is well known that all the LLMs are based on massive theft. Its why people laughed so hard about OpenAI crying about DeepSeek "copying" their data or whatever. Current AI/LLMs would not exist if the creators had been stopped from stealing mountains of data to train on. They could probably have done it legitimately with enough time but clearly they didn't want to take the time.


No fair! We stole it first!

I've seen "content producers" cry similarly.
http://youtube.com/TheYoungTurks/videos
http://youtube.com/SamSeder/videos http://RightWingWatch.org http://reddit.com/r/BreadTube http://fb.me/OccupyDemocrats
Omg they use the sites I occasionally visit
I'm a Taurus. Currently playing: Oldschool Runescape & DMC 3 Ubisoft Port. He/Him
Post #20 was unavailable or deleted.
Meanwhile the founder of Reddit who tried to pirate books from a archive and spread the knowledge to general public got arrested or something like that
HERRO EvryBody. HOW ARE YOU? FINE OKAY!
Homeless_Waifu posted...
Meanwhile the founder of Reddit who tried to pirate books from a archive and spread the knowledge to general public got arrested or something like that
what happened to Aaron Swartz was really fucked up
https://i.imgur.com/TGkNCva.gif https://i.imgur.com/8mWCvA4.gif
Current Events » Meta allegedly torrented over 81TB worth of pirated books to train its AI models