this post was submitted on 24 Jun 2025
598 points (98.7% liked)

Technology

72319 readers
2522 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] vane@lemmy.world 21 points 1 week ago* (last edited 1 week ago) (2 children)

Ok so you can buy books scan them or ebooks and use for AI training but you can't just download priated books from internet to train AI. Did I understood that correctly ?

[–] forkDestroyer 5 points 1 week ago (3 children)

Make an AI that is trained on the books.

Tell it to tell you a story for one of the books.

Read the story without paying for it.

The law says this is ok now, right?

[–] nednobbins@lemmy.zip 7 points 1 week ago

Sort of.

If you violated laws in obtaining the book (eg stole or downloaded it without permission) it's illegal and you've already violated the law, no matter what you do after that.

If you obtain the book legally you can do whatever you want with that book, by the first sale doctrine. If you want to redistribute the book, you need the proper license. You don't need any licensing to create a derivative work. That work has to be "sufficiently transformed" in order to pass.

[–] Enkimaru@lemmy.world 4 points 1 week ago

The LLM is not repeating the same book. The owner of the LLM has the exact same rights to do with what his LLM is reading, as you have to do with what ever YOU are reading.

As long as it is not a verbatim recitation, it is completely okay.

According to story telling theory: there are only roughly 15 different story types anyway.

[–] booly@sh.itjust.works 1 points 1 week ago

The law says this is ok now, right?

No.

The judge accepted the fact that Anthropic prevents users from obtaining the underlying copyrighted text through interaction with its LLM, and that there are safeguards in the software that prevent a user from being able to get an entire copyrighted work out of that LLM. It discusses the Google Books arrangement, where the books are scanned in the entirety, but where a user searching in Google Books can't actually retrieve more than a few snippets from any given book.

Anthropic get to keep the copy of the entire book. It doesn't get to transmit the contents of that book to someone else, even through the LLM service.

The judge also explicitly stated that if the authors can put together evidence that it is possible for a user to retrieve their entire copyrighted work out of the LLM, they'd have a different case and could sue over it at that time.

[–] nednobbins@lemmy.zip 3 points 1 week ago

That's my understanding too. If you obtained them legally, you can use them the same way anyone else who obtained them legally could use them.