this post was submitted on 09 Jan 2024
484 points (98.4% liked)

Technology

72471 readers
3315 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

‘Impossible’ to create AI tools like ChatGPT without copyrighted material, OpenAI says::Pressure grows on artificial intelligence firms over the content used to train their products

(page 2) 50 comments
sorted by: hot top controversial new old
[–] ugjka@lemmy.world 8 points 2 years ago

TBH I only use LLMs when traditional search fails and even then I'm not sure if I'm getting something useful or hallucination. I need better search engines not fancy AI bullshitters

[–] Daxtron2@startrek.website 8 points 2 years ago

I've learned from lemmy that individual's abuse of copyright is good👍

LLMs trained on copyrighted material and suddenly everyone is an advocate for more strict copyright enforcement?

[–] baseless_discourse@mander.xyz 5 points 2 years ago

Yeah, I also have no way to own a billion dollar. Sucks for both of us...

[–] pacology@lemmy.world 5 points 2 years ago

We’ll, strictly speaking you could have an AI that only knows about the world up to 1928 and talks like it’s 1928.

[–] autotldr@lemmings.world 5 points 2 years ago

This is the best summary I could come up with:


The developer OpenAI has said it would be impossible to create tools like its groundbreaking chatbot ChatGPT without access to copyrighted material, as pressure grows on artificial intelligence firms over the content used to train their products.

Chatbots such as ChatGPT and image generators like Stable Diffusion are “trained” on a vast trove of data taken from the internet, with much of it covered by copyright – a legal protection against someone’s work being used without permission.

AI companies’ defence of using copyrighted material tends to lean on the legal doctrine of “fair use”, which allows use of content in certain circumstances without seeking the owner’s permission.

John Grisham, Jodi Picoult and George RR Martin were among 17 authors who sued OpenAI in September alleging “systematic theft on a mass scale”.

Getty Images, which owns one of the largest photo libraries in the world, is suing the creator of Stable Diffusion, Stability AI, in the US and in England and Wales for alleged copyright breaches.

The submission said it backed “red-teaming” of AI systems, where third-party researchers test the safety of a product by emulating the behaviour of rogue actors.


The original article contains 530 words, the summary contains 190 words. Saved 64%. I'm a bot and I'm open source!

[–] randon31415@lemmy.world 5 points 2 years ago (1 children)

I wonder if the act of picking cotton was copyrighted, would we had got the cotton gin? We have automated most non-creative pursues and displaced their workers. Is it because people can take joy out of creative pursues that we balk at the automation? If you have a particular style in picking items to fulfill Amazon orders, should that be copyrighted and protected from being used elsewhere?

[–] MaxVoltage@lemmy.world 4 points 2 years ago* (last edited 2 years ago) (1 children)

Bro the cotton gin literally led to millions of black slaves because now it was profitable. Worst example possible

i literally coughed i laughed so hard

load more comments (1 replies)
[–] PeterPoopshit@lemmy.world 4 points 2 years ago* (last edited 2 years ago)

My hot take is that it's not like most of those independent artists are getting compensated fairly by the companies that own them anyway if at all. Stealing ai training content is just stealing from corporations. Corporations who are probably politically fighting to keep things worse for the average person in your country.

Theft is "a crime" but I never saw anyone complaining about how unfair it was all those times I myself got fucked over by google bullshitting their way out of giving me my ad revenue. If normal people can't profit from stuff like this, we shouldn't be doing anything to protect the profits of evil corporations.

[–] ChrislyBear@lemmy.world 3 points 2 years ago (19 children)

So if I look at a painting study it and then emulate the original painter's artstyle, then I'm in breach of their copyright?

Or if I read a lot of fantasy like GRRM or JK Rowling and I also write a fantasy book and say, that they were my Inspiration, I'm breaching their copyright??

That's not how it works, and if it is, it shouldn't be!

Sure, if a start reproducing work, i.e. plagiarizing the work of others, then I'm doing sth wrong.

And to spin this further: If I raise a child on children's books by a specific author, am I breaching copyright, when my child enters the workforce and starts to earn money???? Stupid, yes! But so are the copyright claims against LLMs, in my opinion.

load more comments (19 replies)
[–] IzzyScissor@lemmy.world 3 points 2 years ago (1 children)

Help Help! My business model is illegal, but it makes SO MUCH money! What do I doooo?

load more comments (1 replies)
[–] charonn0@startrek.website 2 points 2 years ago

Sounds like a fatal problem. That's a shame.

load more comments
view more: ‹ prev next ›