this post was submitted on 07 Aug 2024

Can AI even be open source? It's complicated (miniza.pages.dev)

submitted 10 months ago by marvelous_coyote@lemm.ee to c/opensource@lemmy.ml

[–] cmnybo@discuss.tchncs.de 35 points 10 months ago (9 children)

It's rather hard to open source the model when you trained it off a bunch of copyrighted content that you didn't have permission to use.

[–] flamingmongoose@lemmy.blahaj.zone 4 points 10 months ago (1 children)

BERT and early versions of GPT were trained on copyright free datasets like Wikipedia and out-of-copyright books. I'm unsure whether those would be big enough for modern ChatGPT-style models.

[–] chebra@mstdn.io 2 points 10 months ago (1 children)

@flamingmongoose @cmnybo

> copyright free datasets like Wikipedia

🤦‍♂️

[–] flamingmongoose@lemmy.blahaj.zone 1 points 10 months ago

What's up with that? I appreciate that they're permissively licensed rather than copyright free as such.
