this post was submitted on 18 May 2025
76 points (98.7% liked)
LocalLLaMA
3013 readers
10 users here now
Welcome to LocalLLaMA! Here we discuss running and developing machine learning models at home. Lets explore cutting edge open source neural network technology together.
Get support from the community! Ask questions, share prompts, discuss benchmarks, get hyped at the latest and greatest model releases! Enjoy talking about our awesome hobby.
As ambassadors of the self-hosting machine learning community, we strive to support each other and share our enthusiasm in a positive constructive way.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
This also means anyone wanting to mess around and subvert society can create a whole corpus of disinformation and put it out for the LLM spiders to pick up.
They're just sucking up and ingesting whatever's out there unquestioningly, with little regard to its veracity. For the record, I think this is a BAD idea. Then again...
I don't see how this comment is related to the content of this article. This is a bunch of information about how LLMs work under the hood, it has nothing to do with how they're supposedly "sucking up and ingesting whatever's out there unquestioningly." I don't see anything about LLM training mentioned here, it's about how they function once they have been trained.
Theres a very vocal subset of the ai-hater Lemmy population that thinks
Theres plenty of models trained on completely open public domain information and released under a permissive license. This isnt the era of tayAI twitter garbage fed sloppo models anymore. All the newest models are trained on 90% synthetic data, 10% RFHL done by contracted out educators with degrees making a quick buck through easy remote work.
But that doesnt matter to the emotionally and political charged Lemmy leftist with liberal arts degrees who dont care to understand the realities behind machine learning.
No, the modern AI bubble begins and ends for them with their art being stolen by facebook/meta without so much as a handslap by the govt then having stablediffusion rubbed in their face automation threatening their livelyhood by smug greedy tech bros without a shred of respect for human creativity.
So in retaliation, the Lemmings throw tantrums in the comments of all ai gen post babbling about how the newest batch of didital computer tools to cut down manual work is destroying everything, and clutch on to the venence fantasy they can still 'poison the AI that stole my work!' By saying the magic words like a SCP cognitohazard.
The reality is the only one still scraping your slop is ad sellers and big brother, while the only human data being fed into modern chatgpt is from someone with an associates degree in an academic field.
Ive chosen to allow the comment to stay in this scenario as I dont believe in censorship especially if the post isnt against stated guidelines. I am against fostering echo chambers.
However, c/localllama was always intended to be a small island of safe space for ML enthusiast to talk and share the hobby in a positive construtive way without fear of being attacked/shit on by the general Lemmy population who just dont get what we do here except that we support 'AI'. Haters who dont understand can go to literally any other community to circlejerk without pushback, I think a few fuckAI communities exist just for that purpose. So If these kind of cloak-and-dagger wink wink nudge nudge antagonistic comments about 'poisoning teh AI!' become more common I'll update guidelines and start enforcing them appropriately.