LocalLLaMA

4583 readers

73 users here now

Welcome to LocalLLaMA! Here we discuss running and developing machine learning models at home. Lets explore cutting edge open source neural network technology together.

Get support from the community! Ask questions, share prompts, discuss benchmarks, get hyped at the latest and greatest model releases! Enjoy talking about our awesome hobby.

As ambassadors of the self-hosting machine learning community, we strive to support each other and share our enthusiasm in a positive constructive way.

Rules:

Rule 1 - No harassment or personal character attacks of community members. I.E no namecalling, no generalizing entire groups of people that make up our community, no baseless personal insults.

Rule 2 - No comparing artificial intelligence/machine learning models to cryptocurrency. I.E no comparing the usefulness of models to that of NFTs, no comparing the resource usage required to train a model is anything close to maintaining a blockchain/ mining for crypto, no implying its just a fad/bubble that will leave people with nothing of value when it burst.

Rule 3 - No comparing artificial intelligence/machine learning to simple text prediction algorithms. I.E statements such as "llms are basically just simple text predictions like what your phone keyboard autocorrect uses, and they're still using the same algorithms since <over 10 years ago>.

Rule 4 - No implying that models are devoid of purpose or potential for enriching peoples lives.

founded 2 years ago

MODERATORS

pax@sh.itjust.works

noneabove1182@sh.itjust.works

Smokeydope@lemmy.world

MonsterBug@sh.itjust.works

Anthropic's 'On the Biology of a LLM' got a massive update: Features fascinating deep dives into how models process information behind the scenes (transformer-circuits.pub)

submitted 11 months ago by Smokeydope@lemmy.world to c/localllama@sh.itjust.works

8 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] fubarx@lemmy.world 2 points 11 months ago (1 children)

This also means anyone wanting to mess around and subvert society can create a whole corpus of disinformation and put it out for the LLM spiders to pick up.

They're just sucking up and ingesting whatever's out there unquestioningly, with little regard to its veracity. For the record, I think this is a BAD idea. Then again...

[–] FaceDeer@fedia.io 6 points 11 months ago (1 children)

I don't see how this comment is related to the content of this article. This is a bunch of information about how LLMs work under the hood, it has nothing to do with how they're supposedly "sucking up and ingesting whatever's out there unquestioningly." I don't see anything about LLM training mentioned here, it's about how they function once they have been trained.

[–] Smokeydope@lemmy.world 7 points 11 months ago* (last edited 11 months ago)

Theres a very vocal subset of the ai-hater Lemmy population that thinks

the only machine learning models are ones made by mgacorporations like facebook ans openai using stolen internet data
model creators in 2025 are still using stolen scraped unfiltered internet data for training datasets

Theres plenty of models trained on completely open public domain information and released under a permissive license. This isnt the era of tayAI twitter garbage fed sloppo models anymore. All the newest models are trained on 90% synthetic data, 10% RFHL done by contracted out educators with degrees making a quick buck through easy remote work.

But that doesnt matter to the emotionally and political charged Lemmy leftist with liberal arts degrees who dont care to understand the realities behind machine learning.

No, the modern AI bubble begins and ends for them with their art being stolen by facebook/meta without so much as a handslap by the govt then having stablediffusion rubbed in their face automation threatening their livelyhood by smug greedy tech bros without a shred of respect for human creativity.

So in retaliation, the Lemmings throw tantrums in the comments of all ai gen post babbling about how the newest batch of didital computer tools to cut down manual work is destroying everything, and clutch on to the venence fantasy they can still 'poison the AI that stole my work!' By saying the magic words like a SCP cognitohazard.

The reality is the only one still scraping your slop is ad sellers and big brother, while the only human data being fed into modern chatgpt is from someone with an associates degree in an academic field.

Ive chosen to allow the comment to stay in this scenario as I dont believe in censorship especially if the post isnt against stated guidelines. I am against fostering echo chambers.

However, c/localllama was always intended to be a small island of safe space for ML enthusiast to talk and share the hobby in a positive construtive way without fear of being attacked/shit on by the general Lemmy population who just dont get what we do here except that we support 'AI'. Haters who dont understand can go to literally any other community to circlejerk without pushback, I think a few fuckAI communities exist just for that purpose. So If these kind of cloak-and-dagger wink wink nudge nudge antagonistic comments about 'poisoning teh AI!' become more common I'll update guidelines and start enforcing them appropriately.