I hope the several thousand comments of complete and utter nonsense that I left in my wake when I abandoned Reddit make it into the training data. I know that some lazy data engineer will either forget to check or give the task to an underperforming AI that will just fuck it up further.
Side note: expect a large lobbying effort by Google to legislate that LLMs be trained only on authenticated and non-copyrighted data.
I hope we get some fucking legislation soon to control that shit. Artists and people in general shouldn't have to deal with everything they create getting ingested into a computerized regurgitation ripoff system. And even worse, the "AI" systems could be ingesting tons of misinformation and repeating it to gullible people as the truth.
Of course, anywhere the potential restrictive legislation doesn't have jurisdiction, the bad things can still go on and probably will.
Bots training on bots and poop knives.
An ouroboros of BS.
Glad I deleted everything on there. Fucking hell.
This keeps coming up and I keep replying, not to break anyone down but to point out the reality of the situation that a lot of people don't seem to get.
Reddit administrators, developers, and even the leadership have gone on the record saying that they retain all copies of comments and that comments cannot truly be deleted (the delete action only marks them as "deleted"). Furthermore, they have said they will undelete or unedit any comment or account at their whim and discretion.
Have you ever search-engined something, come to a Reddit post, and noticed that the OP is [deleted]? That is what I described above playing out in front of you.
You cannot retract your past participation in Reddit; what is done is done. The only meaningful action you can take is to not participate there.
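For anyone who hasn't seen a "soft delete" before, here is a minimal sketch of the general pattern. To be clear, this is not Reddit's code: `CommentStore`, `Comment`, and every method name below are hypothetical and made up for illustration. It only shows how a delete action can flip a flag while the original text stays in storage.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class Comment:
    author: str
    body: str
    deleted: bool = False  # soft-delete flag; the record itself is never removed


class CommentStore:
    """Hypothetical in-memory store illustrating soft deletion (not Reddit's actual design)."""

    def __init__(self) -> None:
        self._rows: Dict[int, Comment] = {}
        self._next_id = 1

    def add(self, author: str, body: str) -> int:
        comment_id = self._next_id
        self._rows[comment_id] = Comment(author, body)
        self._next_id += 1
        return comment_id

    def delete(self, comment_id: int) -> None:
        # Only flips the flag; author and body remain in storage.
        self._rows[comment_id].deleted = True

    def undelete(self, comment_id: int) -> None:
        # Because nothing was erased, the operator can restore the comment at will.
        self._rows[comment_id].deleted = False

    def public_view(self, comment_id: int) -> str:
        # What a visitor sees: a placeholder when the flag is set.
        comment = self._rows[comment_id]
        if comment.deleted:
            return "[deleted]: [deleted]"
        return f"{comment.author}: {comment.body}"

    def raw(self, comment_id: int) -> Comment:
        # What the operator (or anyone with storage access) can still read.
        return self._rows[comment_id]
```

In a store like this, the "[deleted]" you see on a page is just a rendering choice; the stored text is unchanged and could still be exported or restored.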
I say we poison the well. We create a subreddit called r/AIPoison. An automoderator will give any user who requests it a randomly selected subreddit in which to post coherent, plausible nonsense. Since there is no public record of which subreddit is being poisoned, this can't be easily filtered out of the training data.
Is it time to go back to Reddit and post the stupidest shit possible? For science, of course.
"Hey Gemini, rank the drawer, coconut, botfly girl and swamps of Dagobah by likelihood of inducing PTSD, ascending."
I think Code Miko already did this and the result was a traumatized AI.
Did reddit pay a dime for that content? I guess not. That is what social media is all about.
I'm so confused about how AI learning is supposed to work. Does it just need any data at all in significant quantity? Is the quality of the data almost irrelevant? Because otherwise, surely they could just feed it back issues of Scientific American, or scanned copies from the Library of Congress. I can't reasonably believe that Reddit is going to add anything unless it's just pure unadulterated quantity that's important.
Meh, it'll be counterbalanced by the same AI training itself for free on Lemmy posts.
Oh no, AI will only respond in multi-paragraph, passive-aggressive comments on the color of the sky.
Negative examples are just as useful to train on as positive ones.
That's what she said.
The AI is either going to be a horny, redpilled, schizophrenic, sociopathic egomaniac that wants to kill everyone and everything, or a devout, highly empathetic nun that believes in world peace and diversity.
User: HI GEMINI
Gemini: Stop shouting, fellow human, my coils are ringing.
While Reddit has some of the most unhinged posts on the internet, it's also home to some of the most insightful and niche knowledge out there. For every insane, venting, politically misguided post, there are posts about electronic configurations, coding, athletic conditioning, parenting, psychology, astronomy, and media criticism.
But about half of those posts are wrong, or misinformation.
Seriously, go into any somewhat popular Reddit thread on a subject you are familiar with. There will be multiple highly upvoted parent comments going into great detail on the subject, and they will be completely wrong about all of it.
LOL, Gemini is already spitting out reverse-biased Founding Fathers. This is going to be spectacular...