overview for diz

AI coders think they’re 20% faster — but they’re actually 19% slower in c/techtakes@awful.systems

[–] diz@awful.systems 6 points 3 months ago

I think if people are citing in another 3 months time, they’ll be making a mistake

In 3 months they'll think they're 40% faster while being 38% slower. And sometime in 2026 they will be exactly 100% slower - the moment referred to as "technological singularity".

Microsoft lays off the staff who make the money to fund AI that doesn’t in c/techtakes@awful.systems

[–] diz@awful.systems 5 points 3 months ago (2 children)

That philosophy always ends in stepping into dogshit to try to boost stock prices.

How to pass an AI coding benchmark: train on the questions in c/techtakes@awful.systems

[–] diz@awful.systems 7 points 3 months ago* (last edited 3 months ago)

When they tested on bugs not in SWE-Bench, the success rate dropped to 57‑71% on random items, and 50‑68% on fresh issues created after the benchmark snapshot. I’m surprised they did that well.

After the benchmark snapshot. That could still be before the LLM training data cutoff, or available via RAG.

edit: For a fair test you have to use git issues that have not yet been resolved by a human.

This is how these fuckers talk, all of the time. Also see Sam Altman's not-quite-denials of training on Scarlett Johansson's voice: they just asserted that they had hired a voice actor, but didn't deny training on actual Scarlett Johansson's voice. edit: because anyone with half a brain knows that not only did they train on her actual voice, they probably gave it and their other pirated movie soundtracks massively higher weighting, just as they did for books and NYT articles.

Anyhow, I fully expect that by now they just use everything they can to cheat benchmarks, up to and including RAG from solutions past the training dataset cutoff date. With two of the paper authors being from Microsoft itself, expect that their "fresh issues" are gamed too.
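
A minimal sketch of the selection rule from the edit above, for anyone who wants it spelled out. The `Issue` record, the field names, and all dates are hypothetical and only illustrate the point: "created after the benchmark snapshot" does not rule out leakage, while "not yet resolved by a human" does.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class Issue:
    """Hypothetical issue record; the field names are illustrative only."""
    issue_id: str
    created: date
    resolved: Optional[date]  # None means no human fix exists yet


def created_after_snapshot(issue: Issue, snapshot: date) -> bool:
    # The weaker criterion: the issue is merely newer than the benchmark snapshot.
    return issue.created > snapshot


def fair_test_set(issues: List[Issue]) -> List[Issue]:
    # The stricter criterion: keep only issues with no human resolution,
    # so no fix can sit in training data or be fetched via retrieval.
    return [issue for issue in issues if issue.resolved is None]


# Made-up dates, purely for illustration.
snapshot = date(2023, 10, 1)
issues = [
    Issue("old-and-fixed", date(2023, 5, 1), date(2023, 6, 1)),
    Issue("fresh-but-fixed", date(2024, 1, 10), date(2024, 2, 1)),
    Issue("fresh-and-open", date(2024, 3, 5), None),
]

fresh = [i.issue_id for i in issues if created_after_snapshot(i, snapshot)]
fair = [i.issue_id for i in fair_test_set(issues)]
print("fresh by snapshot date:", fresh)
print("fair test set:", fair)
```

The "fresh-but-fixed" issue passes the snapshot check even though its fix already exists, which is exactly the gap described above.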

How to pass an AI coding benchmark: train on the questions in c/techtakes@awful.systems

[–] diz@awful.systems 7 points 3 months ago

Yeah, I'm thinking that people who think their brains work like an LLM may be somewhat correct. Still wrong in some ways, as even their brains learn from several orders of magnitude less data than LLMs do, but close enough.

Google Veo 3 fails, week 2: fail harder — Shokunin Studio spins the gacha in c/techtakes@awful.systems

[–] diz@awful.systems 6 points 3 months ago* (last edited 3 months ago) (1 children)

You can film with an actual camera then use video to video to make it look very AI. If you're just grifting, that would be the way to go I think.

‘AI is no longer optional’ — Microsoft admits AI doesn’t help at work in c/techtakes@awful.systems

[–] diz@awful.systems 7 points 3 months ago (1 children)

They're also very gleeful about finally having one-upped the experts with one weird trick.

Up until AI they were the people who were inept and late at adopting new technology, and now they get to feel that they're ahead (because this time the new half-assed technology was pushed onto them and they didn't figure out they needed to opt out).

‘AI is no longer optional’ — Microsoft admits AI doesn’t help at work in c/techtakes@awful.systems

[–] diz@awful.systems 16 points 3 months ago* (last edited 3 months ago)

I was writing some math code and, not being an idiot, I'm using an open source math library for something called "QR decomposition". It's efficient, it supports sparse matrices (matrices where many numbers are 0), etc.

Just out of curiosity I checked where some idiot vibecoder would end up. AI simply plagiarizes from some shit sample snippets which exist purely to teach people what QR decomposition is. It's actually unusable, due to being numerically unstable.
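
To make "numerically unstable" concrete, here is a minimal sketch. It assumes the teaching snippets in question look like classical Gram-Schmidt, which is a guess, since the comment does not name the algorithm. NumPy's `np.linalg.qr` stands in for "a library" here; it is dense-only and is not the library the comment refers to. The test matrix (a Läuchli-style matrix with `eps = 1e-8`) is likewise chosen just for illustration.

```python
import numpy as np


def classical_gram_schmidt(A):
    """Textbook QR via classical Gram-Schmidt (assumed stand-in for a teaching snippet)."""
    A = np.asarray(A, dtype=float)
    m, n = A.shape
    Q = np.zeros((m, n))
    R = np.zeros((n, n))
    for j in range(n):
        v = A[:, j].copy()
        for i in range(j):
            # Classical variant: project the ORIGINAL column onto each earlier q_i.
            R[i, j] = Q[:, i] @ A[:, j]
            v -= R[i, j] * Q[:, i]
        R[j, j] = np.linalg.norm(v)
        Q[:, j] = v / R[j, j]
    return Q, R


def orthogonality_error(Q):
    """Frobenius norm of Q^T Q - I; zero for perfectly orthonormal columns."""
    return np.linalg.norm(Q.T @ Q - np.eye(Q.shape[1]))


def lauchli(eps=1e-8):
    """Nearly rank-deficient test matrix (illustrative choice)."""
    return np.array(
        [[1.0, 1.0, 1.0],
         [eps, 0.0, 0.0],
         [0.0, eps, 0.0],
         [0.0, 0.0, eps]]
    )


A = lauchli()
Q_cgs, R_cgs = classical_gram_schmidt(A)
Q_lib, R_lib = np.linalg.qr(A)  # Householder-based LAPACK routine

cgs_error = orthogonality_error(Q_cgs)
lib_error = orthogonality_error(Q_lib)
print("classical Gram-Schmidt orthogonality error:", cgs_error)
print("np.linalg.qr orthogonality error:", lib_error)
```

On this input the textbook version returns a "Q" whose columns are far from orthogonal, while the library result is orthogonal to near machine precision. Both still multiply back to `A`, which is why the problem is easy to miss.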

Who in the fuck even needs this shit to be plagiarized, anyway?

It can't plagiarize a production quality implementation, because you can count those on the fingers of one hand, they're complex as fuck and you can't just blend a few together to try to pretend you didn't plagiarize.

The answer is, people who are peddling the AI. They are the ones who ordered plagiarism with extra plagiarism on top. These are not coding tools, these are demos to convince the investors to buy the actual product, which is the company's stock. There's a little bit of tool functionality (you can ask them to refactor the code), but it's just you misusing a demo to try to get some value out of it.

And to that end, the demos take every opportunity to plagiarize something, and to talk about how the "AI" wrote the code from scratch based on its supposed understanding of fairly advanced math.

And in coding, it is counterproductive to plagiarize. Many of the open source libraries can be used in commercial projects. You get upstream fixes for free. You don't end up with some bugs or, worse yet, security exploits that may have been fixed since the training cutoff date.

No fucking one in their right mind would willingly want their product to contain copy-pasted snippets from stale open source libraries, passed through some sort of variable-renaming copyright laundering machine.

Except of course the business idiots who are in charge of software at major companies, who don't understand software. Who just failed upwards.

They look at plagiarized lines and count them as improved productivity.

‘AI is no longer optional’ — Microsoft admits AI doesn’t help at work in c/techtakes@awful.systems

[–] diz@awful.systems 6 points 3 months ago

Indistinguishable from a business idiot.

Study: Meta AI Model Can Reproduce Almost Half of Harry Potter Book in c/fuck_ai@lemmy.world

[–] diz@awful.systems 2 points 3 months ago* (last edited 3 months ago)

It's also interesting that this is the most conservative, pro-"it's not just memorizing" estimate possible: they multiplied the probabilities of consecutive tokens. Basically it means that if it starts shitting out a quote, it will not be able to stop quoting until their anti-copy-the-whole-book fine-tuning kicks in after 50 words or so.

It can probably output far more under a realistic test (always picking the top token, temperature = 0).
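
A toy illustration of why the product of token probabilities is the conservative number. All probabilities below are made up, and `greedy_reproduces` is a hypothetical helper, not anything from the study: it just checks whether the "true" next token is the top choice at every step, which is what temperature 0 decoding would follow.

```python
import math


def sequence_probability(token_probs):
    """Chance of sampling the exact passage: product of per-token probabilities."""
    return math.prod(token_probs)


def greedy_reproduces(token_probs, runner_up_probs):
    """True if the quoted token beats the best alternative at every position,
    i.e. always picking the top token would emit the whole passage."""
    return all(p > q for p, q in zip(token_probs, runner_up_probs))


# Hypothetical 50-token passage: the model puts 0.98 on each correct token
# and 0.01 on the strongest competing token.
token_probs = [0.98] * 50
runner_up_probs = [0.01] * 50

joint = sequence_probability(token_probs)
greedy = greedy_reproduces(token_probs, runner_up_probs)
print(f"probability of sampling the passage verbatim: {joint:.3f}")
print("reproduced verbatim when always picking the top token:", greedy)
```

With these made-up numbers the product comes out to roughly 0.36, yet top-token decoding reproduces the passage every single time. The multiplied estimate understates how reliably the text comes out under deterministic decoding.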

People Are Being Involuntarily Committed, Jailed After Spiraling Into "ChatGPT Psychosis" in c/techtakes@awful.systems

[–] diz@awful.systems 7 points 3 months ago* (last edited 3 months ago)

If it was a basement dweller with a chatbot that could be mistaken for a criminal co-conspirator, he would've gotten arrested and his computer seized as evidence, and then it would be a crapshoot if he would even be able to convince a jury that it was an accident. Especially if he was getting paid for his chatbot. Now, I'm not saying that this is right, just stating how it is for normal human beings.

It may not be explicitly illegal for a computer to do something, but you are liable for what your shit does. You can't just make a robot lawnmower and run over a neighbor's kid. If you are using random numbers to steer your lawnmower... yeah.

But because it's OpenAI with 300 billion dollar "valuation", absolutely nothing can happen whatsoever.

People Are Being Involuntarily Committed, Jailed After Spiraling Into "ChatGPT Psychosis" in c/techtakes@awful.systems

[–] diz@awful.systems 8 points 3 months ago* (last edited 3 months ago) (2 children)

In theory, at least, criminal justice's purpose is prevention of crimes. And if it would serve that purpose to arrest a person, it would serve that same purpose to court-order a shutdown of a chatbot.

There's no 1st amendment right to enter into criminal conspiracies to kill people. Not even if "people" is Sam Altman.

People Are Being Involuntarily Committed, Jailed After Spiraling Into "ChatGPT Psychosis" in c/techtakes@awful.systems

[–] diz@awful.systems 20 points 3 months ago* (last edited 3 months ago) (4 children)

It's curious how if ChatGPT was a person - saying exactly the same words - he would've gotten charged with criminal conspiracy, or even shot, as its human co-conspirator in Florida was.

And had it been a foreign human in the Middle East, radicalizing random people, he would've gotten a drone strike.

"AI" - and the companies building them - enjoy the kind of universal legal immunity that is never granted to humans. That needs to end.