overview for diz

AI solves every river crossing puzzle, we can go home now [content warning: botshit] in c/techtakes@awful.systems

[–] diz@awful.systems 6 points 4 weeks ago

Yeah, that's a great example.

The other thing is that unlike art, source code is already made to be consumed by a machine. It is not any more transformative to convert source code to equivalent source code, than it is to re-encode a video.

The only thing they do that is "transformative" is using source code not for compiling it but for defrauding the investors.

AI solves every river crossing puzzle, we can go home now [content warning: botshit] in c/techtakes@awful.systems

[–] diz@awful.systems 5 points 4 weeks ago* (last edited 4 weeks ago) (14 children)

Other funny thing: it only became a fully automatic plagiarism machine when it claimed that it wrote the code (referring to itself by name which is a dead giveaway that the system prompt makes it do that).

I wonder if code is where they will ultimately get nailed to the wall for willful copyright infringement. Code is too brittle for their standard approach, "we sort of blurred a lot of works together so its ours now, transformative use, fuck you, prove that you don't just blur other people's work together, huh?".

But also for a piece of code, you can very easily test if the code has the same "meaning" - you can implement a parser that converts code to an expression graph, and then compare that. Which makes it far easier to output code that is functionally identical to the code they are plagiarizing, but looks very different.

But also I estimate approximately 0% probability that the assholes working on that wouldn't have banter between themselves about copyright laundering.

edit: Another thing is that since it can have no own conception of what "correct" behavior is for a piece of code being plagiarized, it would also plagiarize all the security exploits.

This hasn't been a big problem for the industry, because only short snippets were being cut and pasted (how to make some stupid API call, etc), but with generative AI whole implementations are going to get plagiarized wholesale.

Unlike any other work, code comes with its own built in, essentially irremovable "watermark" in the form of security exploits. In several thousands lines of code, there would be enough "watermark" for identification.

We test Google Veo: impressive demo, unusable results in c/techtakes@awful.systems

[–] diz@awful.systems 10 points 4 weeks ago

Having worked in computer graphics myself, it is spot on that this shit is uncontrollable.

I think the reason is fundamental - if you could control it more you would put it too far from any of the training samples.

That being said video enhancements along the lines of applying this as a filter to 3d rendered CGI or another video, that could (to some extent) work. I think the perception of realism will fade as it gets more familiar - it is pretty bad at lighting, but in a new way.

Google's Gemini 2.5 pro is out of beta. in c/techtakes@awful.systems

[–] diz@awful.systems 5 points 1 month ago

Well, it did reach for "I double checked it, I'm totally sure now" language.

From the perspective of trying to convince the top brass that they are making good progress towards creating an artificial psychopath - not just an artificial human - it's pretty good.

Google's Gemini 2.5 pro is out of beta. in c/techtakes@awful.systems

[–] diz@awful.systems 5 points 1 month ago* (last edited 1 month ago)

Still seems terminally AI pilled to me, an iteration or two later. "5 digit multiplication is borderline", how is that useful?

I think there's a combination of it being a pinnacle of billions and billions of dollars, and probably theirs firing people for slightest signs of AI skepticism. There's another data point, "reasoning math & code" is released as stable by Google without anyone checking if it can do any kind of math.

edit: imagine that a calculator manufacturer in 1970s is so excited about microprocessors they release an advanced scientific calculator that can't multiply two 6 digit numbers (while their earlier discrete component model could). Outside the crypto sphere, that sort of insanity is new.

Google's Gemini 2.5 pro is out of beta. in c/techtakes@awful.systems

[–] diz@awful.systems 7 points 1 month ago

Yeah, I'd also bet on the latter. They also added a fold-out button that shows you the code it wrote (folded by default), but you got to unfold it or notice that it is absent.

Google's Gemini 2.5 pro is out of beta. in c/techtakes@awful.systems

[–] diz@awful.systems 10 points 1 month ago

Oh and also for the benefit of our AI fanboys who can't understand why we would expect something as mundane from this upcoming super-intelligence, as doing math, here's why:

Google's Gemini 2.5 pro is out of beta. in c/techtakes@awful.systems

[–] diz@awful.systems 6 points 1 month ago* (last edited 1 month ago)

Also, I just noticed something really fucking funny:

(arrows are for the sake of people like llllll...)

Google's Gemini 2.5 pro is out of beta. in c/techtakes@awful.systems

[–] diz@awful.systems 9 points 1 month ago (5 children)

lmao: they have fixed this issue, it seems to always run python now. Got to love how they just put this shit in production as "stable" Gemini 2.5 pro with that idiotic multiplication thing that everyone knows about, and expect what? to Eliza Effect people into marrying Gemini 2.5 pro?

Google's Gemini 2.5 pro is out of beta. in c/techtakes@awful.systems

[–] diz@awful.systems 6 points 1 month ago* (last edited 1 month ago) (2 children)

there was a directive that if it were asked a math question that you can’t do in your brain or some very similar language it should forward it to the calculator module.

The craziest thing about leaked prompts is that they reveal the developers of these tools to be complete AI pilled morons. How in the fuck would it know if it can or can't do it "in its brain" lol.

edit: and of course, simultaneously, their equally idiotic fanboys go "how stupid of you to expect it to use a calculating tool when it said it used a calculating tool" any time you have some concrete demonstration of it sucking ass, while simultaneously the same kind of people are lauding the genius of system prompts half of which are asking it to meta-reason.

Google's Gemini 2.5 pro is out of beta. in c/techtakes@awful.systems

[–] diz@awful.systems 11 points 1 month ago (3 children)

Thing is, it has tool integration. Half of the time it uses python to calculate it. If it uses a tool, that means it writes a string that isn't shown to the user, which runs the tool, and tool results are appended to the stream.

What is curious is that instead of request for precision causing it to use the tool (or just any request to do math), and then presence of the tool tokens causing it to claim that a tool was used, the requests for precision cause it to claim that a tool was used, directly.

Also, all of it is highly unnatural texts, so it is either coming from fine tuning or from training data contamination.

Eliezer uses the tragic death of someone to smugly (and falsely) further his rhetoric in c/sneerclub@awful.systems

[–] diz@awful.systems 3 points 1 month ago

I don't think we need to go as far as evopsych here... it may just be an artifact of modeling the environment at all - you learn to model other people as part of the environment, you re-use models across people (some people are mean, some people are nice, etc).

Then weather happens, and you got yourself a god of bad weather and a god of good weather, or perhaps a god of all weather who's bipolar.

As far as language goes it also works the other way, we over used these terms in application to computers, to the point that in relation to computers "thinking" no longer means it is actually thinking.