diz

joined 2 years ago
[–] diz@awful.systems 20 points 3 weeks ago* (last edited 3 weeks ago) (4 children)

It's curious how if ChatGPT was a person - saying exactly the same words - he would've gotten charged with a criminal conspiracy, or even shot, as its human co-conspirator in Florida did.

And had it been a foreign human in the middle east, radicalizing random people, he would've gotten a drone strike.

"AI" - and the companies building them - enjoy the kind of universal legal immunity that is never granted to humans. That needs to end.

[–] diz@awful.systems 11 points 3 weeks ago* (last edited 3 weeks ago)

I appreciate the sentiment but I also hate the whole "AI is a power loom for coding".

The power loom for coding is called "git clone".

What "AI" (LLM) tools provide is just English as a programming language with plagiarized sum total of all open source as the standard library. English is a shit programming language. LLMs are shit at compiling it. Open source is awesome. Plagiarized open source is "meh" - you can not apply upstream patches.

[–] diz@awful.systems 4 points 3 weeks ago

So, the judge says:

In cases involving uses like Meta’s, it seems like the plaintiffs will often win, at least where those cases have better-developed records on the market effects of the defendant’s use.

And what is that supposed to ever look like? Do authors need a better developed record of effects of movies on book sales, to get paid for movie adaptations, too?

[–] diz@awful.systems 12 points 3 weeks ago (5 children)

It's called sarcasm.

[–] diz@awful.systems 1 points 3 weeks ago* (last edited 3 weeks ago)

One thing that I couldn't easily figure out is what is the constant factor. If the constant factor is significantly worse than for Strassen, then it would be much slower than Strassen except for very large matrices.

Let's say the constant factor is k.

N should be large enough that N^((log(49)-log(48))/log(4)) > k where k is the constant factor. Let's say the difference in exponents is x, then

N^x > k

log(N)*x > log(k)

N > exp(log(k)/x)

N > k^(1/x)

So lets say x is 0.01487367169 , then we're talking [constant factor]^67 for how big the matrix has to be?

So, 2^67 sized matrix (2^134 entries in it) if Google's is 2x greater constant than Strassen.

That don't even sound right, but I double checked, (k^67) ^ 0.01487367169 is approximately k.

edit: I'm not sure what the cross over points would be if you use Google's then Strassen's then O( n^3 )

Also, Strassen’s algorithm works on reals (and of course, on complex numbers), while the new "improvement" reduces by 1 the number of real multiplications required for a product of two 4x4 complex-valued matrices.

[–] diz@awful.systems 9 points 3 weeks ago* (last edited 3 weeks ago)

Old McDonald had a startup, iyo io o[4-mini].

It's funny how just today in a completely unrelated context a generative ai enthusiast used an example of OpenAI getting sued by NYT as a reason why they wouldn't commit some other malfeasance because they'd get caught if they did.

[–] diz@awful.systems 5 points 3 weeks ago* (last edited 3 weeks ago)

Its not about moats, it's about open source community (whose code had been trained on) coming out with pitchforks. It has nothing to do with moats.

You are way overselling coding agents.

Re-creating some open source project with a similar function is literally the only way a coding agent can pretend to be a programmer.

I tried latest models for code and they are in fact capable of shitting out a thousand lines of working code at a time, which obviously can only be obtained via plagiarism since they are also incapable of writing the most trivial code for a novel situation. And the neat thing about plagiarism is that once you start you can keep going since there's more of compatible code where it came from.

[–] diz@awful.systems 7 points 3 weeks ago* (last edited 3 weeks ago)

Yeah I'm thinking this one may be special cased, perhaps they wrote a generator of river crossing puzzles with corresponding conversion to "is_valid_state" or some such. I should see if I can get it to write something really ridiculous into "is_valid_state".

Other thing is that in real life its like "I need to move 12 golf carts, one has low battery, I probably can't tow more than 3 uphill, I can ask Bob to help but he will be grumpy...", just a tremendous amount of information (most of it irrelevant) with tremendous^tremendous^ possible moves (most of them possible to eliminate by actual thinking).

[–] diz@awful.systems 2 points 4 weeks ago

Honestly 16 bit would be easier because there would just be so much less code.

[–] diz@awful.systems 3 points 4 weeks ago* (last edited 4 weeks ago) (7 children)

Pre-LLM, I had to sit through one or two annual videos to the sense of “dont cut and paste from open source, better yet don’t even look at GPLd code you arent working on” and had to do a click test with questions like “is it ok if you rename all the variables yes no”. Ohh and I had to run a scanning tool as part of the release process.

I don’t think its the FSD they would worry about, but GPL especially v3. Nobody gives a shit if it steals some leetcode snippet, or cuts and pastes some calls to a stupid API.

But if you have a “coding agent” just replicating GPL code wholesale, thousands and thousands of lines, it would be very obvious. And not all companies ship shitcode. Apple is a premium product and ages old patched CVEs from open source cropping up in there wouldn’t be exactly premium.

[–] diz@awful.systems 5 points 4 weeks ago (1 children)

That is not equivalent, though; other solutions to "can not be left unattended" exist; just ask Kristi Noem.

[–] diz@awful.systems 3 points 4 weeks ago (1 children)

LLM snippets are so 2024. Coding agents, baby.

view more: ‹ prev next ›