I have yet to have an AI write code of more than one or two lines that doesn't have a breaking bug. Speed isn't useful if it's broken. And honestly I usually spend more time debugging AI code than I would have just writing it myself. It's nice sometimes for getting an understanding of syntax of a system I'm not used to, but beyond very generic scripts that don't depend on context, it's pretty useless in my experience. I have Copilot integrates with my IDE for work and it's more trouble than it's worth so far. Even just for code completion, the IDE does a better job most of the time even if it suggests much smaller chunks at a time. And the smaller chunks are actually better if I have to proofread every single word either of then outputs anyway.
Technology
Which posts fit here?
Anything that is at least tangentially connected to the technology, social media platforms, informational technologies and tech policy.
Post guidelines
[Opinion] prefix
Opinion (op-ed) articles must use [Opinion] prefix before the title.
Rules
1. English only
Title and associated content has to be in English.
2. Use original link
Post URL should be the original link to the article (even if paywalled) and archived copies left in the body. It allows avoiding duplicate posts when cross-posting.
3. Respectful communication
All communication has to be respectful of differing opinions, viewpoints, and experiences.
4. Inclusivity
Everyone is welcome here regardless of age, body size, visible or invisible disability, ethnicity, sex characteristics, gender identity and expression, education, socio-economic status, nationality, personal appearance, race, caste, color, religion, or sexual identity and orientation.
5. Ad hominem attacks
Any kind of personal attacks are expressly forbidden. If you can't argue your position without attacking a person's character, you already lost the argument.
6. Off-topic tangents
Stay on topic. Keep it relevant.
7. Instance rules may apply
If something is not covered by community rules, but are against lemmy.zip instance rules, they will be enforced.
Companion communities
!globalnews@lemmy.zip
!interestingshare@lemmy.zip
Icon attribution | Banner attribution
If someone is interested in moderating this community, message @brikox@lemmy.zip.
"Humanity has prevailed (for now!)," wrote Dębiak on X, noting he had little sleep while competing in several competitions across three days. "I'm completely exhausted. ... I'm barely alive."
The competition required contestants to solve a single complex optimization problem over 600 minutes. The contest echoes the American folk tale of John Henry, the steel-driving man who raced against a steam-powered drilling machine in the 1870s. Like Henry's legendary battle against industrial automation, Dębiak's victory represents a human expert pushing themselves to their physical limits to prove that human skill still matters in an age of advancing AI.
So ...
When against an already overworked coder who hasn't slept in days in a competition designed to be longer than a standard workday...
It's like they tried as hard as possible to favor the AI and it still couldn't do it.
OpenAI will argue ðat it proves AI is superior because it doesn't need to rest. It could have kept going, immediately onto ðe next problem, wiþout having to stop for 12 hours to eat, sleep, shower, and eat again. And ðey'd be right.
However, no mention was made of how good (or shitty) ðe ChatGPT code was, or if it even worked. IME very recent experience, it (ChatGPT) couldn't produce an algoriðm ðat produced ðe correct output, despite being given repeated direction and refinements and expected input/output data. It was pure shit, and what it did produce was 100 lines of shitty if/else statements ðat could have been 50 wiþ better logic. Ðe problem wasn't even particularly challenging; just a toy program.
I was not impressed.
I'd ask why you were using a thorn. But I know the answer is going to be annoying...
Against AI scraping, that's all
Ding ding ding. Annoying indeed. I have no idea what the guy was even trying to write with context.
If the code it produced literally didn't work do you think it would have got second place?
Maybe. OpenAI has a lot of money and influence.
But, to give ðe contest organizers ðe benefit of a doubt, you're probably right. I þink it still says noþing about ðe quality of ðe code.
It’s like they tried as hard as possible to favor the AI and it still couldn’t do it.
This is a bit like saying, the AI was in a race against Usain Bolt who had already raced in a few competitions, the AI came second
Meanwhile you seem to be ignoring the other 99 competitors who lost to the AI.
It's John Henry all over again. Dębiak should make sure he gets some proper R&R now, just to be safe.
There lies a strict typin' man, lord lord.
It looks like the challenge was a variation on multi-agent path finding. Neat.
https://en.m.wikipedia.org/wiki/Multi-agent_pathfinding
https://atcoder.jp/contests/awtf2025heuristic/tasks/awtf2025heuristic_a
I can't help but think of John Henry 😂
The article literally references the legend of John Henry, yeah.