this post was submitted on 12 Mar 2026
54 points (93.5% liked)
Programming
26022 readers
279 users here now
Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!
Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.
Hope you enjoy the instance!
Rules
Rules
- Follow the programming.dev instance rules
- Keep content related to programming in some way
- If you're posting long videos try to add in some form of tldr for those who don't want to watch videos
Wormhole
Follow the wormhole through a path of communities !webdev@programming.dev
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
The gains, where they exist, are nowhere near that much. In some cases, it makes developers slower (even though they think they're a bit faster):
https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/
Have you actually read the study? People keep citing this study without reading it.
They grabbed like 8 devs who did not have pre-existing set up workflows for optimizing AI usage, and just throw them into it as a measure of "does it help"
Imagine if I grabbed 8 devs who had never used neovim before and threw them into it without any plugins installed or configuration and tried to use that as a metric for "is nvim good for productivity"
People need to stop quoting this fuckass study lol, its basically meaningless.
Im a developer using agentic workflows with over 17 years experience.
I am telling you right now, with the right setup, I weekly turn 20 hour jobs into 20 minute jobs.
Predominantly large "bulk" operations that are mostly just boilerplate code that is necessary, where the AI has an existing huge codebase to draw from as samples and I just give it instructions of "see what already exists? implement more of that following "
A great example is integration testing where like 99% of the code is just boilerplate.
Arrange the same setup every time. Arrange your request following an openapi spec file. Send the request. Assert on the response based on the openapi spec.
I had an agent pump out 120 integration tests based on a spec file yesterday and they were, for the most part, 100% correct, yesterday. In like an hour.
The same volume of work would've easily taken me way longer.