Technology

81451 readers

4022 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

908

Open-source game engine Godot is drowning in 'AI slop' code contributions: 'I don't know how long we can keep it up' (www.pcgamer.com)

submitted 1 day ago by ryujin470@fedia.io to c/technology@lemmy.world

171 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] Jyek@sh.itjust.works 3 points 19 hours ago* (last edited 19 hours ago) (2 children)

Maybe we need a way to generate checksums during version creation (like file version history) and during test runs of code that would be submitted along side the code as a sort of proof of work that AI couldn't easily recreate. It would make code creation harder for actual developers as well but it may reduce people trying to quickly contribute code the LLMs shit out.

A lightweight plugin that runs in your IDE maybe. So anytime you are writing code and testing it, the plugin is modifying a validation file that shows what you were doing and the results of your tests and debugging. Could then write an algorithm that gives a confidence score to the validation file and either triggers manual review or submits obviously bespoke code.

[–] crater2150@feddit.org 2 points 7 hours ago (1 children)

What exactly would you checksum? All intermediate states that weren't committed, and all test run parameters and outputs? If so, how would you use that to detect an LLM? The current agentic LLM tools also do several edits and run tests for the thing they're writing, then edit more until their tests work.

So the presence of test runs and intermediate states isn't really indicative of a human writing code and I'm skeptical that distinguishing between steps a human would do and steps an LLM would do is any easier or quicker than distinguishing based on the end result.

[–] Jyek@sh.itjust.works 1 points 40 minutes ago (1 children)

You could time stamp changes and progress to a file. Record results of tests and output and give an approximate algorithmic confidence rating about how bespoke the process of writing that code was. Even agentic AI rapidly spits out code like a machine would where humans take time and think about things as they go. They make typos and go back and correct them. Code tests fail and debugging looks different between an agent and a human. We need to fingerprint how agents write code and use agentic code processed through this sort of validation looks versus what it looks like for humans to do the same.

[–] crater2150@feddit.org 1 points 18 minutes ago

This basically amounts to a key/interaction logger in the IDE. I'd suspect this would prevent many people contributing to projects using something like that, at least I wouldn't install such a plug-in.

[–] Jyek@sh.itjust.works 2 points 18 hours ago

This could, in theory, also be used by universities to validate submitted papers to weed out AI essays.