this post was submitted on 24 Oct 2025
62 points (100.0% liked)

Privacy

3195 readers
371 users here now

Icon base by Lorc under CC BY 3.0 with modifications to add a gradient

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] Hamartiogonic@sopuli.xyz 5 points 2 months ago (2 children)

If it’s learning based on screenshots, it can only learn to play really slow games. FPS games would require video.

[–] theunknownmuncher@lemmy.world 11 points 2 months ago* (last edited 2 months ago)

You're massively underestimating the power or big data. Think about a dataset of millions of screenshot sequences.

You could have said something very similar about LLMs learning semantic meaning while being training on basically random garbage text from the internet

[–] bigboitricky@lemmy.world 6 points 2 months ago

Also it wouldn't need to use actual video it could just use the buttons, check mouse inputs, and see what app you're in. An occasional screencap could be used, but if you can be listened to with a high DPI mouse, you can have this work too.