this post was submitted on 17 Feb 2026
[–] JGrffn@lemmy.world 1 points 3 days ago

I'm not sure there are many options other than cynical doomerism for analysing this situation. My uneducated guess? They've probably already run out of real-world data, and are now forced to produce ridiculous amounts of LLM-generated data to try and keep the training process going.

Other alternatives I can think of:

  • they might be creating multiple model versions and keeping them for iteration metrics
  • they might need to ingest a lot more real-world data to continue. Since video has been such a focus as of late, maybe they're building huge video libraries for the models? Or maybe they're creating their own real-world data with high detail.
  • my most doomer take is that this is the beginning of a vastly deeper authoritarian online state, where a LOT more data is getting collected from EVERYONE and fed into both new training data for models and knowledge bases that give models context to work on top of.

We've known for a while that they're running out of training data, so it makes a lot of sense to generate more data with models, create it at large scale without AI, or collect even more data in even more invasive ways from everyone online. There's literally no other reason I can think of to buy the entire stock of WD drives for 2026 just two months into the year.