this post was submitted on 13 Aug 2023
374 points (74.8% liked)

Technology

76358 readers
1162 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] Buffaloaf@lemmy.world 5 points 2 years ago (3 children)

I asked Bard how ChatGPT can fix itself, here's what it said:

ChatGPT can fix itself in a number of ways, including:

Retraining on a larger dataset. ChatGPT is trained on a massive dataset of text and code, but it can always be retrained on a larger dataset. This would help ChatGPT to learn new things and improve its performance.

Using a different algorithm. ChatGPT is currently trained using the GPT-3 algorithm, but there are other algorithms that could be used. For example, ChatGPT could be trained using the Megatron-Turing NLG algorithm, which is one of the most powerful language models in the world.

Using a combination of algorithms. ChatGPT could also be trained using a combination of algorithms. This would allow ChatGPT to benefit from the strengths of different algorithms.

Using a feedback loop. ChatGPT could be used to generate text, and then this text could be used to train ChatGPT. This would allow ChatGPT to learn from its own mistakes and improve its performance over time.

Using human feedback. ChatGPT could also be used to generate text, and then this text could be reviewed by humans. The human feedback could then be used to improve ChatGPT's performance.

[–] lagomorphlecture@lemm.ee 10 points 2 years ago (1 children)

"Using a feedback loop. ChatGPT could be used to generate text, and then this text could be used to train ChatGPT. This would allow ChatGPT to learn from its own mistakes and improve its performance over time."

So basically create its own Fox News and see how that goes.

[–] vrighter@discuss.tchncs.de 12 points 2 years ago (1 children)

this is widely known to very quickly destroy your model

[–] Buffaloaf@lemmy.world 4 points 2 years ago

Remember, this is Bard's advice for ChatGPT

[–] chaogomu@kbin.social 7 points 2 years ago (1 children)

The feedback loop is already happening, and is called model collapse.

It's not a good thing.

[–] FaceDeer@kbin.social 3 points 2 years ago

The full suggestion includes "This would allow ChatGPT to learn from its own mistakes", which implies that the text it generated would be evaluated and curated before being sent back into it for training. That, as well as including non-AI-generated text along with the AI generated stuff, should stop model collapse.

Model collapse is basically inbreeding, with similar causes and similar solutions. A little inbreeding is not inherently bad, indeed it's used frequently when you're trying to breed an organism to have specific desirable characteristics.

[–] FaceDeer@kbin.social 4 points 2 years ago

If having an AI tell researchers that they should base its next iteration off of Megatron isn't the plot of a Michael Bay Transformers movie already, it should have been.