Similar case from 2 years ago with Whisper when transcribing German.
I'm confused by this. Didn't we already have pretty decent speech-to-text before LLMs? It wasn't perfect, but at least it didn't hallucinate random things into the text. Why the heck was that replaced with this stuff?
I'm just confused because I remember using Dragon NaturallySpeaking on Windows 98 in the 90s, and it already worked pretty accurately for dictation back then. Sometimes it feels as if all of that never happened.