this post was submitted on 09 Apr 2024
32 points (97.1% liked)

Open Source

31114 readers
1 users here now

All about open source! Feel free to ask questions, and share news, and interesting stuff!

Useful Links

Rules

Related Communities

Community icon from opensource.org, but we are not affiliated with them.

founded 5 years ago
MODERATORS
 

Is there any good speech to text transcription app?

top 14 comments
sorted by: hot top controversial new old
[–] shortwavesurfer@monero.town 21 points 1 year ago (2 children)

For android there is FUTO voice input as an fdroid repo https://app.futo.org/fdroid/repo

[–] makingStuffForFun@lemmy.ml 8 points 1 year ago (2 children)

I can't believe how good this is! I'm on a really busy street (in the middle of Sydney, George Street actually), and it's put in commas in, it handles question marks and all my texts. I can't believe how good this system is and to think that it is local and open source is so impressive.

[–] shortwavesurfer@monero.town 4 points 1 year ago (1 children)

I know, right? It's a really amazing piece of work when it comes right down to it. They have a donation thing, but since I don't have Google Play Services, I can't use it, and they don't have the crypto donation set up yet.

[–] makingStuffForFun@lemmy.ml 3 points 1 year ago

Thanks for letting me know about the donation thing. I suffer from RSI so I'm always happy to donate to things that help. Much appreciated. And as before, this is written with the keyboard voice to text. It's now very, very noisy in a bar.

[–] nightm4re@feddit.de 2 points 1 year ago

Holy crap, the quality of the generated input is incredible.

[–] barbara@lemmy.ml 1 points 1 year ago

Works great. Thank you.

[–] poVoq@slrpnk.net 7 points 1 year ago* (last edited 1 year ago) (2 children)

https://github.com/mkiol/dsnote

But basically look for something that utilizes the Whisper AI model. This list has a few options, but I am not sure if there is newer stuff: https://github.com/sindresorhus/awesome-whisper

[–] sturlabragason@lemmy.world 4 points 1 year ago

Yeah Whisper is really good. It can also deal with a bunch of non english languages.

[–] barbara@lemmy.ml 1 points 1 year ago

That's good to know, thank you.

Futo voice input got damn good for german and english at least. They use whisper afaik

[–] makingStuffForFun@lemmy.ml 1 points 1 year ago

Check out Talon Voice. It's a bit of getting your head around, but I use it for not just transcribing, but also coding, and general computer navigation.

[–] lps@lemmy.ml 1 points 1 year ago
[–] j4k3@lemmy.ml 0 points 1 year ago (1 children)

The offline AI that I tried a few months ago probably needed training on the noise environment to get decent results. I forget which ones I tried but likely the ones with extensions already in Oobabooga Textgen. I was messing with text to speech mostly, but some of the ones that do TTS also have packages and examples for STT. Nothing I tried for offline generation was good enough to speak as an AI prompt without manual corrections.

[–] B0rax@feddit.de 1 points 1 year ago

Did you try whisper?