this post was submitted on 11 Jun 2025
761 points (99.0% liked)

People Twitter

7487 readers
845 users here now

People tweeting stuff. We allow tweets from anyone.

RULES:

  1. Mark NSFW content.
  2. No doxxing people.
  3. Must be a pic of the tweet or similar. No direct links to the tweet.
  4. No bullying or international politcs
  5. Be excellent to each other.
  6. Provide an archived link to the tweet (or similar) being shown if it's a major figure or a politician.

founded 2 years ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] Deflated0ne@lemmy.world 94 points 2 weeks ago (2 children)

I believe him, I also believe that Nintendo has secret rooms of deaf kids somewhere. No clue what they'd use them for. But if the news broke that someone found a dungeon owned by Nintendo full of deaf kids i wouldn't be surprised.

[–] lowleveldata@lemmy.world 9 points 2 weeks ago

Would be useful for accessibility test

[–] FooBarrington@lemmy.world 4 points 2 weeks ago (1 children)

Mostly recreational purposes I assume

[–] Deflated0ne@lemmy.world 1 points 2 weeks ago

If Nintendo was based in the US I'd have included that possibility in my post above.

[–] egonallanon@lemm.ee 64 points 2 weeks ago (2 children)

Under delivering yet again Sean. I want my deaf children!

[–] Bakkoda@sh.itjust.works 28 points 2 weeks ago

They are great for stress relief. You can yell at them all fuckin day.

[–] crypto@sh.itjust.works 1 points 2 weeks ago
[–] lime@feddit.nu 41 points 2 weeks ago (6 children)

it's crazy to me that for all the ai "advances" in the past few years nobody has thought to improve subtitling.

[–] ChaoticNeutralCzech@feddit.org 15 points 2 weeks ago

Poor deaf kids. Not because they're being held captive but because they're relying on shitty automatic captions.
For example, Czech was only added very recently and the captions really suck, they change the meaning of most sentences and even include spelling errors.

Everyone making scripted videos should at least:

  1. go through their script to convert it into a transcript (match what's actually been said – looking at you CGP Grey – and remove visual cues)
  2. upload it for YouTube's auto-timing (which is not perfect but we'll take it)

Too bad the FCC's captioning act is toothless, even TV stations (like HBO) uploading their content to YouTube don't bother importing captions even though they're legally required to.

[–] monkeyslikebananas2@lemmy.world 11 points 2 weeks ago

I’m sure they have… they just aren’t currently incentivized to do so

[–] Ansis100@lemmy.world 8 points 2 weeks ago (2 children)

My student friend tells me that the auto-generated captions for non-English MS Teams lecture recordings recently have improved significantly and have even become usable.

[–] GreenCrunch@lemmy.today 1 points 3 days ago

I had a lecture for an aerospace class a few years back. What the professor said is "what is a perfect gas?" - what the caption software produced is "what is a prefect ass?"

Captioning is hard, man...

[–] lime@feddit.nu 1 points 2 weeks ago

not for my language. they are hilariously bad.

[–] Knock_Knock_Lemmy_In@lemmy.world 7 points 2 weeks ago (1 children)

Andrew Ng did a video when he gradually added noise to the training audio to improve the quality.

But here we are dealing with homophones so it's not just turning speech to text, it also needs to be context aware.

Possible but too expensive to implement automatically.

[–] lime@feddit.nu 6 points 2 weeks ago (1 children)

context awareness is the entire point of language models tho :(

[–] Knock_Knock_Lemmy_In@lemmy.world 2 points 2 weeks ago (1 children)

I'm highlighting that speech to text and context awareness are different skills.

YouTube is unlikely to waste loads of compute power on subtitles that don't need it just to capture the occasional edge case.

[–] lime@feddit.nu 2 points 2 weeks ago (2 children)

i mean, it's a one-time-per-video thing. they already do tons of processing on every upload.

[–] Knock_Knock_Lemmy_In@lemmy.world 3 points 2 weeks ago (1 children)

So if you can reduce compute there then you save money.

There is no technical difficulty. It's a business decision.

[–] lime@feddit.nu 2 points 2 weeks ago (1 children)

right now they're dynamically generating subtitles every time. that's way more compute.

[–] aow@sh.itjust.works 1 points 1 week ago (1 children)

For real? That's incredibly dumb/expensive compared to one subtitle roll. Can you share where you saw that?

[–] lime@feddit.nu 1 points 1 week ago* (last edited 1 week ago)

well, i have no evidence of this. however. looking at the way auto-generated subtitles are served at youtube right now, they are sent individually word-by-word from the server, pick up filler words like "uh", and sometimes pause for several seconds in the middle of sentences. and they're not sent by websocket, which means they go through multiple requests over the course of a video. more requests means the server works harder because it can't just stream the text like it does the video, and the only reason they'd do that other than incompetence (which would surely have been corrected by now, it's been like this for years) is if the web backend has to wait for the next word to be generated.

i would love to actually know what's going on if anyone has any insight.

[–] starchylemming@lemmy.world 2 points 2 weeks ago

it would be an improvement. thats not what we are doing anymore

new tech is there to make everyone more miserable

[–] chiliedogg@lemmy.world 6 points 2 weeks ago* (last edited 2 weeks ago) (2 children)

It's even worse for captions.

Captions and subtitles aren't even the same thing.

In fact, most DVD players don't even pass the code captioning through HDMI ports, so old captioned DVDs don't work anymore.

[–] glitchdx@lemmy.world 7 points 1 week ago

i too watched that technology connections video

[–] lime@feddit.nu 4 points 2 weeks ago* (last edited 2 weeks ago)

~~explain.~~ edited with explanation. i've seen the technology connections video, thanks.

my comment is still about the actual post above, and i was specifically thinking about auto-generated subs rather than, say, movies. apparently that's not obvious.

[–] Vinstaal0@feddit.nl 1 points 2 weeks ago (1 children)

Probably because a lot of countries either dub the content or it is already in their native laguage. You generally see a lot of subtitles on OpenSubtitles of countries like The Netherlands where that doesn't happen

[–] lime@feddit.nu 2 points 2 weeks ago (2 children)
[–] vxx@lemmy.world 5 points 2 weeks ago

Auto generated subtitles don't sell ads and don't aquire personal data.

[–] Vinstaal0@feddit.nl 1 points 2 weeks ago (1 children)

No, but on the fields where there is money to be made for subtitles, like movies and tv shows.

[–] lime@feddit.nu 1 points 2 weeks ago (1 children)

what does that have to do with the OP though?

[–] Vinstaal0@feddit.nl 1 points 2 weeks ago (1 children)

Where do you think money for development of new shit comes from? From places where money is made, like TV, movies etc. YouTube doesn't really make more money because the ads are there.

And Twitter hahaha I can only laugh at that

[–] lime@feddit.nu 1 points 2 weeks ago (2 children)

...i feel like you're having a different conversation than i am.

load more comments (2 replies)
[–] Gerudo@lemm.ee 28 points 2 weeks ago

Sounds like what someone would say if they were hiding deaf kids.

[–] Default_Defect@midwest.social 26 points 2 weeks ago (1 children)

OH, now we're too high and mighty to have a deaf kid closet? Fame changes people.

[–] runner_g@lemmy.blahaj.zone 5 points 1 week ago

Fucking woke bullshit. (/s because someone is gonna think I'm serious)

[–] reksas@sopuli.xyz 25 points 2 weeks ago

greedy nintendo hoarding all the deaf kids to themselves

[–] rumba@lemmy.zip 10 points 2 weeks ago (1 children)

How has speech to text gotten so bad. Dictation and keyboard taps are just going to hell with AI.

[–] Duamerthrax@lemmy.world 7 points 1 week ago (2 children)

The AI bros refuse to even allow proof reading and corrections.

I've never had voice commands work right for my speech. Something I had to demonstrate every time my friends ask why I won't use the speech commands on their TV.

Speech interpretation just doesn't work for everyone and I personally refuse to alter how I speak for a computer.

[–] sunzu2@thebrainbin.org 3 points 1 week ago

Same here, if this PoS computer can't handle me at my best 12 beers in, it does not deserve me at my worst, sober!

[–] Randelung@lemmy.world 1 points 1 week ago

Aaron? Iron urn?

[–] crawancon@lemm.ee 7 points 2 weeks ago

nokidsseesky

[–] flandish@lemmy.world 7 points 2 weeks ago (1 children)
[–] kamen@lemmy.world 6 points 1 week ago (8 children)

AI will take over the world /s

[–] gabbath@lemmy.world 4 points 1 week ago

It has, it's just stupid.

load more comments (7 replies)
load more comments
view more: next ›