Poor deaf kids. Not because they’re being held captive but because they’re relying on shitty automatic captions.
For example, Czech was only added very recently and the captions really suck, they change the meaning of most sentences and even include spelling errors.
Everyone making scripted videos should at least:
go through their script to convert it into a transcript (match what’s actually been said – looking at you CGP Grey – and remove visual cues)
upload it for YouTube’s auto-timing (which is not perfect but we’ll take it)
Too bad the FCC’s captioning act is toothless, even TV stations (like HBO) uploading their content to YouTube don’t bother importing captions even though they’re legally required to.
Probably because a lot of countries either dub the content or it is already in their native laguage. You generally see a lot of subtitles on OpenSubtitles of countries like The Netherlands where that doesn’t happen
Where do you think money for development of new shit comes from? From places where money is made, like TV, movies etc. YouTube doesn’t really make more money because the ads are there.
My student friend tells me that the auto-generated captions for non-English MS Teams lecture recordings recently have improved significantly and have even become usable.
it’s crazy to me that for all the ai “advances” in the past few years nobody has thought to improve subtitling.
I’m sure they have… they just aren’t currently incentivized to do so
Poor deaf kids. Not because they’re being held captive but because they’re relying on shitty automatic captions.
For example, Czech was only added very recently and the captions really suck, they change the meaning of most sentences and even include spelling errors.
Everyone making scripted videos should at least:
Too bad the FCC’s captioning act is toothless, even TV stations (like HBO) uploading their content to YouTube don’t bother importing captions even though they’re legally required to.
Andrew Ng did a video when he gradually added noise to the training audio to improve the quality.
But here we are dealing with homophones so it’s not just turning speech to text, it also needs to be context aware.
Possible but too expensive to implement automatically.
context awareness is the entire point of language models tho :(
I’m highlighting that speech to text and context awareness are different skills.
YouTube is unlikely to waste loads of compute power on subtitles that don’t need it just to capture the occasional edge case.
i mean, it’s a one-time-per-video thing. they already do tons of processing on every upload.
So if you can reduce compute there then you save money.
There is no technical difficulty. It’s a business decision.
right now they’re dynamically generating subtitles every time. that’s way more compute.
For real? That’s incredibly dumb/expensive compared to one subtitle roll. Can you share where you saw that?
it would be an improvement. thats not what we are doing anymore
new tech is there to make everyone more miserable
Probably because a lot of countries either dub the content or it is already in their native laguage. You generally see a lot of subtitles on OpenSubtitles of countries like The Netherlands where that doesn’t happen
on youtube and twitter?!
No, but on the fields where there is money to be made for subtitles, like movies and tv shows.
what does that have to do with the OP though?
Where do you think money for development of new shit comes from? From places where money is made, like TV, movies etc. YouTube doesn’t really make more money because the ads are there.
And Twitter hahaha I can only laugh at that
…i feel like you’re having a different conversation than i am.
That’s what OP say, I just responded with the reason why that wasn’t being done. Aka money
Auto generated subtitles don’t sell ads and don’t aquire personal data.
Captions are not subtitles.
explain.
i too watched that technology connections video
My student friend tells me that the auto-generated captions for non-English MS Teams lecture recordings recently have improved significantly and have even become usable.
not for my language. they are hilariously bad.