Using AI for image transcripts, yay or nay?

Gonzako@lemmy.world · 2 months ago

Using AI for image transcripts, yay or nay?

hendrik@palaver.p3x.de · 2 months ago

I’d ask someone who needs these transcriptions first. I lean more towards “Nay”. I mean, if they want AI transcriptions, I guess they could just run their own AI, and that way they get to choose between human and AI ones. I’m kind of against flooding the internet with AI content as long as the recipients can do it themselves.

Lumidaub@feddit.org · 2 months ago

That’s a good point, but wouldn’t it be preferable to have one AI run once instead of several of them doing the same work again and again?

(Assuming that we’re even okay with AI-generated descriptions in the first place, which I’m not, for reasons I’ve laid out in my other comments, but I’m talking hypothetically.)

Meldrik@lemmy.wtf · edit-2 2 months ago

Alternatively, it could be built into the platform, so that when someone uploads an image to Lemmy, a local AI model writes the description.

Edit: Then it could even be marked as AI-generated, and people could choose whether to be exposed to it or not.

Rain World: Slugcat Game@lemmy.world · 1 month ago

people think local ai is the panacea, when ai needs a shit-ton of content scraped from the internet, and countless hours churning in datacenters, for the model to be produced in the first place

hendrik@palaver.p3x.de · edit-2 2 months ago

Really hard to tell. I mean, there are situations in which people think they’re doing someone a favour, but they’re really not. The upside of doing it individually is that affected people get to pick the model they like best, and they can prompt it however they like. Whether your pre-generated stuff is on the same level or more of a disservice depends a bit on your expertise on the matter. The upside of pre-generating it once is maybe a bit less CO2 in the atmosphere and a few fewer trees killed. But that certainly depends on how many people read those descriptions. If there are just 2 people with screenreaders out there, who don’t even click on all the images, you might very well be wasting compute and end up with a negative balance for the environment.