Using AI for image transcripts, yay or nay?

Gonzako@lemmy.world · 2 days ago

Using AI for image transcripts, yay or nay?

KatherinaReichelt@feddit.org · 2 hours ago

I think that technology can really help us here. OCR on images is mostly solved. If you know what PaddleOCR can do, those people on Mastodon who are whining about others not including an image description for a screenshot seem really annoying. It is possible to do this directly on your computer without any costs, without the need for beefy hardware. So no need to try to force everyone else to include transcriptions for screenshot, no need to attack other people, just do it yourself and enjoy the text on the screenshot. Technology can really help us here.

This also does kind of apply to AI image descriptions. Try it and put an image into Gemini and ask it to describe it. You will be surprised. AI can totally give you a workable description of an image. The problem here is that those AI tools can get quite expensive when you are using them a lot and that many disabled people do not have much money. So in my opinion it totally is ok to include AI image descriptions.

I think that there are too many people in the fediverse who do not know the current state of the technology and hate AI for maybe the right reasons, but who are missing out how it could help them.