Hi!

While I really enjoy seeing so many people being accommodating to people with disabilities, I find manually transcribing every image I post very tiring.

I thought I could at least use some sort of AI to help with image transcripts, though that could probably be better used by the actual person with the disability.

So that's the question: should I skip transcribing an image, or let an AI do it?

  • Tamlyn@lemmy.zip
    English
    arrow-up
    8
    arrow-down
    7
    ·
    12 days ago

    A lot of artists don’t want their art to be used by AI. You can’t prevent that if you let an AI summarize your images, so don’t use AI for that.

    • Lumidaub@feddit.org
      English
      arrow-up
      5
      ·
      12 days ago

      Those are different mechanisms. Object recognition doesn’t mean the AI is now trained on the image and can reproduce it (which is btw why AI can still “visually” recognise what’s in an image that has been nightshaded/glazed).

      • Sir. Haxalot@nord.pub
        English
        arrow-up
        3
        ·
        12 days ago

        This is true, but it’s also important to remember that if you use an AI model hosted by the same party that trains it, they will likely pass any data you input on to the training stage, unless you have an enterprise contract regulating training use.

        OP mentioned he will use a self-hosted LLM, though, and in that case there’s no risk of the data being used for training.

        • Lumidaub@feddit.org
          English
          arrow-up
          2
          arrow-down
          1
          ·
          12 days ago

          I mean, if you put any image online that hasn’t been protected/poisoned in some way, you have to (unfortunately) assume it’s in some AI’s training data anyway. If the tradeoff for a useful description (! See my other comments about the lack of usefulness) is that an image is also fed into one more training corpus, that would be worth a thought, imho. If the image is protected/poisoned, I’d indeed encourage this whole hypothetical process, just to further sabotage the data.

    • Gonzako@lemmy.worldOP
      English
      arrow-up
      5
      ·
      12 days ago

      I was actually thinking of using a self-hosted LLM for these tasks. I wanna dig into it again, and I’ve got access to computers on the cheap.