I have nearly every service imaginable running and have now started a new project.

I am creating a searchable stock photo archive for my lan. It has been a very interesting project but think i may have crossed the line into overkill lol.

I had hundreds of stock photo cds from the 90s I have turned them all into ISO’s.

I then spent ages dealing with some strange cdrom layouts but got all the images off.

I then converted them all to JPG.

I have now setup a batch script that dedupes then takes the images in 2k batches, runs them through a ai vision model to add keywords and descriptions; as they have none.

They are then copied to a folder where I have photoprism running as the front end and I only have 4k done so far but they look amazing and the search and descriptions are really accurate and useful.

400k more images to go but at least it should all be automated now.

  • Analog@lemmy.ml
    link
    fedilink
    English
    arrow-up
    5
    ·
    5 hours ago

    This is awesome!

    Personally I would have used TIFF and either Immich or ResourceSpace (a DAM - meant for this kind of thing, but also maybe more institutional than you want.)

    • shellington@piefed.zipOP
      link
      fedilink
      English
      arrow-up
      4
      ·
      6 hours ago

      Might be worth a try.

      The thing that has surprised me the most is how good this AI model is at accurately knowing what is in a picture when the model itself is only 3GB in size.

    • shellington@piefed.zipOP
      link
      fedilink
      English
      arrow-up
      2
      ·
      5 hours ago

      kind of getting ready for the day when the open web becomes pretty much unusable due to ai and id requirements

      • IratePirate@feddit.org
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        5 hours ago

        And when the slopocalypse and technofascism have blown over, we crawl out of our digital bunkers and repopulate the wastelands of cyberspace with… stock images of Hide the Pain Harold? I love it! 😄 Although I could think of more valuable data to hoard for that event.

    • shellington@piefed.zipOP
      link
      fedilink
      English
      arrow-up
      4
      ·
      6 hours ago

      This feeds into my ultimate project which I think will take me all summer.

      I plan to create a lan wide search that has kiwix, tube archivist, ubooquity, paperless, jellyfin, stock images and stash.

      Then I would have a unified search point and i think it would make kiwix far more usable by not having to go to the specific zim first. BBut, it is a tricky project as some things are nice and have api’s others don’t

    • shellington@piefed.zipOP
      link
      fedilink
      English
      arrow-up
      6
      ·
      6 hours ago

      No i’m not just kind of thought it would be nice to have, that way if i ever need to make any cards or banners i have loads of stuff in every category, with no ugly AI pics like most search engine image searches show these days.