Do you host your own ML / AI / LLM? What do you use, and what do you use it for?

  • SuspiciousCarrot78@aussie.zoneOP
    link
    fedilink
    English
    arrow-up
    4
    arrow-down
    1
    ·
    edit-2
    5 hours ago

    You can get a P40 for much less than that, if your case can hold full height card. It’s an old card but its 24GB, 400GB/s.

    Else yeah…$3-4,000 is about table stakes, which doesn’t amortise for just AI (not for my use cases anyway). I’d love a Strix but Santa is stingy.

    Me - I have a fetish for tiny, low power computers. 1L lenovos, raspberry pis etc. That limits what I can run but with constraint comes inginuity. So I’m making an expert system for myself.

    https://codeberg.org/BobbyLLM/picoGURU

    It’s not cooked yet (this is actually the first time I’m sharing it in public; it’s not in installable state and the repo is new) but once it’s done, I can have an always on local brain in a 2W envelope that runs fast. Might even port it to C64…I need an excuse to purchase the new Commodore ultimate.

    • irmadlad@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      1
      ·
      edit-2
      5 hours ago

      I was thinking something along the lines of:

      • AMD Ryzen 9 9950X3D 4.3GHz
      • NVIDIA Tesla M10 Quad
      • 96GB DDR5
      • 4TB 990 EVO M.2

      Which, with all the other accoutrements like water cooling, etc, will put me right at the $4k mark.

      https://codeberg.org/BobbyLLM/picoGURU

      I’ll check it out.

        • irmadlad@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          2
          ·
          4 hours ago

          Well, I haven’t had any new equipment in 15 years. I always buy used or refurb’d. I’m getting old. I figure, I’ve worked hard enough, might as well enjoy the fruits of my labor.

      • SuspiciousCarrot78@aussie.zoneOP
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        2
        ·
        5 hours ago

        Nice bit of kit that. Very nice. Planning on serious AI shenanigans?

        I’ll check it out.

        Cool. It’s not ready any time soon but when it is, I’ll announce it and make sure it’s callable via SSH / terminal / OpenAI style chat end point.

        That way you don’t need anything fancier than a nice terminal to call it.

        • irmadlad@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          2
          ·
          4 hours ago

          Planning on serious AI shenanigans

          Absolutely. Like I said, if I’m going to do it, I want to do it up right. I don’t want to come back in 5+ minutes for a result. LOL