Do you host your own AI?

SuspiciousCarrot78@aussie.zone · 6 hours ago

Do you host your own AI?

SuspiciousCarrot78@aussie.zone · edit-2 5 hours ago

You can get a P40 for much less than that, if your case can hold full height card. It’s an old card but its 24GB, 400GB/s.

Else yeah…$3-4,000 is about table stakes, which doesn’t amortise for just AI (not for my use cases anyway). I’d love a Strix but Santa is stingy.

Me - I have a fetish for tiny, low power computers. 1L lenovos, raspberry pis etc. That limits what I can run but with constraint comes inginuity. So I’m making an expert system for myself.

https://codeberg.org/BobbyLLM/picoGURU

It’s not cooked yet (this is actually the first time I’m sharing it in public; it’s not in installable state and the repo is new) but once it’s done, I can have an always on local brain in a 2W envelope that runs fast. Might even port it to C64…I need an excuse to purchase the new Commodore ultimate.

irmadlad@lemmy.world · edit-2 5 hours ago

I was thinking something along the lines of:

AMD Ryzen 9 9950X3D 4.3GHz
NVIDIA Tesla M10 Quad
96GB DDR5
4TB 990 EVO M.2

Which, with all the other accoutrements like water cooling, etc, will put me right at the $4k mark.

https://codeberg.org/BobbyLLM/picoGURU

I’ll check it out.

Natanox@discuss.tchncs.de · 5 hours ago

Damn bro, you’re treating yourself for sure! The RAM alone is what, 2k? 🥴

irmadlad@lemmy.world · 4 hours ago

Well, I haven’t had any new equipment in 15 years. I always buy used or refurb’d. I’m getting old. I figure, I’ve worked hard enough, might as well enjoy the fruits of my labor.

SuspiciousCarrot78@aussie.zone · 5 hours ago

Nice bit of kit that. Very nice. Planning on serious AI shenanigans?

I’ll check it out.

Cool. It’s not ready any time soon but when it is, I’ll announce it and make sure it’s callable via SSH / terminal / OpenAI style chat end point.

That way you don’t need anything fancier than a nice terminal to call it.

irmadlad@lemmy.world · 4 hours ago

Planning on serious AI shenanigans

Absolutely. Like I said, if I’m going to do it, I want to do it up right. I don’t want to come back in 5+ minutes for a result. LOL