LLM with Web Search functionality

SubArcticTundra@lemmy.ml · 10 hours ago

artyom@piefed.social · 8 hours ago

What’s a “strix board”? Is that necessary?

vapeloki@lemmy.world · 8 hours ago

AMD Strix is an APU optimized for AI. It is the cheapest option I am aware of for running bigger models at home: about 2k for 56GB of VRAM, with a total power budget of less than 300W.

One could run smaller models, but with the context sizes required for research work, that is nearly impossible.

Alternatively, external services like OpenRouter can be used to access models hosted in the cloud.

But for self-hosting, you need something that can run models needing at least 15GB of VRAM, plus context. For comparison: our highly quantized model uses 20GB of VRAM. For our 4 slots we need another 20GB on top of that (around 5GB per slot for 254k tokens), making it 40GB.
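
To make the arithmetic explicit, here is a rough sketch in Python. It assumes the 5GB context figure applies to each of the 4 slots, which is what makes the numbers add up to 40GB. The function name `vram_budget_gb` is made up for illustration, and real usage will vary with the model, the quantization, and the inference server.

```python
def vram_budget_gb(weights_gb, slots, context_gb_per_slot):
    """Rough estimate: model weights plus one context allocation per slot."""
    return weights_gb + slots * context_gb_per_slot


# Figures from the comment above (assumption: ~5GB of context per slot).
total = vram_budget_gb(weights_gb=20, slots=4, context_gb_per_slot=5)
print(total)       # 40

# Headroom on a 56GB board, before the OS and anything else take their share.
print(56 - total)  # 16
```

Changing `slots` or `context_gb_per_slot` shows quickly how far a smaller card would get you: with these numbers, even a single slot already needs about 25GB.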