• ffhein@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    3 hours ago

    Ah, multiple GPUs? For some reason I thought you meant that with exllamav3 you had managed to load a model which was larger than your VRAM.