

1·
19 hours agoI also have a 5060 (ti) with 16GB of RAM. I tend to use GPT-OSS:20B or Qwen3:14B with a context of ~30k. I have custom system prompt for my style of reponse I like on open web ui. That takes up about 14GB of my 16GB VRAM
But yeah it is slower and not as “smart” as the cloud based models, but I think the inconvenience of the speed and having to fact check/test code is worth the privacy and environmental trade offs
I am also kinda new, but it seems like it leans towards multiple accounts. Some lemmy instances don’t federate, so I have two.
And then it seems like there is a alot of style and content overlapp between pixelfed and mastadon. So I just have a pixelfed account and follow a few folks from Mastond there.
It would be weird to see pixel fed type posts on my lemmy feeds, but I guess that is just how am using it so far