Guide to Self Hosting LLMs Faster/Better than Ollama

brucethemoose@lemmy.world · edited 1 month ago

gravitas_deficiency@sh.itjust.works · 1 month ago

Wtf are you talking about. PCIe passthrough exists.

brucethemoose@lemmy.world · 1 month ago

I would not recommend that for performance reasons, AFAIK.

Windows is fine, I should make that more clear.

gravitas_deficiency@sh.itjust.works · 1 month ago

Huh, really? Is there that much of a perf hit using passthrough? I’d have assumed that the bottleneck isn’t actually the PCIe link so much as the beefiness of the GPU crunching the model.

brucethemoose@lemmy.world · edited 1 month ago

I have not tested WSL or VMs in Windows in a while, but my impression is that “it depends” and you should use the native Windows version unless you are having some major installation issues.

kitnaht@lemmy.world · edited 1 month ago

Why would you even bother trying to run this all through a VM when you can just run it directly? If you’re to the point of using VMs, you don’t need this tutorial anyways.

Are you seriously telling me you’re jumping through all the hoops to spin up a VM on Linux, and then doing all the configuration for GPU passthrough, because you can’t just figure out how to run it locally?

gravitas_deficiency@sh.itjust.works · 1 month ago

Bro this is a community for sharing knowledge and increasing the technical aptitude of fellow users by doing said sharing. Maybe instead of shitting on a pretty solid digest of the fundamentals of setting up something like this, try adding to the body of knowledge instead.