Comment on Self-Hosted AI is pretty darn cool

CallMeButtLove@lemmy.world 3 months ago
Is there a way to host an LLM in a docker container on my home server but still leverage the GPU on my main PC?

LodeMike@lemmy.today 3 months ago
No?

azl@lemmy.sdf.org 3 months ago
You would need to run the LLM on the system that has the GPU (your main PC). The front-end (typically a WebUI) could run in a docker container and make API calls to your LLM system. Unfortunately, that requires the model to stay loaded in VRAM on your main PC, severely limiting what else you can do with that computer, GPU-wise.
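
As a minimal sketch of the split azl describes: the LLM backend runs on the GPU machine, and the containerized front-end (or any other client on the home server) reaches it over HTTP on the LAN. The backend choice (Ollama), its default port 11434, the LAN IP, and the model name are assumptions for illustration, not details from the thread.

```python
import requests

# Assumed address of the main PC running the LLM backend (Ollama's default
# port is 11434); replace with your actual LAN IP and installed model.
LLM_HOST = "http://192.168.1.50:11434"

def ask(prompt: str) -> str:
    """Send a prompt to the remote LLM over its HTTP API and return the reply."""
    resp = requests.post(
        f"{LLM_HOST}/api/generate",
        json={"model": "llama3", "prompt": prompt, "stream": False},
        timeout=120,
    )
    resp.raise_for_status()
    return resp.json()["response"]

if __name__ == "__main__":
    # Any container on the home server can make the same call, provided the
    # main PC's firewall allows inbound connections on the backend's port.
    print(ask("Say hello from the GPU box."))
```

A web front-end that speaks an Ollama- or OpenAI-compatible API could be pointed at the same URL instead of a script like this; the constraint azl raises still applies either way, since the model occupies the main PC's VRAM whenever the backend has it loaded.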