Comment

Comment on What exactly is a self-hosted small LLM actually good for (<= 3B)

herseycokguzelolacak@lemmy.ml ⁨6⁩ ⁨months⁩ ago

for coding tasks you need web search and RAG. It’s not the size of the model that matters, since even the largest models find solutions online.

Sort:hotnew top

catty@lemmy.world ⁨6⁩ ⁨months⁩ ago
Any suggestions for solutions?

source
- wise_pancake@lemmy.ca ⁨6⁩ ⁨months⁩ ago
  Open webui lets you install a ton of different search providers out of the box, but you do need sn API key for most and I haven’t vetted them
  
  I’m trying to get Kagi to work with Phi4 and not having success.
  
  source
  - catty@lemmy.world ⁨6⁩ ⁨months⁩ ago
    Thanks, when I get some time soon, I’ll have another look at it and cherry ai with a local install of ollama
    
    source
- herseycokguzelolacak@lemmy.ml ⁨6⁩ ⁨months⁩ ago
  Not on top of my head, but there must be something. llama.cpp and vllm have basically solved the inference problem for LLMs. What you need is a RAG solution on top that also combines it with web search.
  
  source