What is a self-hosted small LLM actually good for (<= 3B)

catty@lemmy.world · edit-2 2 months ago

What is a self-hosted small LLM actually good for (<= 3B)

catty@lemmy.world · 1 month ago

Any suggestions for solutions?

herseycokguzelolacak@lemmy.ml · 1 month ago

Not on top of my head, but there must be something. llama.cpp and vllm have basically solved the inference problem for LLMs. What you need is a RAG solution on top that also combines it with web search.

wise_pancake@lemmy.ca · 1 month ago

Open webui lets you install a ton of different search providers out of the box, but you do need sn API key for most and I haven’t vetted them

I’m trying to get Kagi to work with Phi4 and not having success.

catty@lemmy.world · 1 month ago

Thanks, when I get some time soon, I’ll have another look at it and cherry ai with a local install of ollama