One binary, no Docker
Single Go binary with llama.cpp statically linked via cgo. The web dashboard is embedded too. No subprocess at runtime, no compose file, no port juggling. Download, run.
How →
A CLI that turns a folder of documents and shell commands into a queryable AI knowledge base. Runs fully offline on CPU. Ship the result as one zip — anyone can chat against it on a plane.
local-agents init my-handbook scaffolds a folder and lets you pick a CPU-tested chat model from a curated short list. You drop PDFs into data/, edit wiki-instructions.md (the LLM prompt template), and run local-agents wiki generate to turn each source into a structured wiki article. local-agents build packages everything — wiki + RAG chunks + skills — into a zip. Send the zip. The recipient runs local-agents import, then local-agents serve opens the dashboard, or local-agents chat drops into a terminal REPL. Throughout, no telemetry, no accounts, no cloud.
chat and serve make zero outbound calls. eval is the dev-time exception and needs an API key.The scaffold + init + doctor commands work end-to-end. wiki generate, chat, build, serve, import, eval are stubs that return not implemented. See where we stand for the milestone-by-milestone status.