Local LLM Tools
Local LLM ToolsLocal model runners, inference servers, gateways, and self-hosted chat or agent tools.
Use caseNeeds verification
Repository · Auto enriched
A high-throughput and memory-efficient inference and serving engine for LLMs
Stars and freshness are shown as source-backed signals, but this repository is still Auto enriched. Do not treat star count as a final recommendation.
GitHub stars at fetch time
Forks at fetch time
Primary language
Local model runners, inference servers, gateways, and self-hosted chat or agent tools.
No tool relationship is attached yet.