Skip to content

Binary Tool Landscape for Token-Efficient LLM Local Dev (2026-02-17)

Scope (Track 1 of 3)

This is Track 1/3 in the research set: 1. Binary tools (coding agents, local CLIs, proxies/gateways, local runtime/control plane). 2. Cloud services (see docs/paid-cloud-services-token-efficiency-landscape.md). 3. Skills, MCP, and ecosystem (see docs/coding-agent-skill-usage-and-complex-task-playbook.md).

1. Executive summary

  1. Put context compaction before model calls: use rtk for runtime terminal-output reduction and repomix/files-to-prompt for deterministic repo-context packing; this cuts low-signal tokens before they ever hit model APIs. rtk repomix files-to-prompt
  2. Route all coding-agent traffic through a local LiteLLM proxy with response/semantic cache and explicit fallback routing; this is the highest-ROI local control point for token and cost policy. LiteLLM Caching LiteLLM Routing
  3. For repeated long-prefix workloads, pair local inference (vLLM/Ollama/llama.cpp server) with OpenAI-compatible endpoints and prefix reuse where supported; this lowers prefill overhead and keeps iteration local. vLLM APC Ollama OpenAI compatibility llama.cpp
  4. Add self-hosted observability from day one (Langfuse or OpenLIT) so cache-hit, token, and cost deltas are measurable by route; otherwise token optimization efforts become guesswork. Langfuse token & cost tracking OpenLIT destination docs
  5. Treat retrieval as a local-token budget problem: use local vector stores (Qdrant, sqlite-vec) and constrain retrieved payload before generation. Qdrant quickstart sqlite-vec

Terminal-output and context-compaction CLIs

  • rtk: runtime command rewriting + output compression for agent terminal calls; best when your token waste is in shell/test/log verbosity. rtk
  • repomix: one-shot repository packing with --compress and --token-count-tree; best before large repo-level analysis tasks. repomix
  • files-to-prompt: deterministic file selection/formatting (--ignore, --extension, --line-numbers) for controlled prompt payloads. files-to-prompt
  • aider repo map: auto-ranked repository map with --map-tokens budget control; best for iterative editing with token-aware global context. aider repo map

Local gateways and inference runtimes

  • LiteLLM Proxy: local AI gateway with response cache, semantic cache options, and routing/fallback policies. LiteLLM Caching LiteLLM Routing
  • vLLM: local serving with Automatic Prefix Caching (enable_prefix_caching=True) for shared-prompt workflows. vLLM APC
  • Ollama: local OpenAI-compatible API endpoint for local models and tool-calling support in existing clients. Ollama OpenAI compatibility
  • llama.cpp server: lightweight local OpenAI-compatible HTTP server for GGUF models. llama.cpp

Token/cost observability (self-hosted-first)

Local retrieval/indexing utilities

  • Qdrant local: Docker-first local vector DB for retrieval pipelines. Qdrant quickstart
  • sqlite-vec: in-process SQLite vector extension; useful for lightweight local retrieval without extra service overhead. sqlite-vec

Prompt/eval tooling in local workflows

  • promptfoo CLI: local eval and provider-call caching to stop prompt regressions that inflate token use. Promptfoo caching Promptfoo CLI
  • tenkai: coding-agent experimentation framework for controlled A/B runs (config, prompt, and tool-chain alternatives) with repeatability and statistical tests. tenkai

Explicit rtk pass: adjacent “rtk-like” options

  • Better than rtk when you need one-shot repo context packaging (not runtime command filtering): repomix. repomix
  • Better than rtk for persistent edit sessions where the tool auto-manages global code context in a token budget: aider repo map. aider repo map
  • Better than rtk for deterministic, scriptable, minimal context selection in CI/local scripts: files-to-prompt. files-to-prompt
  • Better than all three for cross-provider token policy enforcement and caching in one place: LiteLLM Proxy (gateway layer). LiteLLM

3. Comparison table (binary/local tools)

Scoring: 0-3 (higher is better). For setup/overhead, 3 means low complexity/low overhead.

Tool Category Token reduction impact Latency impact Setup complexity Operational overhead Maturity (GitHub snapshot 2026-02-17 UTC) License/safe adoption Compatibility (Codex,Claude Code,LiteLLM,Bedrock,local models)
rtk Runtime output compaction 3 2 3 3 822★, pushed 2026-02-17 (api) MIT Codex: indirect (shell workflow, unverified); Claude Code: explicit hook docs; LiteLLM/Bedrock: indirect; local models: indirect
repomix Repo packing/compression 3 2 3 3 21,907★, pushed 2026-02-17 (api) MIT Codex/Claude Code: yes (pre-processed context files); LiteLLM/Bedrock: N/A direct; local models: yes
files-to-prompt Deterministic context curation 2 3 3 3 2,603★, pushed 2025-02-19 (api) Apache-2.0 Codex/Claude Code: yes; LiteLLM/Bedrock: N/A direct; local models: yes
aider repo map Token-budgeted code mapping 3 2 2 2 40,701★ (Aider-AI/aider), pushed 2026-02-16 (api) Apache-2.0 Codex: pattern-compatible (unverified direct); Claude Code: pattern-compatible; LiteLLM/Bedrock/local: via provider config
LiteLLM Gateway cache/routing 3 2 2 2 36,194★, pushed 2026-02-17 (api) NOASSERTION on API snapshot; verify in repo before enterprise rollout Codex: OpenAI-compatible path (unverified); Claude Code: via Anthropic model routing; LiteLLM: native; Bedrock: yes; local models: yes
vLLM Local inference with prefix cache 3 2 1 1 70,493★, pushed 2026-02-17 (api) Apache-2.0 Codex/Claude Code: via OpenAI-compatible endpoint + gateway (unverified direct); LiteLLM: yes; Bedrock: no direct; local models: yes
Ollama Local model runtime 2 2 3 3 162,782★, pushed 2026-02-17 (api) MIT Codex/Claude Code: via OpenAI-compatible endpoint (unverified direct); LiteLLM: yes; Bedrock: no direct; local models: native
Langfuse Self-host token/cost observability 2 1 1 1 22,006★, pushed 2026-02-17 (api) NOASSERTION on API snapshot; verify in repo before enterprise rollout Codex/Claude Code: indirect; LiteLLM: documented integration; Bedrock/local: via telemetry ingestion
OpenLIT Self-host LLM observability 2 1 2 2 2,210★, pushed 2026-02-17 (api) Apache-2.0 Codex/Claude Code: indirect; LiteLLM/Bedrock/local: via OpenTelemetry/SDK path
Qdrant Local retrieval store 2 2 2 2 28,837★, pushed 2026-02-17 (api) Apache-2.0 Codex/Claude Code: indirect; LiteLLM: via semantic cache option; Bedrock/local: retrieval-side compatible
sqlite-vec In-process retrieval/indexing 2 3 3 3 6,937★, pushed 2026-02-13 (api) Apache-2.0 Codex/Claude Code/LiteLLM/Bedrock: indirect; local models: retrieval-side compatible
promptfoo Local eval + caching 1 2 3 2 10,488★, pushed 2026-02-17 (api) MIT Codex/Claude Code: workflow-level; LiteLLM/Bedrock/local: provider adapters/OpenAI-compatible paths

4. Implementation playbook (local-first)

A. Install and baseline compaction tools

brew install rtk-ai/tap/rtk
npm i -g repomix
uv tool install files-to-prompt || pipx install files-to-prompt

# Verify tool-level capabilities
rtk gain || true
repomix --token-count-tree 500
files-to-prompt . --ignore "*.lock" --extension md --extension py --line-numbers > context.txt
Sources: rtk repomix files-to-prompt

B. Put LiteLLM in front as local policy gateway

# litellm_config.yaml
model_list:
  - model_name: claude-fast
    litellm_params:
      model: anthropic/claude-sonnet-4-20250514
      api_key: os.environ/ANTHROPIC_API_KEY
  - model_name: local-vllm
    litellm_params:
      model: hosted_vllm/my-local-model
      api_base: http://127.0.0.1:8000/v1
  - model_name: local-ollama
    litellm_params:
      model: ollama/qwen3-coder
      api_base: http://127.0.0.1:11434

litellm_settings:
  cache: true
  cache_params:
    type: redis
    ttl: 600

router_settings:
  routing_strategy: cost-based-routing
litellm --config ./litellm_config.yaml
Sources: LiteLLM Caching LiteLLM Routing LiteLLM Anthropic provider

C. Run local inference backends with prefix reuse where applicable

# vLLM example (prefix caching enabled)
vllm serve meta-llama/Llama-3.1-8B-Instruct --enable-prefix-caching

# Ollama example
ollama pull qwen3-coder
ollama serve
Sources: vLLM APC vLLM engine args Ollama OpenAI compatibility

D. Add token/cost observability and dashboards

# Langfuse k8s helm (self-hosted)
helm repo add langfuse https://langfuse.github.io/langfuse-k8s
helm repo update
helm install langfuse langfuse/langfuse -n langfuse

# LiteLLM -> Langfuse callback (in litellm config)
# litellm_settings:
#   callbacks: ["langfuse_otel"]
# OpenLIT zero-code instrumentation
openlit-instrument --otlp-endpoint "http://localhost:4318" --service-name "agent-stack" python app.py
Sources: Langfuse self-host deployment Langfuse LiteLLM integration OpenLIT docs

E. Enforce context budgets in CI

# promptfooconfig.yaml (example)
prompts:
  - file://prompt.md
providers:
  - openai:gpt-4.1-mini

# Keep repeat eval calls cached locally
commandLineOptions:
  cache: true
promptfoo eval
promptfoo cache clear  # use during controlled invalidation
Sources: Promptfoo caching Promptfoo CLI

5. Ideal binary stack for Codex/Claude Code + LiteLLM

  • Ingress layer: coding agent CLI (Codex/Claude Code) with shell output reduced via rtk and targeted file/context packaging via repomix or files-to-prompt. rtk repomix files-to-prompt
  • Policy/gateway layer: local LiteLLM proxy for routing, response caching, and semantic cache-backed reuse. LiteLLM Caching LiteLLM Routing
  • Inference layer: hybrid local (vLLM/Ollama/llama.cpp) + remote provider routes selected by policy. vLLM APC Ollama llama.cpp
  • Retrieval layer: local vector service (Qdrant) or embedded extension (sqlite-vec) for tight payload control. Qdrant quickstart sqlite-vec
  • Observability layer: self-hosted Langfuse/OpenLIT for route-by-route token and cost instrumentation. Langfuse token tracking OpenLIT docs

Compatibility note: direct Codex CLI base-URL override behavior is unverified in this report; treat Codex-specific proxy wiring as an implementation check item.

6. Adopt now / try next / monitor

Adopt now

  1. rtk + LiteLLM cache/routing + Langfuse/OpenLIT as the minimum token-efficiency control plane.
  2. repomix --token-count-tree in preflight scripts for large-repo tasks.
  3. Prompt/eval cache gates in CI with promptfoo.

Try next

  1. vLLM prefix-caching routes for repeated static-prefix coding tasks.
  2. Qdrant semantic cache through LiteLLM for repeated near-duplicate requests.
  3. aider repo map token-budget patterns in long multi-file edits.

Monitor

  1. New context-compression proxies similar to rtk (quality regression risk if compression removes high-signal details).
  2. Licensing/enterprise terms for tools that show NOASSERTION in API metadata snapshots.
  3. Gateway lock-in risk if custom routing policy becomes overly provider-specific.

All links below were accessed on 2026-02-17 unless otherwise noted.

Required discovery sources

Primary implementation references used in recommendations

8. Aggressive Hunt Expansion (Iteration 3: Cross-Host + Cross-Check)

Last automated refresh: 2026-02-18 02:43 UTC

This section replaces the prior merged sample with a larger, cross-host catalog and an independent second-pass cross-check.

Iteration summary: 1. Pass 1 (GitHub + GitLab + Codeberg): - Raw crawl inputs: 11,543 GitHub repos, 507 GitLab repos, 423 Codeberg repos. - Binary-relevant merged pool after dedupe/filtering: 12,109 repos. - Ranked shortlist generated: 430 (360 GitHub, 40 GitLab, 30 Codeberg). 2. Pass 2 (GitLab + Codeberg alternate query vocabulary, independent cross-check): - Raw crawl inputs: 305 GitLab repos, 0 Codeberg repos returned for this query set. - Binary-relevant filtered set: 288. - Net-new vs pass 1 merged pool: 273. 3. Final ranking after cross-check merge and cleanup filters: - Stronger AI-signal filter (llm/mcp/openai/claude/codex/ollama/vllm/prompt/rag/vector/litellm/...) to reduce generic false positives. - Non-GitHub quality guard for final inclusion: stars > 0 and recent activity window. - Final catalog size: 414 tools (360 GitHub, 26 GitLab, 28 Codeberg). 4. Note: the GitHub crawl hit secondary rate limiting near the end of topic queries; this was mitigated by cross-host expansion and independent pass-2 validation.

Pinned inclusion check (requested): - danicat/tenkai (11 stars, updated 2026-02-01): experimentation framework for coding agents, useful for controlled A/B comparisons of toolchains/prompts.

Category mix (Top 414):

Category Count
MCP Tooling 290
Coding CLIs 27
Gateway/Proxy 26
Other 17
Local Runtime 13
Retrieval/Index 13
Context Compression 10
Prompt/Eval 10
Orchestration 8

Category × host breakdown (Top 414):

Category Total GitHub GitLab Codeberg
MCP Tooling 290 263 19 8
Coding CLIs 27 19 0 8
Gateway/Proxy 26 20 5 1
Other 17 14 0 3
Local Runtime 13 10 0 3
Retrieval/Index 13 8 1 4
Context Compression 10 10 0 0
Prompt/Eval 10 8 1 1
Orchestration 8 8 0 0

Comprehensive catalog grouped by category (Top 414):

MCP Tooling (290)

Repository Host Stars Updated
n8n-io/n8n github 174983 2026-02-18
google-gemini/gemini-cli github 94741 2026-02-18
modelcontextprotocol/servers github 78862 2026-02-14
sansan0/TrendRadar github 46469 2026-02-09
upstash/context7 github 46001 2026-02-17
zhayujie/chatgpt-on-wechat github 41304 2026-02-16
BerriAI/litellm github 36202 2026-02-18
bytedance/UI-TARS-desktop github 28042 2026-01-14
github/github-mcp-server github 27024 2026-02-17
ChromeDevTools/chrome-devtools-mcp github 25794 2026-02-17
assafelovic/gpt-researcher github 25339 2026-02-01
jlowin/fastmcp github 22898 2026-02-18
modelcontextprotocol/python-sdk github 21709 2026-02-17
activepieces/activepieces github 20854 2026-02-17
oraios/serena github 20326 2026-02-17
1Panel-dev/MaxKB github 20132 2026-02-12
microsoft/mcp-for-beginners github 14399 2026-02-17
ruvnet/claude-flow github 14156 2026-02-17
triggerdotdev/trigger.dev github 13667 2026-02-17
czlonkowski/n8n-mcp github 13593 2026-02-10
modelcontextprotocol/typescript-sdk github 11618 2026-02-18
tadata-org/fastapi_mcp github 11548 2025-11-24
0xJacky/nginx-ui github 10666 2026-02-18
Portkey-AI/gateway github 10630 2026-01-26
hangwin/mcp-chrome github 10407 2026-01-06
JoeanAmier/XHS-Downloader github 10089 2026-02-11
yusufkaraaslan/Skill_Seekers github 9592 2026-02-15
mcp-use/mcp-use github 9180 2026-02-18
D4Vinci/Scrapling github 9052 2026-02-18
xpzouying/xiaohongshu-mcp github 8851 2026-02-17
open-metadata/OpenMetadata github 8710 2026-02-18
xinnan-tech/xiaozhi-esp32-server github 8613 2026-02-18
awslabs/mcp github 8170 2026-02-18
lastmile-ai/mcp-agent github 8035 2026-01-25
Upsonic/Upsonic github 7784 2026-02-17
firerpa/lamda github 7612 2026-02-15
idosal/git-mcp github 7597 2025-08-22
AgentDeskAI/browser-tools-mcp github 7069 2025-03-26
0x4m4/hexstrike-ai github 6895 2025-11-06
modelcontextprotocol/registry github 6428 2026-02-17
grab/cursor-talk-to-figma-mcp github 6314 2026-01-28
CoplayDev/unity-mcp github 6021 2026-02-17
BrowserMCP/mcp github 5795 2025-04-24
Klavis-AI/klavis github 5635 2026-02-17
mrexodia/ida-pro-mcp github 5587 2026-02-08
firecrawl/firecrawl-mcp-server github 5519 2026-02-17
executeautomation/mcp-playwright github 5227 2025-12-13
nanbingxyz/5ire github 5023 2026-01-23
FunnyWolf/Viper github 4959 2026-01-18
casibase/casibase github 4440 2026-02-15
u14app/deep-research github 4430 2026-02-10
getsentry/XcodeBuildMCP github 4336 2026-02-17
httprunner/httprunner github 4256 2025-12-11
wgpsec/ENScan_GO github 4214 2026-01-05
UnicomAI/wanwu github 4104 2026-02-10
txn2/kubefwd github 4043 2026-02-18
open-webui/mcpo github 3975 2025-12-08
modelcontextprotocol/csharp-sdk github 3905 2026-02-18
Pimzino/spec-workflow-mcp github 3888 2026-02-08
modelcontextprotocol/go-sdk github 3874 2026-02-17
exa-labs/exa-mcp-server github 3812 2026-02-17
groupultra/telegram-search github 3747 2026-02-17
antvis/mcp-server-chart github 3677 2026-01-22
opensumi/core github 3598 2026-02-04
archestra-ai/archestra github 3574 2026-02-18
campfirein/cipher github 3519 2026-01-25
dinoki-ai/osaurus github 3503 2026-02-18
mobile-next/mobile-mcp github 3450 2026-02-03
zebbern/claude-code-guide github 3422 2026-02-18
icip-cas/PPTAgent github 3342 2026-02-17
haris-musa/excel-mcp-server github 3340 2026-01-19
liaokongVFX/MCP-Chinese-Getting-Started-Guide github 3330 2025-04-23
IBM/mcp-context-forge github 3285 2026-02-18
modelcontextprotocol/java-sdk github 3180 2026-02-17
huangjunsen0406/py-xiaozhi github 3178 2026-01-07
opensolon/solon github 2705 2026-02-15
microsoft/mcp github 2612 2026-02-18
go-nunu/nunu github 2544 2025-08-25
davepoon/buildwithclaude github 2442 2026-02-17
zcaceres/markdownify-mcp github 2411 2026-02-11
maximhq/bifrost github 2385 2026-02-18
sparfenyuk/mcp-proxy github 2263 2026-02-17
deedy5/ddgs github 2192 2026-02-14
lemonade-sdk/lemonade github 2183 2026-02-18
bytebase/dbhub github 2128 2026-02-03
brightdata/brightdata-mcp github 2033 2026-02-05
metatool-ai/metamcp github 2032 2026-02-08
cjo4m06/mcp-shrimp-task-manager github 2023 2025-08-21
jamubc/gemini-mcp-tool github 1988 2025-11-25
aws/amazon-q-developer-cli github 1894 2026-02-03
samanhappy/mcphub github 1809 2026-02-11
martin-ger/esp32_nat_router github 1789 2026-02-17
genieincodebottle/generative-ai github 1778 2026-02-17
mcp-router/mcp-router github 1747 2026-01-24
OpenAgentPlatform/Dive github 1732 2026-02-13
ravitemer/mcphub.nvim github 1720 2026-01-18
glidea/zenfeed github 1644 2025-11-22
stacklok/toolhive github 1601 2026-02-18
GongRzhe/Office-Word-MCP-Server github 1595 2025-12-31
mark3labs/mcphost github 1554 2026-02-15
timescale/pg-aiguide github 1541 2026-02-03
GongRzhe/Office-PowerPoint-MCP-Server github 1513 2025-12-31
f/mcptools github 1484 2025-12-18
isaacphi/mcp-language-server github 1447 2026-02-01
rulego/rulego github 1430 2026-02-05
taylorwilsdon/google_workspace_mcp github 1414 2026-02-18
chongdashu/unreal-mcp github 1401 2025-04-22
korotovsky/slack-mcp-server github 1370 2026-02-12
MicrosoftDocs/mcp github 1364 2026-02-12
modelcontextprotocol/php-sdk github 1359 2026-02-18
doobidoo/mcp-memory-service github 1343 2026-02-17
CoderGamester/mcp-unity github 1330 2026-02-15
universal-tool-calling-protocol/code-mode github 1313 2026-02-08
BlackSnufkin/LitterBox github 1307 2025-11-12
zinja-coder/jadx-ai-mcp github 1263 2026-01-24
modelcontextprotocol/kotlin-sdk github 1258 2026-02-17
MiniMax-AI/MiniMax-MCP github 1255 2026-01-07
modelcontextprotocol/swift-sdk github 1247 2026-02-17
refreshdotdev/web-eval-agent github 1235 2026-02-11
maxritter/claude-pilot github 1237 2026-02-17
qdrant/mcp-server-qdrant github 1222 2026-01-28
benborla/mcp-server-mysql github 1209 2026-01-27
grafbase/grafbase github 1207 2026-01-23
NPC-Worldwide/npcpy github 1196 2026-02-16
metoro-io/mcp-golang github 1192 2025-09-02
Devin-AXIS/A2V github 1201 2025-11-12
moyangzhan/langchain4j-aideepin github 1165 2026-01-23
containers/kubernetes-mcp-server github 1159 2026-02-17
joey-zhou/xiaozhi-esp32-server-java github 1139 2026-01-13
designcomputer/mysql_mcp_server github 1131 2025-06-05
AI-QL/tuui github 1126 2026-02-14
moltis-org/moltis github 1116 2026-02-17
chunkhound/chunkhound github 1039 2026-02-17
dmayboroda/minima github 1037 2026-01-22
chatmcp/mcp-server-chatsum github 1033 2024-12-04
caol64/wenyan-mcp github 1024 2026-01-30
cgoinglove/better-chatbot github 1020 2026-01-31
sheshbabu/zen github 1020 2025-12-16
IvanMurzak/Unity-MCP github 1018 2026-02-17
robotmcp/ros-mcp-server github 1011 2026-02-17
GongRzhe/Gmail-MCP-Server github 1008 2025-08-06
openops-cloud/openops github 991 2026-02-17
ref-tools/ref-tools-mcp github 978 2026-01-09
iFurySt/RedNote-MCP github 969 2025-05-11
gyoridavid/short-video-maker github 968 2025-06-21
svnscha/mcp-windbg github 964 2025-12-26
patruff/ollama-mcp-bridge github 962 2025-04-20
JetBrains/mcp-jetbrains github 941 2026-01-07
inkeep/agents github 935 2026-02-18
mongodb-js/mongodb-mcp-server github 918 2026-02-17
splx-ai/agentic-radar github 912 2025-11-27
SecretiveShell/MCP-Bridge github 905 2025-12-08
kimsungwhee/apple-docs-mcp github 905 2025-11-08
QuantGeekDev/mcp-framework github 900 2026-02-05
neo4j-contrib/mcp-neo4j github 900 2026-02-03
datalayer/jupyter-mcp-server github 896 2026-02-06
kardolus/chatgpt-cli github 889 2026-02-04
stickerdaniel/linkedin-mcp-server github 888 2026-02-17
can1357/oh-my-pi github 877 2026-02-18
asheshgoplani/agent-deck github 873 2026-02-17
Azure-Samples/AI-Gateway github 866 2026-02-14
alioshr/memory-bank-mcp github 861 2025-08-20
hyper-mcp-rs/hyper-mcp github 858 2026-02-17
mcpjungle/MCPJungle github 858 2026-02-16
microsoft/wassette github 841 2026-02-16
negokaz/excel-mcp-server github 836 2025-07-19
zaidmukaddam/scira-mcp-chat github 830 2025-12-11
rohitg00/kubectl-mcp-server github 821 2026-02-10
ankorstore/yokai github 819 2026-01-22
alexander-zuev/supabase-mcp-server github 813 2025-09-26
golf-mcp/golf github 811 2026-01-31
php-mcp/server github 811 2025-08-09
ArcadeAI/arcade-mcp github 810 2026-02-18
kontext-dev/browser-use-mcp-server github 805 2025-07-10
dyoshikawa/rulesync github 806 2026-02-18
nickclyde/duckduckgo-mcp-server github 803 2025-03-11
context-space/context-space github 800 2025-10-22
Davidyz/VectorCode github 799 2025-12-26
erwinkramer/bank-api github 792 2026-02-14
apify/apify-mcp-server github 785 2026-02-17
johnhuang316/code-index-mcp github 780 2026-01-09
browser-use/vibetest-use github 764 2025-09-02
LeslieLeung/glean github 751 2026-02-17
CodeGraphContext/CodeGraphContext github 750 2026-02-17
mbailey/voicemode github 738 2026-02-18
iosifache/annas-mcp github 730 2026-01-14
bgauryy/octocode-mcp github 729 2026-02-11
Cyfrin/aderyn github 727 2026-02-15
Joooook/12306-mcp github 724 2025-10-10
LSTM-Kirigaya/openmcp-client github 724 2026-02-16
BigSweetPotatoStudio/HyperChat github 712 2025-08-18
dadbodgeoff/drift github 709 2026-02-13
lgazo/drawio-mcp-server github 695 2026-02-12
zcaceres/fetch-mcp github 694 2025-09-02
xing5/mcp-google-sheets github 679 2026-01-31
adhikasp/mcp-client-cli github 665 2025-12-02
Kymo-MCP/mcpcan github 661 2026-02-06
rusiaaman/wcgw github 642 2026-02-11
tbxark/mcp-proxy github 636 2026-01-26
universal-tool-calling-protocol/python-utcp github 636 2025-12-03
mondaycom/vibe github 613 2026-02-16
thorsten/phpMyFAQ github 612 2026-02-17
elastic/mcp-server-elasticsearch github 610 2026-02-17
vercel/next-devtools-mcp github 614 2026-02-12
asaotomo/FofaMap github 599 2026-01-22
mark3labs/mcp-filesystem-server github 599 2025-11-24
cyberkaida/reverse-engineering-assistant github 599 2026-02-09
wgpsec/cloudsword github 595 2026-02-03
joreilly/FantasyPremierLeague github 593 2026-02-17
automation-ai-labs/mcp-link github 599 2025-04-03
TM9657/flow-like github 587 2026-02-17
joreilly/GalwayBus github 581 2026-02-17
jacksteamdev/obsidian-mcp-tools github 580 2025-07-13
truffle-ai/dexto github 580 2026-02-17
rekog-labs/MCP-Nest github 578 2026-02-16
ktnyt/cclsp github 565 2026-01-23
spences10/mcp-sequentialthinking-tools github 564 2026-02-10
etsd-tech/mcp-pointer github 559 2025-11-12
getsentry/sentry-mcp github 557 2026-02-12
Canner/wren-engine github 554 2026-02-16
cyproxio/mcp-for-security github 550 2026-02-14
web-agent-master/google-search github 550 2025-04-06
jonigl/mcp-client-for-ollama github 538 2026-02-10
marcoaapfortes/Mantic.sh github 538 2026-02-04
GoogleCloudPlatform/cloud-run-mcp github 535 2026-02-16
irockel/tda github 535 2026-02-16
neka-nat/freecad-mcp github 527 2026-02-11
samuelgursky/davinci-resolve-mcp github 527 2026-01-27
ferrislucas/iterm-mcp github 525 2025-09-20
mihaelamj/cupertino github 525 2026-02-05
subnetmarco/pgmcp github 522 2025-09-25
line/line-bot-mcp-server github 520 2026-02-17
MxIris-Reverse-Engineering/ida-mcp-server github 516 2025-05-29
centralmind/gateway github 516 2025-07-18
alpacahq/alpaca-mcp-server github 511 2026-02-03
maquina-app/rails-mcp-server github 503 2026-02-16
marmotdata/marmot github 502 2026-02-17
SylphxAI/pdf-reader-mcp github 498 2026-02-16
chroma-core/chroma-mcp github 494 2025-09-17
magicyuan876/mineru-tianshu github 493 2026-01-22
bitbonsai/mcp-obsidian github 493 2026-02-16
anaisbetts/mcp-youtube github 491 2025-03-20
dbt-labs/dbt-mcp github 492 2026-02-17
ggozad/haiku.rag github 485 2026-02-17
jasonjmcghee/claude-debugs-for-you github 485 2025-12-20
makafeli/n8n-workflow-builder github 485 2025-08-04
OTA-Tech-AI/web-agent-protocol github 491 2025-06-19
Wh0am123/MCP-Kali-Server github 481 2026-01-20
yuniko-software/minecraft-mcp-server github 480 2026-02-16
controlplaneio-fluxcd/flux-operator github 479 2026-02-17
jtang613/GhidrAssistMCP github 478 2026-02-15
whchien/ai-trader github 475 2026-02-04
Softeria/ms-365-mcp-server github 471 2026-02-15
AmeNetwork/aser github 469 2025-10-11
cyanheads/atlas-mcp-server github 468 2025-07-22
PV-Bhat/vibe-check-mcp-server github 466 2026-02-11
VikashLoomba/copilot-mcp github 465 2026-02-16
jina-ai/MCP github 464 2026-01-29
ihor-sokoliuk/mcp-searxng github 461 2026-02-17
Nexus-Router/nexus github 459 2026-02-05
director-run/director github 459 2026-01-08
baryhuang/mcp-remote-macos-use github 458 2025-06-10
agentic-community/mcp-gateway-registry github 449 2026-02-18
fforster/gitlab-mcp gitlab 83 2026-02-17
goern/forgejo-mcp codeberg 32 2026-02-11
lx-industries/rmcp-openapi gitlab 10 2026-02-17
lx-industries/rmcp-actix-web gitlab 6 2026-02-17
byteflavour/goern-forgejo-mcp-flake codeberg 3 2026-02-08
goern/bundestag-mcp codeberg 1 2025-03-14
hectorm/capek codeberg 1 2026-02-17
luxferre/EXAI codeberg 1 2025-10-31
maloma/mcp codeberg 1 2026-01-23
nevermindinventory/everything-claude-code codeberg 1 2026-01-23
ropzislaw/AIPex codeberg 1 2025-12-01
cjoprey/obsidian-mcp-server gitlab 1 2026-02-16
universalamateur1/reclaim-mcp-server gitlab 1 2026-02-15
gitlab-da/use-cases/ai/ai-research/ollama/ollama-rag-tool-agents-mcp-first-learning-steps gitlab 2 2026-02-09
kalilinux/packages/mcp-kali-server gitlab 2 2026-01-20
piatra-open-source-projects/gitlab-mcp-server gitlab 2 2026-01-29
shebe-oss/shebe gitlab 1 2026-02-05
gitlab-learn-labs/webinars/gitlab-duo-agent-platform/duo-mcp-workshop gitlab 2 2026-01-09
TheWolfOfChain/mcp-T2M gitlab 1 2025-12-01
adem.sammoudi1409/mcp-server gitlab 1 2025-12-06
heingroup/ivoryos-suite/ivoryos-mcp gitlab 1 2025-12-04
masx200/gitlab-mcp-server gitlab 1 2026-01-04
nso-developer/cisco-nso-mcp-server gitlab 1 2025-12-21
phd5392441/mcp4bim gitlab 1 2026-01-08
CochainComplex/tractatus-thinking gitlab 3 2025-08-29
UniversalStandards/mcp-mcp gitlab 1 2025-10-22
mionemedia/gitlab-mcp-server gitlab 1 2025-07-15

Coding CLIs (27)

Repository Host Stars Updated
CherryHQ/cherry-studio github 39898 2026-02-17
iOfficeAI/AionUi github 16266 2026-02-17
plandex-ai/plandex github 14992 2025-10-03
github/copilot-cli github 8462 2026-02-17
modelscope/FunClip github 5344 2025-07-11
ericc-ch/copilot-api github 2497 2025-11-10
FullAgent/fulling github 2373 2026-02-11
TimmyOVO/deepseek-ocr.rs github 2132 2026-02-12
instructa/ai-prompts github 1003 2025-03-28
kbwo/ccmanager github 861 2026-02-12
ekkinox/yai github 856 2024-07-31
RAIT-09/obsidian-agent-client github 750 2026-02-17
dustinblackman/oatmeal github 744 2024-06-01
njbrake/agent-of-empires github 709 2026-02-18
vybestack/llxprt-code github 634 2026-02-18
addyosmani/git2txt github 549 2024-11-22
badlogic/pi-skills github 507 2026-02-02
ObservedObserver/async-code github 506 2025-11-18
Dicklesworthstone/coding_agent_session_search github 463 2026-02-18
plaintextaccounting/acceptarium codeberg 3 2026-02-17
Bekalah/codex-14499 codeberg 1 2026-02-10
Saegor/Georges codeberg 1 2026-02-13
cevangel/groqbash codeberg 1 2026-01-26
cyphersnake/agents-in-a-chest.nvim codeberg 1 2025-08-14
hiiamboris/red-codex codeberg 1 2025-08-23
x-cmd/skill codeberg 1 2025-11-04
yaadata/codex.nvim codeberg 1 2026-02-15

Gateway/Proxy (26)

Repository Host Stars Updated
apify/crawlee github 21727 2026-02-17
QuantumNous/new-api github 17748 2026-02-12
tensorzero/tensorzero github 10971 2026-02-18
apify/crawlee-python github 8080 2026-02-17
katanemo/plano github 5094 2026-02-17
algorithmicsuperintelligence/optillm github 3332 2026-01-28
looplj/axonhub github 1916 2026-02-17
APIParkLab/APIPark github 1555 2025-10-26
0xCrunchyy/10x github 1383 2026-01-05
bricks-cloud/BricksLLM github 1154 2025-01-05
KenyonY/openai-forward github 985 2025-03-15
Nya-Foundation/NyaProxy github 946 2025-07-07
Xerxes-2/clewdr github 938 2026-02-14
xing61/zzz-api github 901 2026-02-02
theopenco/llmgateway github 878 2026-02-17
rtk-ai/rtk github 843 2026-02-17
vllora/vllora github 776 2026-02-17
danilofalcao/cursor-deepseek github 602 2026-01-14
adaline/gateway github 577 2026-02-07
sgasser/pasteguard github 505 2026-02-09
kurama622/llm.nvim codeberg 1 2025-12-02
hongch6661/mix-web-demo gitlab 1 2026-02-17
soapbox-pub/claude-gateway gitlab 1 2026-01-08
icfoss/OpenIoT/lorawan-range-mapper_v1.0 gitlab 1 2024-07-12
my-aws-project/AWS-VPC-Automation gitlab 1 2025-01-13
tenergy_development_team/tenergy32hub_loragateway_pressure gitlab 1 2025-09-20

Other (17)

Repository Host Stars Updated
CopilotKit/CopilotKit github 28840 2026-02-18
e2b-dev/E2B github 10919 2026-02-18
Integuru-AI/Integuru github 4539 2026-02-12
Intelligent-Internet/ii-agent github 3159 2026-02-04
i-am-bee/beeai-framework github 3106 2026-02-13
BAAI-Agents/Cradle github 2451 2024-11-07
LjyYano/Thinking_in_Java_MindMapping github 1639 2026-01-18
devnen/Chatterbox-TTS-Server github 1024 2026-02-12
stakpak/agent github 956 2026-02-18
litanlitudan/skyagi github 787 2023-09-21
gety-ai/apple-on-device-openai github 770 2025-10-02
Lex-au/Orpheus-FastAPI github 660 2025-07-05
normal-computing/fuji-web github 584 2026-01-06
oil-oil/wolfcha github 513 2026-02-18
no_miti/eleventy-plugin-lollmao codeberg 1 2025-09-29
pvlerick/build-a-llm-from-scratch codeberg 1 2026-02-17
timedin/ExecAI codeberg 1 2024-11-11

Local Runtime (13)

Repository Host Stars Updated
badlogic/pi-mono github 13185 2026-02-17
algorithmicsuperintelligence/openevolve github 5395 2026-02-04
Michael-A-Kuykendall/shimmy github 3681 2026-01-16
mostlygeek/llama-swap github 2416 2026-02-16
vm0-ai/vm0 github 1006 2026-02-18
shyamsaktawat/OpenAlpha_Evolve github 970 2025-05-31
Pavelevich/llm-checker github 629 2026-02-17
EricLBuehler/candle-vllm github 598 2026-01-28
devflowinc/uzi github 561 2025-06-04
Chen-zexi/vllm-cli github 472 2026-01-25
tusharhero/ollama-guix codeberg 13 2025-04-29
1ndevelopment/termux-utilities codeberg 1 2026-01-31
leozqin/precis codeberg 1 2025-05-30

Retrieval/Index (13)

Repository Host Stars Updated
sigoden/aichat github 9312 2026-01-29
opactorai/Claudable github 3732 2025-12-04
LazyAGI/LazyLLM github 3725 2026-02-14
smallcloudai/refact github 3477 2026-02-17
aiflowy/aiflowy github 2032 2026-02-11
madroidmaq/mlx-omni-server github 662 2025-12-21
51bitquant/ai-hedge-fund-crypto github 511 2025-09-05
giselles-ai/giselle github 482 2026-02-18
akselmo/artificial-rage codeberg 27 2025-08-23
beitragsrunde/beitragsrunde codeberg 5 2026-02-13
pmakholm/bragi codeberg 2 2024-12-09
danielsreichenbach/ansible-role-garage codeberg 1 2026-01-19
wordpress-premium/wpbakery-page-builder gitlab 1 2024-10-26

Context Compression (10)

Repository Host Stars Updated
mufeedvh/code2prompt github 7138 2026-02-16
memodb-io/Acontext github 3016 2026-02-17
entireio/cli github 2618 2026-02-18
pchalasani/claude-code-tools github 1452 2026-02-16
DeepMyst/Mysti github 888 2025-12-28
context-machine-lab/sleepless-agent github 798 2026-01-01
massgen/MassGen github 744 2026-02-16
marv1nnnnn/llm-min.txt github 670 2025-10-05
gmickel/gmickel-claude-marketplace github 512 2026-02-17
NeoLabHQ/context-engineering-kit github 482 2026-02-16

Prompt/Eval (10)

Repository Host Stars Updated
raga-ai-hub/RagaAI-Catalyst github 16099 2026-02-11
latitude-dev/latitude-llm github 3894 2026-02-17
Agenta-AI/agenta github 3849 2026-02-17
openlit/openlit github 2210 2026-02-17
microsoft/prompty github 1158 2026-02-14
thu-coai/Safety-Prompts github 1127 2024-02-27
EmbeddedLLM/JamAIBase github 1081 2026-02-10
Doriandarko/RepoToTextForLLMs github 784 2024-05-21
melindachang/asoiaf-llm-benchmarking codeberg 1 2025-08-20
khaytsus/chatdb gitlab 1 2025-12-02

Orchestration (8)

Repository Host Stars Updated
frankbria/ralph-claude-code github 6977 2026-02-17
UfoMiao/zcf github 5495 2026-02-05
rllm-org/rllm github 5109 2026-02-17
mindfold-ai/Trellis github 2239 2026-02-17
covibes/zeroshot github 1175 2026-02-17
DmitrySolana/Molt-Pi-Maker github 1095 2026-02-01
yohey-w/multi-agent-shogun github 904 2026-02-18
softaworks/agent-toolkit github 607 2026-02-08