Binary Tool Landscape for Token-Efficient LLM Local Dev (2026-02-17)¶
Scope (Track 1 of 3)¶
This is Track 1/3 in the research set:
1. Binary tools (coding agents, local CLIs, proxies/gateways, local runtime/control plane).
2. Cloud services (see docs/paid-cloud-services-token-efficiency-landscape.md).
3. Skills, MCP, and ecosystem (see docs/coding-agent-skill-usage-and-complex-task-playbook.md).
1. Executive summary¶
- Put context compaction before model calls: use
rtkfor runtime terminal-output reduction andrepomix/files-to-promptfor deterministic repo-context packing; this cuts low-signal tokens before they ever hit model APIs. rtk repomix files-to-prompt - Route all coding-agent traffic through a local LiteLLM proxy with response/semantic cache and explicit fallback routing; this is the highest-ROI local control point for token and cost policy. LiteLLM Caching LiteLLM Routing
- For repeated long-prefix workloads, pair local inference (vLLM/Ollama/llama.cpp server) with OpenAI-compatible endpoints and prefix reuse where supported; this lowers prefill overhead and keeps iteration local. vLLM APC Ollama OpenAI compatibility llama.cpp
- Add self-hosted observability from day one (Langfuse or OpenLIT) so cache-hit, token, and cost deltas are measurable by route; otherwise token optimization efforts become guesswork. Langfuse token & cost tracking OpenLIT destination docs
- Treat retrieval as a local-token budget problem: use local vector stores (
Qdrant,sqlite-vec) and constrain retrieved payload before generation. Qdrant quickstart sqlite-vec
2. Categories and recommended shortlist¶
Terminal-output and context-compaction CLIs¶
rtk: runtime command rewriting + output compression for agent terminal calls; best when your token waste is in shell/test/log verbosity. rtkrepomix: one-shot repository packing with--compressand--token-count-tree; best before large repo-level analysis tasks. repomixfiles-to-prompt: deterministic file selection/formatting (--ignore,--extension,--line-numbers) for controlled prompt payloads. files-to-promptaiderrepo map: auto-ranked repository map with--map-tokensbudget control; best for iterative editing with token-aware global context. aider repo map
Local gateways and inference runtimes¶
LiteLLM Proxy: local AI gateway with response cache, semantic cache options, and routing/fallback policies. LiteLLM Caching LiteLLM RoutingvLLM: local serving with Automatic Prefix Caching (enable_prefix_caching=True) for shared-prompt workflows. vLLM APCOllama: local OpenAI-compatible API endpoint for local models and tool-calling support in existing clients. Ollama OpenAI compatibilityllama.cppserver: lightweight local OpenAI-compatible HTTP server for GGUF models. llama.cpp
Token/cost observability (self-hosted-first)¶
Langfuse: self-hostable tracing with usage/cost ingestion/inference and LiteLLM integration. Langfuse token & cost tracking Langfuse LiteLLM integration Langfuse self-host deploymentOpenLIT: OpenTelemetry-native self-hosted LLM observability with token/cost dashboards and CLI/SDK instrumentation. OpenLIT docs
Local retrieval/indexing utilities¶
Qdrantlocal: Docker-first local vector DB for retrieval pipelines. Qdrant quickstartsqlite-vec: in-process SQLite vector extension; useful for lightweight local retrieval without extra service overhead. sqlite-vec
Prompt/eval tooling in local workflows¶
promptfooCLI: local eval and provider-call caching to stop prompt regressions that inflate token use. Promptfoo caching Promptfoo CLItenkai: coding-agent experimentation framework for controlled A/B runs (config, prompt, and tool-chain alternatives) with repeatability and statistical tests. tenkai
Explicit rtk pass: adjacent “rtk-like” options¶
- Better than
rtkwhen you need one-shot repo context packaging (not runtime command filtering):repomix. repomix - Better than
rtkfor persistent edit sessions where the tool auto-manages global code context in a token budget:aiderrepo map. aider repo map - Better than
rtkfor deterministic, scriptable, minimal context selection in CI/local scripts:files-to-prompt. files-to-prompt - Better than all three for cross-provider token policy enforcement and caching in one place:
LiteLLM Proxy(gateway layer). LiteLLM
3. Comparison table (binary/local tools)¶
Scoring: 0-3 (higher is better). For setup/overhead, 3 means low complexity/low overhead.
| Tool | Category | Token reduction impact | Latency impact | Setup complexity | Operational overhead | Maturity (GitHub snapshot 2026-02-17 UTC) | License/safe adoption | Compatibility (Codex,Claude Code,LiteLLM,Bedrock,local models) |
|---|---|---|---|---|---|---|---|---|
| rtk | Runtime output compaction | 3 | 2 | 3 | 3 | 822★, pushed 2026-02-17 (api) | MIT | Codex: indirect (shell workflow, unverified); Claude Code: explicit hook docs; LiteLLM/Bedrock: indirect; local models: indirect |
| repomix | Repo packing/compression | 3 | 2 | 3 | 3 | 21,907★, pushed 2026-02-17 (api) | MIT | Codex/Claude Code: yes (pre-processed context files); LiteLLM/Bedrock: N/A direct; local models: yes |
| files-to-prompt | Deterministic context curation | 2 | 3 | 3 | 3 | 2,603★, pushed 2025-02-19 (api) | Apache-2.0 | Codex/Claude Code: yes; LiteLLM/Bedrock: N/A direct; local models: yes |
| aider repo map | Token-budgeted code mapping | 3 | 2 | 2 | 2 | 40,701★ (Aider-AI/aider), pushed 2026-02-16 (api) |
Apache-2.0 | Codex: pattern-compatible (unverified direct); Claude Code: pattern-compatible; LiteLLM/Bedrock/local: via provider config |
| LiteLLM | Gateway cache/routing | 3 | 2 | 2 | 2 | 36,194★, pushed 2026-02-17 (api) | NOASSERTION on API snapshot; verify in repo before enterprise rollout |
Codex: OpenAI-compatible path (unverified); Claude Code: via Anthropic model routing; LiteLLM: native; Bedrock: yes; local models: yes |
| vLLM | Local inference with prefix cache | 3 | 2 | 1 | 1 | 70,493★, pushed 2026-02-17 (api) | Apache-2.0 | Codex/Claude Code: via OpenAI-compatible endpoint + gateway (unverified direct); LiteLLM: yes; Bedrock: no direct; local models: yes |
| Ollama | Local model runtime | 2 | 2 | 3 | 3 | 162,782★, pushed 2026-02-17 (api) | MIT | Codex/Claude Code: via OpenAI-compatible endpoint (unverified direct); LiteLLM: yes; Bedrock: no direct; local models: native |
| Langfuse | Self-host token/cost observability | 2 | 1 | 1 | 1 | 22,006★, pushed 2026-02-17 (api) | NOASSERTION on API snapshot; verify in repo before enterprise rollout |
Codex/Claude Code: indirect; LiteLLM: documented integration; Bedrock/local: via telemetry ingestion |
| OpenLIT | Self-host LLM observability | 2 | 1 | 2 | 2 | 2,210★, pushed 2026-02-17 (api) | Apache-2.0 | Codex/Claude Code: indirect; LiteLLM/Bedrock/local: via OpenTelemetry/SDK path |
| Qdrant | Local retrieval store | 2 | 2 | 2 | 2 | 28,837★, pushed 2026-02-17 (api) | Apache-2.0 | Codex/Claude Code: indirect; LiteLLM: via semantic cache option; Bedrock/local: retrieval-side compatible |
| sqlite-vec | In-process retrieval/indexing | 2 | 3 | 3 | 3 | 6,937★, pushed 2026-02-13 (api) | Apache-2.0 | Codex/Claude Code/LiteLLM/Bedrock: indirect; local models: retrieval-side compatible |
| promptfoo | Local eval + caching | 1 | 2 | 3 | 2 | 10,488★, pushed 2026-02-17 (api) | MIT | Codex/Claude Code: workflow-level; LiteLLM/Bedrock/local: provider adapters/OpenAI-compatible paths |
4. Implementation playbook (local-first)¶
A. Install and baseline compaction tools¶
brew install rtk-ai/tap/rtk
npm i -g repomix
uv tool install files-to-prompt || pipx install files-to-prompt
# Verify tool-level capabilities
rtk gain || true
repomix --token-count-tree 500
files-to-prompt . --ignore "*.lock" --extension md --extension py --line-numbers > context.txt
B. Put LiteLLM in front as local policy gateway¶
# litellm_config.yaml
model_list:
- model_name: claude-fast
litellm_params:
model: anthropic/claude-sonnet-4-20250514
api_key: os.environ/ANTHROPIC_API_KEY
- model_name: local-vllm
litellm_params:
model: hosted_vllm/my-local-model
api_base: http://127.0.0.1:8000/v1
- model_name: local-ollama
litellm_params:
model: ollama/qwen3-coder
api_base: http://127.0.0.1:11434
litellm_settings:
cache: true
cache_params:
type: redis
ttl: 600
router_settings:
routing_strategy: cost-based-routing
litellm --config ./litellm_config.yaml
C. Run local inference backends with prefix reuse where applicable¶
# vLLM example (prefix caching enabled)
vllm serve meta-llama/Llama-3.1-8B-Instruct --enable-prefix-caching
# Ollama example
ollama pull qwen3-coder
ollama serve
D. Add token/cost observability and dashboards¶
# Langfuse k8s helm (self-hosted)
helm repo add langfuse https://langfuse.github.io/langfuse-k8s
helm repo update
helm install langfuse langfuse/langfuse -n langfuse
# LiteLLM -> Langfuse callback (in litellm config)
# litellm_settings:
# callbacks: ["langfuse_otel"]
# OpenLIT zero-code instrumentation
openlit-instrument --otlp-endpoint "http://localhost:4318" --service-name "agent-stack" python app.py
E. Enforce context budgets in CI¶
# promptfooconfig.yaml (example)
prompts:
- file://prompt.md
providers:
- openai:gpt-4.1-mini
# Keep repeat eval calls cached locally
commandLineOptions:
cache: true
promptfoo eval
promptfoo cache clear # use during controlled invalidation
5. Ideal binary stack for Codex/Claude Code + LiteLLM¶
- Ingress layer: coding agent CLI (
Codex/Claude Code) with shell output reduced viartkand targeted file/context packaging viarepomixorfiles-to-prompt. rtk repomix files-to-prompt - Policy/gateway layer: local LiteLLM proxy for routing, response caching, and semantic cache-backed reuse. LiteLLM Caching LiteLLM Routing
- Inference layer: hybrid local (
vLLM/Ollama/llama.cpp) + remote provider routes selected by policy. vLLM APC Ollama llama.cpp - Retrieval layer: local vector service (
Qdrant) or embedded extension (sqlite-vec) for tight payload control. Qdrant quickstart sqlite-vec - Observability layer: self-hosted Langfuse/OpenLIT for route-by-route token and cost instrumentation. Langfuse token tracking OpenLIT docs
Compatibility note: direct Codex CLI base-URL override behavior is unverified in this report; treat Codex-specific proxy wiring as an implementation check item.
6. Adopt now / try next / monitor¶
Adopt now¶
rtk + LiteLLM cache/routing + Langfuse/OpenLITas the minimum token-efficiency control plane.repomix --token-count-treein preflight scripts for large-repo tasks.- Prompt/eval cache gates in CI with
promptfoo.
Try next¶
vLLMprefix-caching routes for repeated static-prefix coding tasks.Qdrantsemantic cache through LiteLLM for repeated near-duplicate requests.aiderrepo map token-budget patterns in long multi-file edits.
Monitor¶
- New context-compression proxies similar to
rtk(quality regression risk if compression removes high-signal details). - Licensing/enterprise terms for tools that show
NOASSERTIONin API metadata snapshots. - Gateway lock-in risk if custom routing policy becomes overly provider-specific.
7. Appendix (search log + links)¶
All links below were accessed on 2026-02-17 unless otherwise noted.
Required discovery sources¶
- Hacker News / Show HN:
- Show HN: liteLLM Proxy Server
- Show HN: Headroom (OSS): Cuts LLM costs by 85%
- Show HN: Mini-swe-agent
- Lobsters:
- Prompt caching: 10x cheaper LLM tokens, but how?
- Claude Skills may be a bigger deal than MCP
- Using LLMs at Oxide
- GitHub Trending and Search:
- GitHub Trending
- Repository search: llm gateway
- rtk repository
- Reddit (required subreddits):
- r/ClaudeAI: rtk token savings post
- r/LocalLLaMA: local coding agent context post
- r/LLMDevs: prompt caching usage
- r/MachineLearning: LLM gateway discussion
- r/OpenAI: prompt caching thread
- Awesome lists:
- Awesome-LLMOps (tensorchord)
- awesome-llmops (awesomelistsio)
- Awesome-LLM-RAG
- awesome-claude-code
Primary implementation references used in recommendations¶
- rtk README
- Repomix README
- files-to-prompt README
- aider repo map docs
- LiteLLM docs root
- vLLM docs
- Ollama docs
- Langfuse docs
- OpenLIT docs
- Qdrant docs
- sqlite-vec repo
- promptfoo docs
8. Aggressive Hunt Expansion (Iteration 3: Cross-Host + Cross-Check)¶
Last automated refresh: 2026-02-18 02:43 UTC
This section replaces the prior merged sample with a larger, cross-host catalog and an independent second-pass cross-check.
Iteration summary:
1. Pass 1 (GitHub + GitLab + Codeberg):
- Raw crawl inputs: 11,543 GitHub repos, 507 GitLab repos, 423 Codeberg repos.
- Binary-relevant merged pool after dedupe/filtering: 12,109 repos.
- Ranked shortlist generated: 430 (360 GitHub, 40 GitLab, 30 Codeberg).
2. Pass 2 (GitLab + Codeberg alternate query vocabulary, independent cross-check):
- Raw crawl inputs: 305 GitLab repos, 0 Codeberg repos returned for this query set.
- Binary-relevant filtered set: 288.
- Net-new vs pass 1 merged pool: 273.
3. Final ranking after cross-check merge and cleanup filters:
- Stronger AI-signal filter (llm/mcp/openai/claude/codex/ollama/vllm/prompt/rag/vector/litellm/...) to reduce generic false positives.
- Non-GitHub quality guard for final inclusion: stars > 0 and recent activity window.
- Final catalog size: 414 tools (360 GitHub, 26 GitLab, 28 Codeberg).
4. Note: the GitHub crawl hit secondary rate limiting near the end of topic queries; this was mitigated by cross-host expansion and independent pass-2 validation.
Pinned inclusion check (requested):
- danicat/tenkai (11 stars, updated 2026-02-01): experimentation framework for coding agents, useful for controlled A/B comparisons of toolchains/prompts.
Category mix (Top 414):
| Category | Count |
|---|---|
| MCP Tooling | 290 |
| Coding CLIs | 27 |
| Gateway/Proxy | 26 |
| Other | 17 |
| Local Runtime | 13 |
| Retrieval/Index | 13 |
| Context Compression | 10 |
| Prompt/Eval | 10 |
| Orchestration | 8 |
Category × host breakdown (Top 414):
| Category | Total | GitHub | GitLab | Codeberg |
|---|---|---|---|---|
| MCP Tooling | 290 | 263 | 19 | 8 |
| Coding CLIs | 27 | 19 | 0 | 8 |
| Gateway/Proxy | 26 | 20 | 5 | 1 |
| Other | 17 | 14 | 0 | 3 |
| Local Runtime | 13 | 10 | 0 | 3 |
| Retrieval/Index | 13 | 8 | 1 | 4 |
| Context Compression | 10 | 10 | 0 | 0 |
| Prompt/Eval | 10 | 8 | 1 | 1 |
| Orchestration | 8 | 8 | 0 | 0 |
Comprehensive catalog grouped by category (Top 414):
MCP Tooling (290)¶
| Repository | Host | Stars | Updated |
|---|---|---|---|
| n8n-io/n8n | github | 174983 | 2026-02-18 |
| google-gemini/gemini-cli | github | 94741 | 2026-02-18 |
| modelcontextprotocol/servers | github | 78862 | 2026-02-14 |
| sansan0/TrendRadar | github | 46469 | 2026-02-09 |
| upstash/context7 | github | 46001 | 2026-02-17 |
| zhayujie/chatgpt-on-wechat | github | 41304 | 2026-02-16 |
| BerriAI/litellm | github | 36202 | 2026-02-18 |
| bytedance/UI-TARS-desktop | github | 28042 | 2026-01-14 |
| github/github-mcp-server | github | 27024 | 2026-02-17 |
| ChromeDevTools/chrome-devtools-mcp | github | 25794 | 2026-02-17 |
| assafelovic/gpt-researcher | github | 25339 | 2026-02-01 |
| jlowin/fastmcp | github | 22898 | 2026-02-18 |
| modelcontextprotocol/python-sdk | github | 21709 | 2026-02-17 |
| activepieces/activepieces | github | 20854 | 2026-02-17 |
| oraios/serena | github | 20326 | 2026-02-17 |
| 1Panel-dev/MaxKB | github | 20132 | 2026-02-12 |
| microsoft/mcp-for-beginners | github | 14399 | 2026-02-17 |
| ruvnet/claude-flow | github | 14156 | 2026-02-17 |
| triggerdotdev/trigger.dev | github | 13667 | 2026-02-17 |
| czlonkowski/n8n-mcp | github | 13593 | 2026-02-10 |
| modelcontextprotocol/typescript-sdk | github | 11618 | 2026-02-18 |
| tadata-org/fastapi_mcp | github | 11548 | 2025-11-24 |
| 0xJacky/nginx-ui | github | 10666 | 2026-02-18 |
| Portkey-AI/gateway | github | 10630 | 2026-01-26 |
| hangwin/mcp-chrome | github | 10407 | 2026-01-06 |
| JoeanAmier/XHS-Downloader | github | 10089 | 2026-02-11 |
| yusufkaraaslan/Skill_Seekers | github | 9592 | 2026-02-15 |
| mcp-use/mcp-use | github | 9180 | 2026-02-18 |
| D4Vinci/Scrapling | github | 9052 | 2026-02-18 |
| xpzouying/xiaohongshu-mcp | github | 8851 | 2026-02-17 |
| open-metadata/OpenMetadata | github | 8710 | 2026-02-18 |
| xinnan-tech/xiaozhi-esp32-server | github | 8613 | 2026-02-18 |
| awslabs/mcp | github | 8170 | 2026-02-18 |
| lastmile-ai/mcp-agent | github | 8035 | 2026-01-25 |
| Upsonic/Upsonic | github | 7784 | 2026-02-17 |
| firerpa/lamda | github | 7612 | 2026-02-15 |
| idosal/git-mcp | github | 7597 | 2025-08-22 |
| AgentDeskAI/browser-tools-mcp | github | 7069 | 2025-03-26 |
| 0x4m4/hexstrike-ai | github | 6895 | 2025-11-06 |
| modelcontextprotocol/registry | github | 6428 | 2026-02-17 |
| grab/cursor-talk-to-figma-mcp | github | 6314 | 2026-01-28 |
| CoplayDev/unity-mcp | github | 6021 | 2026-02-17 |
| BrowserMCP/mcp | github | 5795 | 2025-04-24 |
| Klavis-AI/klavis | github | 5635 | 2026-02-17 |
| mrexodia/ida-pro-mcp | github | 5587 | 2026-02-08 |
| firecrawl/firecrawl-mcp-server | github | 5519 | 2026-02-17 |
| executeautomation/mcp-playwright | github | 5227 | 2025-12-13 |
| nanbingxyz/5ire | github | 5023 | 2026-01-23 |
| FunnyWolf/Viper | github | 4959 | 2026-01-18 |
| casibase/casibase | github | 4440 | 2026-02-15 |
| u14app/deep-research | github | 4430 | 2026-02-10 |
| getsentry/XcodeBuildMCP | github | 4336 | 2026-02-17 |
| httprunner/httprunner | github | 4256 | 2025-12-11 |
| wgpsec/ENScan_GO | github | 4214 | 2026-01-05 |
| UnicomAI/wanwu | github | 4104 | 2026-02-10 |
| txn2/kubefwd | github | 4043 | 2026-02-18 |
| open-webui/mcpo | github | 3975 | 2025-12-08 |
| modelcontextprotocol/csharp-sdk | github | 3905 | 2026-02-18 |
| Pimzino/spec-workflow-mcp | github | 3888 | 2026-02-08 |
| modelcontextprotocol/go-sdk | github | 3874 | 2026-02-17 |
| exa-labs/exa-mcp-server | github | 3812 | 2026-02-17 |
| groupultra/telegram-search | github | 3747 | 2026-02-17 |
| antvis/mcp-server-chart | github | 3677 | 2026-01-22 |
| opensumi/core | github | 3598 | 2026-02-04 |
| archestra-ai/archestra | github | 3574 | 2026-02-18 |
| campfirein/cipher | github | 3519 | 2026-01-25 |
| dinoki-ai/osaurus | github | 3503 | 2026-02-18 |
| mobile-next/mobile-mcp | github | 3450 | 2026-02-03 |
| zebbern/claude-code-guide | github | 3422 | 2026-02-18 |
| icip-cas/PPTAgent | github | 3342 | 2026-02-17 |
| haris-musa/excel-mcp-server | github | 3340 | 2026-01-19 |
| liaokongVFX/MCP-Chinese-Getting-Started-Guide | github | 3330 | 2025-04-23 |
| IBM/mcp-context-forge | github | 3285 | 2026-02-18 |
| modelcontextprotocol/java-sdk | github | 3180 | 2026-02-17 |
| huangjunsen0406/py-xiaozhi | github | 3178 | 2026-01-07 |
| opensolon/solon | github | 2705 | 2026-02-15 |
| microsoft/mcp | github | 2612 | 2026-02-18 |
| go-nunu/nunu | github | 2544 | 2025-08-25 |
| davepoon/buildwithclaude | github | 2442 | 2026-02-17 |
| zcaceres/markdownify-mcp | github | 2411 | 2026-02-11 |
| maximhq/bifrost | github | 2385 | 2026-02-18 |
| sparfenyuk/mcp-proxy | github | 2263 | 2026-02-17 |
| deedy5/ddgs | github | 2192 | 2026-02-14 |
| lemonade-sdk/lemonade | github | 2183 | 2026-02-18 |
| bytebase/dbhub | github | 2128 | 2026-02-03 |
| brightdata/brightdata-mcp | github | 2033 | 2026-02-05 |
| metatool-ai/metamcp | github | 2032 | 2026-02-08 |
| cjo4m06/mcp-shrimp-task-manager | github | 2023 | 2025-08-21 |
| jamubc/gemini-mcp-tool | github | 1988 | 2025-11-25 |
| aws/amazon-q-developer-cli | github | 1894 | 2026-02-03 |
| samanhappy/mcphub | github | 1809 | 2026-02-11 |
| martin-ger/esp32_nat_router | github | 1789 | 2026-02-17 |
| genieincodebottle/generative-ai | github | 1778 | 2026-02-17 |
| mcp-router/mcp-router | github | 1747 | 2026-01-24 |
| OpenAgentPlatform/Dive | github | 1732 | 2026-02-13 |
| ravitemer/mcphub.nvim | github | 1720 | 2026-01-18 |
| glidea/zenfeed | github | 1644 | 2025-11-22 |
| stacklok/toolhive | github | 1601 | 2026-02-18 |
| GongRzhe/Office-Word-MCP-Server | github | 1595 | 2025-12-31 |
| mark3labs/mcphost | github | 1554 | 2026-02-15 |
| timescale/pg-aiguide | github | 1541 | 2026-02-03 |
| GongRzhe/Office-PowerPoint-MCP-Server | github | 1513 | 2025-12-31 |
| f/mcptools | github | 1484 | 2025-12-18 |
| isaacphi/mcp-language-server | github | 1447 | 2026-02-01 |
| rulego/rulego | github | 1430 | 2026-02-05 |
| taylorwilsdon/google_workspace_mcp | github | 1414 | 2026-02-18 |
| chongdashu/unreal-mcp | github | 1401 | 2025-04-22 |
| korotovsky/slack-mcp-server | github | 1370 | 2026-02-12 |
| MicrosoftDocs/mcp | github | 1364 | 2026-02-12 |
| modelcontextprotocol/php-sdk | github | 1359 | 2026-02-18 |
| doobidoo/mcp-memory-service | github | 1343 | 2026-02-17 |
| CoderGamester/mcp-unity | github | 1330 | 2026-02-15 |
| universal-tool-calling-protocol/code-mode | github | 1313 | 2026-02-08 |
| BlackSnufkin/LitterBox | github | 1307 | 2025-11-12 |
| zinja-coder/jadx-ai-mcp | github | 1263 | 2026-01-24 |
| modelcontextprotocol/kotlin-sdk | github | 1258 | 2026-02-17 |
| MiniMax-AI/MiniMax-MCP | github | 1255 | 2026-01-07 |
| modelcontextprotocol/swift-sdk | github | 1247 | 2026-02-17 |
| refreshdotdev/web-eval-agent | github | 1235 | 2026-02-11 |
| maxritter/claude-pilot | github | 1237 | 2026-02-17 |
| qdrant/mcp-server-qdrant | github | 1222 | 2026-01-28 |
| benborla/mcp-server-mysql | github | 1209 | 2026-01-27 |
| grafbase/grafbase | github | 1207 | 2026-01-23 |
| NPC-Worldwide/npcpy | github | 1196 | 2026-02-16 |
| metoro-io/mcp-golang | github | 1192 | 2025-09-02 |
| Devin-AXIS/A2V | github | 1201 | 2025-11-12 |
| moyangzhan/langchain4j-aideepin | github | 1165 | 2026-01-23 |
| containers/kubernetes-mcp-server | github | 1159 | 2026-02-17 |
| joey-zhou/xiaozhi-esp32-server-java | github | 1139 | 2026-01-13 |
| designcomputer/mysql_mcp_server | github | 1131 | 2025-06-05 |
| AI-QL/tuui | github | 1126 | 2026-02-14 |
| moltis-org/moltis | github | 1116 | 2026-02-17 |
| chunkhound/chunkhound | github | 1039 | 2026-02-17 |
| dmayboroda/minima | github | 1037 | 2026-01-22 |
| chatmcp/mcp-server-chatsum | github | 1033 | 2024-12-04 |
| caol64/wenyan-mcp | github | 1024 | 2026-01-30 |
| cgoinglove/better-chatbot | github | 1020 | 2026-01-31 |
| sheshbabu/zen | github | 1020 | 2025-12-16 |
| IvanMurzak/Unity-MCP | github | 1018 | 2026-02-17 |
| robotmcp/ros-mcp-server | github | 1011 | 2026-02-17 |
| GongRzhe/Gmail-MCP-Server | github | 1008 | 2025-08-06 |
| openops-cloud/openops | github | 991 | 2026-02-17 |
| ref-tools/ref-tools-mcp | github | 978 | 2026-01-09 |
| iFurySt/RedNote-MCP | github | 969 | 2025-05-11 |
| gyoridavid/short-video-maker | github | 968 | 2025-06-21 |
| svnscha/mcp-windbg | github | 964 | 2025-12-26 |
| patruff/ollama-mcp-bridge | github | 962 | 2025-04-20 |
| JetBrains/mcp-jetbrains | github | 941 | 2026-01-07 |
| inkeep/agents | github | 935 | 2026-02-18 |
| mongodb-js/mongodb-mcp-server | github | 918 | 2026-02-17 |
| splx-ai/agentic-radar | github | 912 | 2025-11-27 |
| SecretiveShell/MCP-Bridge | github | 905 | 2025-12-08 |
| kimsungwhee/apple-docs-mcp | github | 905 | 2025-11-08 |
| QuantGeekDev/mcp-framework | github | 900 | 2026-02-05 |
| neo4j-contrib/mcp-neo4j | github | 900 | 2026-02-03 |
| datalayer/jupyter-mcp-server | github | 896 | 2026-02-06 |
| kardolus/chatgpt-cli | github | 889 | 2026-02-04 |
| stickerdaniel/linkedin-mcp-server | github | 888 | 2026-02-17 |
| can1357/oh-my-pi | github | 877 | 2026-02-18 |
| asheshgoplani/agent-deck | github | 873 | 2026-02-17 |
| Azure-Samples/AI-Gateway | github | 866 | 2026-02-14 |
| alioshr/memory-bank-mcp | github | 861 | 2025-08-20 |
| hyper-mcp-rs/hyper-mcp | github | 858 | 2026-02-17 |
| mcpjungle/MCPJungle | github | 858 | 2026-02-16 |
| microsoft/wassette | github | 841 | 2026-02-16 |
| negokaz/excel-mcp-server | github | 836 | 2025-07-19 |
| zaidmukaddam/scira-mcp-chat | github | 830 | 2025-12-11 |
| rohitg00/kubectl-mcp-server | github | 821 | 2026-02-10 |
| ankorstore/yokai | github | 819 | 2026-01-22 |
| alexander-zuev/supabase-mcp-server | github | 813 | 2025-09-26 |
| golf-mcp/golf | github | 811 | 2026-01-31 |
| php-mcp/server | github | 811 | 2025-08-09 |
| ArcadeAI/arcade-mcp | github | 810 | 2026-02-18 |
| kontext-dev/browser-use-mcp-server | github | 805 | 2025-07-10 |
| dyoshikawa/rulesync | github | 806 | 2026-02-18 |
| nickclyde/duckduckgo-mcp-server | github | 803 | 2025-03-11 |
| context-space/context-space | github | 800 | 2025-10-22 |
| Davidyz/VectorCode | github | 799 | 2025-12-26 |
| erwinkramer/bank-api | github | 792 | 2026-02-14 |
| apify/apify-mcp-server | github | 785 | 2026-02-17 |
| johnhuang316/code-index-mcp | github | 780 | 2026-01-09 |
| browser-use/vibetest-use | github | 764 | 2025-09-02 |
| LeslieLeung/glean | github | 751 | 2026-02-17 |
| CodeGraphContext/CodeGraphContext | github | 750 | 2026-02-17 |
| mbailey/voicemode | github | 738 | 2026-02-18 |
| iosifache/annas-mcp | github | 730 | 2026-01-14 |
| bgauryy/octocode-mcp | github | 729 | 2026-02-11 |
| Cyfrin/aderyn | github | 727 | 2026-02-15 |
| Joooook/12306-mcp | github | 724 | 2025-10-10 |
| LSTM-Kirigaya/openmcp-client | github | 724 | 2026-02-16 |
| BigSweetPotatoStudio/HyperChat | github | 712 | 2025-08-18 |
| dadbodgeoff/drift | github | 709 | 2026-02-13 |
| lgazo/drawio-mcp-server | github | 695 | 2026-02-12 |
| zcaceres/fetch-mcp | github | 694 | 2025-09-02 |
| xing5/mcp-google-sheets | github | 679 | 2026-01-31 |
| adhikasp/mcp-client-cli | github | 665 | 2025-12-02 |
| Kymo-MCP/mcpcan | github | 661 | 2026-02-06 |
| rusiaaman/wcgw | github | 642 | 2026-02-11 |
| tbxark/mcp-proxy | github | 636 | 2026-01-26 |
| universal-tool-calling-protocol/python-utcp | github | 636 | 2025-12-03 |
| mondaycom/vibe | github | 613 | 2026-02-16 |
| thorsten/phpMyFAQ | github | 612 | 2026-02-17 |
| elastic/mcp-server-elasticsearch | github | 610 | 2026-02-17 |
| vercel/next-devtools-mcp | github | 614 | 2026-02-12 |
| asaotomo/FofaMap | github | 599 | 2026-01-22 |
| mark3labs/mcp-filesystem-server | github | 599 | 2025-11-24 |
| cyberkaida/reverse-engineering-assistant | github | 599 | 2026-02-09 |
| wgpsec/cloudsword | github | 595 | 2026-02-03 |
| joreilly/FantasyPremierLeague | github | 593 | 2026-02-17 |
| automation-ai-labs/mcp-link | github | 599 | 2025-04-03 |
| TM9657/flow-like | github | 587 | 2026-02-17 |
| joreilly/GalwayBus | github | 581 | 2026-02-17 |
| jacksteamdev/obsidian-mcp-tools | github | 580 | 2025-07-13 |
| truffle-ai/dexto | github | 580 | 2026-02-17 |
| rekog-labs/MCP-Nest | github | 578 | 2026-02-16 |
| ktnyt/cclsp | github | 565 | 2026-01-23 |
| spences10/mcp-sequentialthinking-tools | github | 564 | 2026-02-10 |
| etsd-tech/mcp-pointer | github | 559 | 2025-11-12 |
| getsentry/sentry-mcp | github | 557 | 2026-02-12 |
| Canner/wren-engine | github | 554 | 2026-02-16 |
| cyproxio/mcp-for-security | github | 550 | 2026-02-14 |
| web-agent-master/google-search | github | 550 | 2025-04-06 |
| jonigl/mcp-client-for-ollama | github | 538 | 2026-02-10 |
| marcoaapfortes/Mantic.sh | github | 538 | 2026-02-04 |
| GoogleCloudPlatform/cloud-run-mcp | github | 535 | 2026-02-16 |
| irockel/tda | github | 535 | 2026-02-16 |
| neka-nat/freecad-mcp | github | 527 | 2026-02-11 |
| samuelgursky/davinci-resolve-mcp | github | 527 | 2026-01-27 |
| ferrislucas/iterm-mcp | github | 525 | 2025-09-20 |
| mihaelamj/cupertino | github | 525 | 2026-02-05 |
| subnetmarco/pgmcp | github | 522 | 2025-09-25 |
| line/line-bot-mcp-server | github | 520 | 2026-02-17 |
| MxIris-Reverse-Engineering/ida-mcp-server | github | 516 | 2025-05-29 |
| centralmind/gateway | github | 516 | 2025-07-18 |
| alpacahq/alpaca-mcp-server | github | 511 | 2026-02-03 |
| maquina-app/rails-mcp-server | github | 503 | 2026-02-16 |
| marmotdata/marmot | github | 502 | 2026-02-17 |
| SylphxAI/pdf-reader-mcp | github | 498 | 2026-02-16 |
| chroma-core/chroma-mcp | github | 494 | 2025-09-17 |
| magicyuan876/mineru-tianshu | github | 493 | 2026-01-22 |
| bitbonsai/mcp-obsidian | github | 493 | 2026-02-16 |
| anaisbetts/mcp-youtube | github | 491 | 2025-03-20 |
| dbt-labs/dbt-mcp | github | 492 | 2026-02-17 |
| ggozad/haiku.rag | github | 485 | 2026-02-17 |
| jasonjmcghee/claude-debugs-for-you | github | 485 | 2025-12-20 |
| makafeli/n8n-workflow-builder | github | 485 | 2025-08-04 |
| OTA-Tech-AI/web-agent-protocol | github | 491 | 2025-06-19 |
| Wh0am123/MCP-Kali-Server | github | 481 | 2026-01-20 |
| yuniko-software/minecraft-mcp-server | github | 480 | 2026-02-16 |
| controlplaneio-fluxcd/flux-operator | github | 479 | 2026-02-17 |
| jtang613/GhidrAssistMCP | github | 478 | 2026-02-15 |
| whchien/ai-trader | github | 475 | 2026-02-04 |
| Softeria/ms-365-mcp-server | github | 471 | 2026-02-15 |
| AmeNetwork/aser | github | 469 | 2025-10-11 |
| cyanheads/atlas-mcp-server | github | 468 | 2025-07-22 |
| PV-Bhat/vibe-check-mcp-server | github | 466 | 2026-02-11 |
| VikashLoomba/copilot-mcp | github | 465 | 2026-02-16 |
| jina-ai/MCP | github | 464 | 2026-01-29 |
| ihor-sokoliuk/mcp-searxng | github | 461 | 2026-02-17 |
| Nexus-Router/nexus | github | 459 | 2026-02-05 |
| director-run/director | github | 459 | 2026-01-08 |
| baryhuang/mcp-remote-macos-use | github | 458 | 2025-06-10 |
| agentic-community/mcp-gateway-registry | github | 449 | 2026-02-18 |
| fforster/gitlab-mcp | gitlab | 83 | 2026-02-17 |
| goern/forgejo-mcp | codeberg | 32 | 2026-02-11 |
| lx-industries/rmcp-openapi | gitlab | 10 | 2026-02-17 |
| lx-industries/rmcp-actix-web | gitlab | 6 | 2026-02-17 |
| byteflavour/goern-forgejo-mcp-flake | codeberg | 3 | 2026-02-08 |
| goern/bundestag-mcp | codeberg | 1 | 2025-03-14 |
| hectorm/capek | codeberg | 1 | 2026-02-17 |
| luxferre/EXAI | codeberg | 1 | 2025-10-31 |
| maloma/mcp | codeberg | 1 | 2026-01-23 |
| nevermindinventory/everything-claude-code | codeberg | 1 | 2026-01-23 |
| ropzislaw/AIPex | codeberg | 1 | 2025-12-01 |
| cjoprey/obsidian-mcp-server | gitlab | 1 | 2026-02-16 |
| universalamateur1/reclaim-mcp-server | gitlab | 1 | 2026-02-15 |
| gitlab-da/use-cases/ai/ai-research/ollama/ollama-rag-tool-agents-mcp-first-learning-steps | gitlab | 2 | 2026-02-09 |
| kalilinux/packages/mcp-kali-server | gitlab | 2 | 2026-01-20 |
| piatra-open-source-projects/gitlab-mcp-server | gitlab | 2 | 2026-01-29 |
| shebe-oss/shebe | gitlab | 1 | 2026-02-05 |
| gitlab-learn-labs/webinars/gitlab-duo-agent-platform/duo-mcp-workshop | gitlab | 2 | 2026-01-09 |
| TheWolfOfChain/mcp-T2M | gitlab | 1 | 2025-12-01 |
| adem.sammoudi1409/mcp-server | gitlab | 1 | 2025-12-06 |
| heingroup/ivoryos-suite/ivoryos-mcp | gitlab | 1 | 2025-12-04 |
| masx200/gitlab-mcp-server | gitlab | 1 | 2026-01-04 |
| nso-developer/cisco-nso-mcp-server | gitlab | 1 | 2025-12-21 |
| phd5392441/mcp4bim | gitlab | 1 | 2026-01-08 |
| CochainComplex/tractatus-thinking | gitlab | 3 | 2025-08-29 |
| UniversalStandards/mcp-mcp | gitlab | 1 | 2025-10-22 |
| mionemedia/gitlab-mcp-server | gitlab | 1 | 2025-07-15 |
Coding CLIs (27)¶
| Repository | Host | Stars | Updated |
|---|---|---|---|
| CherryHQ/cherry-studio | github | 39898 | 2026-02-17 |
| iOfficeAI/AionUi | github | 16266 | 2026-02-17 |
| plandex-ai/plandex | github | 14992 | 2025-10-03 |
| github/copilot-cli | github | 8462 | 2026-02-17 |
| modelscope/FunClip | github | 5344 | 2025-07-11 |
| ericc-ch/copilot-api | github | 2497 | 2025-11-10 |
| FullAgent/fulling | github | 2373 | 2026-02-11 |
| TimmyOVO/deepseek-ocr.rs | github | 2132 | 2026-02-12 |
| instructa/ai-prompts | github | 1003 | 2025-03-28 |
| kbwo/ccmanager | github | 861 | 2026-02-12 |
| ekkinox/yai | github | 856 | 2024-07-31 |
| RAIT-09/obsidian-agent-client | github | 750 | 2026-02-17 |
| dustinblackman/oatmeal | github | 744 | 2024-06-01 |
| njbrake/agent-of-empires | github | 709 | 2026-02-18 |
| vybestack/llxprt-code | github | 634 | 2026-02-18 |
| addyosmani/git2txt | github | 549 | 2024-11-22 |
| badlogic/pi-skills | github | 507 | 2026-02-02 |
| ObservedObserver/async-code | github | 506 | 2025-11-18 |
| Dicklesworthstone/coding_agent_session_search | github | 463 | 2026-02-18 |
| plaintextaccounting/acceptarium | codeberg | 3 | 2026-02-17 |
| Bekalah/codex-14499 | codeberg | 1 | 2026-02-10 |
| Saegor/Georges | codeberg | 1 | 2026-02-13 |
| cevangel/groqbash | codeberg | 1 | 2026-01-26 |
| cyphersnake/agents-in-a-chest.nvim | codeberg | 1 | 2025-08-14 |
| hiiamboris/red-codex | codeberg | 1 | 2025-08-23 |
| x-cmd/skill | codeberg | 1 | 2025-11-04 |
| yaadata/codex.nvim | codeberg | 1 | 2026-02-15 |
Gateway/Proxy (26)¶
| Repository | Host | Stars | Updated |
|---|---|---|---|
| apify/crawlee | github | 21727 | 2026-02-17 |
| QuantumNous/new-api | github | 17748 | 2026-02-12 |
| tensorzero/tensorzero | github | 10971 | 2026-02-18 |
| apify/crawlee-python | github | 8080 | 2026-02-17 |
| katanemo/plano | github | 5094 | 2026-02-17 |
| algorithmicsuperintelligence/optillm | github | 3332 | 2026-01-28 |
| looplj/axonhub | github | 1916 | 2026-02-17 |
| APIParkLab/APIPark | github | 1555 | 2025-10-26 |
| 0xCrunchyy/10x | github | 1383 | 2026-01-05 |
| bricks-cloud/BricksLLM | github | 1154 | 2025-01-05 |
| KenyonY/openai-forward | github | 985 | 2025-03-15 |
| Nya-Foundation/NyaProxy | github | 946 | 2025-07-07 |
| Xerxes-2/clewdr | github | 938 | 2026-02-14 |
| xing61/zzz-api | github | 901 | 2026-02-02 |
| theopenco/llmgateway | github | 878 | 2026-02-17 |
| rtk-ai/rtk | github | 843 | 2026-02-17 |
| vllora/vllora | github | 776 | 2026-02-17 |
| danilofalcao/cursor-deepseek | github | 602 | 2026-01-14 |
| adaline/gateway | github | 577 | 2026-02-07 |
| sgasser/pasteguard | github | 505 | 2026-02-09 |
| kurama622/llm.nvim | codeberg | 1 | 2025-12-02 |
| hongch6661/mix-web-demo | gitlab | 1 | 2026-02-17 |
| soapbox-pub/claude-gateway | gitlab | 1 | 2026-01-08 |
| icfoss/OpenIoT/lorawan-range-mapper_v1.0 | gitlab | 1 | 2024-07-12 |
| my-aws-project/AWS-VPC-Automation | gitlab | 1 | 2025-01-13 |
| tenergy_development_team/tenergy32hub_loragateway_pressure | gitlab | 1 | 2025-09-20 |
Other (17)¶
| Repository | Host | Stars | Updated |
|---|---|---|---|
| CopilotKit/CopilotKit | github | 28840 | 2026-02-18 |
| e2b-dev/E2B | github | 10919 | 2026-02-18 |
| Integuru-AI/Integuru | github | 4539 | 2026-02-12 |
| Intelligent-Internet/ii-agent | github | 3159 | 2026-02-04 |
| i-am-bee/beeai-framework | github | 3106 | 2026-02-13 |
| BAAI-Agents/Cradle | github | 2451 | 2024-11-07 |
| LjyYano/Thinking_in_Java_MindMapping | github | 1639 | 2026-01-18 |
| devnen/Chatterbox-TTS-Server | github | 1024 | 2026-02-12 |
| stakpak/agent | github | 956 | 2026-02-18 |
| litanlitudan/skyagi | github | 787 | 2023-09-21 |
| gety-ai/apple-on-device-openai | github | 770 | 2025-10-02 |
| Lex-au/Orpheus-FastAPI | github | 660 | 2025-07-05 |
| normal-computing/fuji-web | github | 584 | 2026-01-06 |
| oil-oil/wolfcha | github | 513 | 2026-02-18 |
| no_miti/eleventy-plugin-lollmao | codeberg | 1 | 2025-09-29 |
| pvlerick/build-a-llm-from-scratch | codeberg | 1 | 2026-02-17 |
| timedin/ExecAI | codeberg | 1 | 2024-11-11 |
Local Runtime (13)¶
| Repository | Host | Stars | Updated |
|---|---|---|---|
| badlogic/pi-mono | github | 13185 | 2026-02-17 |
| algorithmicsuperintelligence/openevolve | github | 5395 | 2026-02-04 |
| Michael-A-Kuykendall/shimmy | github | 3681 | 2026-01-16 |
| mostlygeek/llama-swap | github | 2416 | 2026-02-16 |
| vm0-ai/vm0 | github | 1006 | 2026-02-18 |
| shyamsaktawat/OpenAlpha_Evolve | github | 970 | 2025-05-31 |
| Pavelevich/llm-checker | github | 629 | 2026-02-17 |
| EricLBuehler/candle-vllm | github | 598 | 2026-01-28 |
| devflowinc/uzi | github | 561 | 2025-06-04 |
| Chen-zexi/vllm-cli | github | 472 | 2026-01-25 |
| tusharhero/ollama-guix | codeberg | 13 | 2025-04-29 |
| 1ndevelopment/termux-utilities | codeberg | 1 | 2026-01-31 |
| leozqin/precis | codeberg | 1 | 2025-05-30 |
Retrieval/Index (13)¶
| Repository | Host | Stars | Updated |
|---|---|---|---|
| sigoden/aichat | github | 9312 | 2026-01-29 |
| opactorai/Claudable | github | 3732 | 2025-12-04 |
| LazyAGI/LazyLLM | github | 3725 | 2026-02-14 |
| smallcloudai/refact | github | 3477 | 2026-02-17 |
| aiflowy/aiflowy | github | 2032 | 2026-02-11 |
| madroidmaq/mlx-omni-server | github | 662 | 2025-12-21 |
| 51bitquant/ai-hedge-fund-crypto | github | 511 | 2025-09-05 |
| giselles-ai/giselle | github | 482 | 2026-02-18 |
| akselmo/artificial-rage | codeberg | 27 | 2025-08-23 |
| beitragsrunde/beitragsrunde | codeberg | 5 | 2026-02-13 |
| pmakholm/bragi | codeberg | 2 | 2024-12-09 |
| danielsreichenbach/ansible-role-garage | codeberg | 1 | 2026-01-19 |
| wordpress-premium/wpbakery-page-builder | gitlab | 1 | 2024-10-26 |
Context Compression (10)¶
| Repository | Host | Stars | Updated |
|---|---|---|---|
| mufeedvh/code2prompt | github | 7138 | 2026-02-16 |
| memodb-io/Acontext | github | 3016 | 2026-02-17 |
| entireio/cli | github | 2618 | 2026-02-18 |
| pchalasani/claude-code-tools | github | 1452 | 2026-02-16 |
| DeepMyst/Mysti | github | 888 | 2025-12-28 |
| context-machine-lab/sleepless-agent | github | 798 | 2026-01-01 |
| massgen/MassGen | github | 744 | 2026-02-16 |
| marv1nnnnn/llm-min.txt | github | 670 | 2025-10-05 |
| gmickel/gmickel-claude-marketplace | github | 512 | 2026-02-17 |
| NeoLabHQ/context-engineering-kit | github | 482 | 2026-02-16 |
Prompt/Eval (10)¶
| Repository | Host | Stars | Updated |
|---|---|---|---|
| raga-ai-hub/RagaAI-Catalyst | github | 16099 | 2026-02-11 |
| latitude-dev/latitude-llm | github | 3894 | 2026-02-17 |
| Agenta-AI/agenta | github | 3849 | 2026-02-17 |
| openlit/openlit | github | 2210 | 2026-02-17 |
| microsoft/prompty | github | 1158 | 2026-02-14 |
| thu-coai/Safety-Prompts | github | 1127 | 2024-02-27 |
| EmbeddedLLM/JamAIBase | github | 1081 | 2026-02-10 |
| Doriandarko/RepoToTextForLLMs | github | 784 | 2024-05-21 |
| melindachang/asoiaf-llm-benchmarking | codeberg | 1 | 2025-08-20 |
| khaytsus/chatdb | gitlab | 1 | 2025-12-02 |
Orchestration (8)¶
| Repository | Host | Stars | Updated |
|---|---|---|---|
| frankbria/ralph-claude-code | github | 6977 | 2026-02-17 |
| UfoMiao/zcf | github | 5495 | 2026-02-05 |
| rllm-org/rllm | github | 5109 | 2026-02-17 |
| mindfold-ai/Trellis | github | 2239 | 2026-02-17 |
| covibes/zeroshot | github | 1175 | 2026-02-17 |
| DmitrySolana/Molt-Pi-Maker | github | 1095 | 2026-02-01 |
| yohey-w/multi-agent-shogun | github | 904 | 2026-02-18 |
| softaworks/agent-toolkit | github | 607 | 2026-02-08 |