Skip to content

The AI Coding Agent Orchestration Landscape

From Claude Code to Zero-Human Dev Teams

AI Coding Agent 編排工具全景圖

從 Claude Code 到零人開發團隊


"The best way to predict the future is to invent it." — Alan Kay

「預測未來最好的方式就是發明它。」—— Alan Kay


Why CLI Agents, Not API Calls?

There's a fundamental architectural split in the AI coding agent world:

  • API-based orchestration: Your code calls the Claude/GPT API directly, manages context windows, handles tool use, and implements agent loops. Frameworks like CrewAI, AutoGen, and LangGraph live here.
  • CLI-based orchestration: You spawn actual CLI processes — claude, codex, gemini — that run as autonomous agents with their own context, tools, and shell access. You manage processes, not prompts.

The CLI approach has a crucial advantage: the agent runtime handles everything — context management, tool use, file editing, git operations, test execution. The orchestrator only needs to answer: what task?, which agent?, when to start?, when to stop?

This guide surveys every notable open-source project in the CLI-based orchestration space as of March 2026. Over 30 projects, organized into 9 categories.

為什麼是 CLI Agent,而不是 API 呼叫?

AI coding agent 的世界有一個根本的架構分歧:

  • API 型編排: 你的程式碼直接呼叫 Claude/GPT API,管理 context window、處理 tool use、實作 agent 迴圈。CrewAI、AutoGen、LangGraph 屬於這一類。
  • CLI 型編排: 你啟動實際的 CLI 程序——claudecodexgemini——它們作為自主 agent 運行,有自己的 context、工具和 shell 存取權限。你管理的是程序,不是 prompt

CLI 方式有一個關鍵優勢:agent runtime 處理一切——context 管理、tool use、檔案編輯、git 操作、測試執行。編排器只需要回答:什麼任務?哪個 agent?何時啟動?何時停止?

本指南盤點截至 2026 年 3 月,CLI 型編排領域中每一個值得關注的開源專案。超過 30 個專案,橫跨 9 大類別。


The Landscape at a Glance

CategoryKey ProjectsStars Range
Business orchestrationPaperclip27.6k
Claude Code-specificoh-my-claudecode, Gas Town, Multiclaude, agents (wshobson)500–31.5k
Multi-agent platformsRuflo, Agent Orchestrator, Overstory, MyClaude1k–21.5k
Session managersClaude Squad, dmux, cmux, Mux1.2k–7.5k
Codex CLI-specificCodex Orchestrator, Symphony235–13.1k
Desktop orchestratorsAperant, Dorothy, Jean, AI Maestro211–13.3k
Sandboxed executionTSK, Agent Swarm150–267
Lightweight / skill-basedSwarms, Sudocode, Claude MPM85–232
Mobile / remoteHappy Coder15.5k

Dominant patterns across all projects:

  1. tmux + git worktrees — the standard isolation mechanism
  2. Mayor/Lead/Supervisor — a coordinator agent decomposes tasks and delegates
  3. Wave-based execution — independent tasks run in parallel, dependent tasks wait
  4. SQLite or git-backed state — lightweight persistence without external databases

全景一覽

類別代表專案星數範圍
商業編排Paperclip27.6k
Claude Code 專用oh-my-claudecode、Gas Town、Multiclaude、agents (wshobson)500–31.5k
多 agent 平台Ruflo、Agent Orchestrator、Overstory、MyClaude1k–21.5k
Session 管理器Claude Squad、dmux、cmux、Mux1.2k–7.5k
Codex CLI 專用Codex Orchestrator、Symphony235–13.1k
桌面編排器Aperant、Dorothy、Jean、AI Maestro211–13.3k
沙箱執行TSK、Agent Swarm150–267
輕量 / skill 型Swarms、Sudocode、Claude MPM85–232
行動 / 遠端Happy Coder15.5k

所有專案中的主流模式:

  1. tmux + git worktrees —— 標準的隔離機制
  2. Mayor/Lead/Supervisor —— 協調者 agent 分解任務並委派
  3. 波次執行(Wave-based execution) —— 獨立任務平行執行,有相依性的任務等待
  4. SQLite 或 git-backed 狀態 —— 輕量持久化,不需要外部資料庫

Category 1: Business Orchestration — Paperclip

Paperclip | 27.6k stars | MIT | Node.js + React + PostgreSQL

The "zero-human company" platform. Paperclip models an entire business: org charts, reporting lines, goal hierarchies, per-agent budgets with auto-pause, heartbeat scheduling, board-level governance with approval gates, and immutable audit logs. It supports 9+ agent adapters (Claude Code, Codex, OpenClaw, Cursor, Gemini, and more).

Unique to Paperclip:

  • Multi-company support with complete data isolation
  • Atomic task checkout (no two agents work the same issue)
  • Budget enforcement with soft alerts (80%) and hard ceilings (100%)
  • Board approval gates for strategy, hires, and budget changes

Best for: Founders building "AI companies" — multiple agents working toward a business goal with governance.

Limitation: Heavy for solo developers who just want parallel coding agents.

類別 1:商業編排——Paperclip

Paperclip | 27.6k 星 | MIT | Node.js + React + PostgreSQL

「零人公司」平台。Paperclip 建模整間公司:組織架構、報告線、目標層級、每 agent 預算(含自動暫停)、heartbeat 排程、董事會層級的治理(含核准閘門)、以及不可變的稽核日誌。支援 9 種以上 agent adapter(Claude Code、Codex、OpenClaw、Cursor、Gemini 等)。

Paperclip 獨有:

  • 多公司支援,完全資料隔離
  • 原子任務 checkout(沒有兩個 agent 同時處理同一個 issue)
  • 預算控制:80% 軟警報 + 100% 硬上限自動暫停
  • 董事會核准閘門(策略、僱用、預算變更)

最適合: 建構「AI 公司」的創辦人——多個 agent 朝向商業目標工作,並有治理機制。

限制: 對只想要平行 coding agent 的單人開發者來說太重。


Category 2: Claude Code-Specific Orchestration

wshobson/agents — 31.5k stars

agents | MIT | The largest project by stars

Not a standalone orchestrator but a massive plugin ecosystem for Claude Code: 112 specialized agents, 16 multi-agent workflow orchestrators, 146 skills, and 79 dev tools organized into 72 Claude Code plugins. Uses 4-tier model assignment (Opus for architecture, Sonnet for development, Haiku for operations). Installs via /plugin marketplace.

Best for: Developers who want to supercharge Claude Code without leaving its native environment.

Gas Town — 12.4k stars

Gas Town | Go | By Steve Yegge

A deeply git-native orchestration system. A "Mayor" coordinator manages 20-30 agents using git-backed persistent state: "beads" for issue tracking, "convoys" for work bundles, "hooks" for agent state. Supports 10+ runtimes: Claude, Gemini, Codex, Cursor, Auggie, Amp, OpenCode, Copilot, Pi, OMP. Features formula workflows (TOML-defined repeatable processes), TUI activity feeds, and problem detection for stuck agents.

Best for: Teams that want orchestration deeply integrated with git workflows.

Key difference from Paperclip: Git is the source of truth, not PostgreSQL.

oh-my-claudecode — 10.1k stars

oh-my-claudecode | npm: oh-my-claude-sisyphus

Teams-first multi-agent orchestration with 32 specialized agents and smart model routing (Haiku for simple tasks, Opus for complex). 9 execution modes including Ultrawork (maximum parallelism) and Ralph (persistent verify/fix loops). Supports Claude Code natively, plus Codex/Gemini via tmux workers. The /ccg command synthesizes analysis from all three providers. Claims 30-50% token cost reduction through intelligent routing.

Best for: Power users who want sophisticated model routing and cost optimization.

Multiclaude — 504 stars

Multiclaude | Go | MIT

Spawns autonomous Claude Code agents in tmux windows with git worktrees. Uses a "Brownian Ratchet" philosophy — controlled chaos with CI/testing as the filtering mechanism. Built-in agent types: Supervisor, Merge Queue, PR Shepherd, Worker, Reviewer. Single-player mode auto-merges on CI pass; multiplayer mode respects team review processes.

Best for: Developers who trust CI as the quality gate and want agents to "throw code at the wall and see what sticks."

SwarmSDK (parruda/claude-swarm) — 1.7k stars

SwarmSDK | Ruby

A Ruby framework for multi-agent orchestration with persistent memory (FAISS-indexed semantic search). Single-process model using RubyLLM for direct method calls. Node workflows with dependencies, 12-event hooks system.

Best for: Ruby developers who want a programming framework, not a deployment platform.

Claude Code Agent Teams — Official Feature

Built into Claude Code v2.1.32+. Enable with CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS. One session acts as team lead, coordinating work and synthesizing results. Teammates work independently with their own context windows. This is the foundation many third-party tools build on or compete with.

類別 2:Claude Code 專用編排

wshobson/agents——31.5k 星

agents | MIT | 星數最高的專案

不是獨立的編排器,而是 Claude Code 的大型 plugin 生態系:112 個專門 agent、16 個多 agent 工作流編排器、146 個 skill、79 個開發工具,組織成 72 個 Claude Code plugin。使用 4 層模型分配(Opus 負責架構、Sonnet 負責開發、Haiku 負責維運)。透過 /plugin marketplace 安裝。

最適合: 想在 Claude Code 原生環境內強化功能的開發者。

Gas Town——12.4k 星

Gas Town | Go | Steve Yegge 作品

深度 git 原生的編排系統。「Mayor」協調者使用 git-backed 持久化狀態管理 20-30 個 agent:「beads」追蹤 issue、「convoys」打包工作、「hooks」記錄 agent 狀態。支援 10+ 種 runtime:Claude、Gemini、Codex、Cursor、Auggie、Amp、OpenCode、Copilot、Pi、OMP。具備 formula workflow(TOML 定義的可重複流程)、TUI activity feed、以及卡住 agent 的偵測。

最適合: 想要編排與 git 工作流深度整合的團隊。

與 Paperclip 的關鍵差異: Git 是真相來源,不是 PostgreSQL。

oh-my-claudecode——10.1k 星

oh-my-claudecode | npm: oh-my-claude-sisyphus

以團隊為核心的多 agent 編排,有 32 個專門 agent智慧模型路由(Haiku 處理簡單任務、Opus 處理複雜任務)。9 種執行模式,包括 Ultrawork(最大平行度)和 Ralph(持續驗證/修復迴圈)。原生支援 Claude Code,另透過 tmux worker 支援 Codex/Gemini。/ccg 指令能綜合三個 provider 的分析。宣稱透過智慧路由減少 30-50% token 成本。

最適合: 想要精密模型路由和成本優化的 power user。

Multiclaude——504 星

Multiclaude | Go | MIT

在 tmux 視窗中搭配 git worktree 啟動自主 Claude Code agent。採用**「布朗棘輪」哲學**——受控的混沌,以 CI/測試作為篩選機制。內建 agent 類型:Supervisor、Merge Queue、PR Shepherd、Worker、Reviewer。單人模式在 CI 通過時自動 merge;多人模式尊重團隊 review 流程。

最適合: 信任 CI 作為品質閘門、想讓 agent「把程式碼往牆上丟看什麼能黏住」的開發者。

SwarmSDK(parruda/claude-swarm)——1.7k 星

SwarmSDK | Ruby

Ruby 多 agent 編排框架,具備持久化記憶(FAISS 索引的語意搜尋)。單程序模型,使用 RubyLLM 直接方法呼叫。節點工作流有相依性、12 個事件 hook。

最適合: 想要程式框架而非部署平台的 Ruby 開發者。

Claude Code Agent Teams——官方功能

內建於 Claude Code v2.1.32+。透過 CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS 啟用。一個 session 作為 team lead,協調工作並綜合結果。Teammate 在自己的 context window 中獨立工作。這是許多第三方工具的基礎或競爭對象。


Category 3: Multi-Agent Platforms (Agent-Agnostic)

Ruflo — 21.5k stars

Ruflo | MIT | Formerly "Claude Flow"

The most feature-rich orchestration platform. 60+ specialized agents organized into swarms led by "queens" (Strategic, Tactical, Adaptive). Features: Q-Learning router, Mixture-of-Experts with 8 experts, 42+ skills, multiple consensus algorithms (Byzantine, Raft, Gossip, CRDT), WASM agent booster (<1ms for simple transforms), multi-provider support (Claude, GPT, Gemini, Cohere, Ollama). Claims ~250% extended Claude Code usage and 30-50% token optimization.

Best for: Teams wanting maximum feature breadth.

Caveat: The claims around self-learning and consensus algorithms warrant careful evaluation.

Agent Orchestrator (ComposioHQ) — 4.5k stars

Agent Orchestrator | MIT | TypeScript

The cleanest architecture in this space. 8 swappable plugin slots, each defined by a TypeScript interface:

SlotOptions
Runtimetmux / Docker / K8s
AgentClaude Code / Codex / Aider / OpenCode
Workspaceworktree / clone
TrackerGitHub / Linear
SCMGit
NotifierSlack / webhook
Terminaltmux
Lifecyclehooks

Agents autonomously fix CI failures, address review comments, and open PRs. 3,288 test cases.

Best for: Teams that value clean architecture and extensibility.

Overstory — 1k stars

Overstory | MIT | TypeScript/Bun

The most runtime-agnostic orchestrator. Supports 8 agent runtimes: Claude Code, Sapling, Pi, Copilot, Cursor, Codex, Gemini, OpenCode. Uses a SQLite mail system for inter-agent communication (8 message types, broadcast addressing). Agent hierarchy: Orchestrator > Coordinator > Supervisor > Workers. Three-tier watchdog system. Tiered merge queue with 4-level conflict resolution.

Explicitly warns: "Agent swarms are not a universal solution."

Best for: Teams using diverse agent runtimes that need merge coordination as a first-class feature.

MyClaude — 2.5k stars

MyClaude | Go, Python, JS

Two-agent model: Claude Code as Orchestrator (planning + verification), codeagent-wrapper as Executor across Codex, Claude, Gemini, and OpenCode backends. 5 workflow modules:

ModulePurpose
do5-phase feature development
omoMulti-agent with routing
bmadEnterprise agile with 6 agents
essentials11 core commands
sparvSpecify-Plan-Act-Review-Vault

Best for: Developers who want a practical, well-structured workflow system.

類別 3:多 Agent 平台(Agent 不可知)

Ruflo——21.5k 星

Ruflo | MIT | 前身為「Claude Flow」

功能最豐富的編排平台。60+ 個專門 agent 組織成由「queen」領導的 swarm(Strategic、Tactical、Adaptive)。功能:Q-Learning 路由器、8 個 expert 的 Mixture-of-Experts、42+ skill、多種共識演算法(Byzantine、Raft、Gossip、CRDT)、WASM agent booster(簡單轉換 <1ms)、多 provider 支援(Claude、GPT、Gemini、Cohere、Ollama)。宣稱延伸約 250% 的 Claude Code 使用量,token 優化 30-50%。

最適合: 想要最大功能廣度的團隊。

但書: 關於自學習和共識演算法的宣稱需要仔細評估。

Agent Orchestrator(ComposioHQ)——4.5k 星

Agent Orchestrator | MIT | TypeScript

這個領域中架構最乾淨的方案。8 個可替換的 plugin 插槽,每個都由 TypeScript interface 定義:

插槽選項
Runtimetmux / Docker / K8s
AgentClaude Code / Codex / Aider / OpenCode
Workspaceworktree / clone
TrackerGitHub / Linear
SCMGit
NotifierSlack / webhook
Terminaltmux
Lifecyclehooks

Agent 能自主修復 CI 失敗、回應 review 留言、開 PR。3,288 個測試案例。

最適合: 重視乾淨架構和可擴展性的團隊。

Overstory——1k 星

Overstory | MIT | TypeScript/Bun

最不綁定 runtime 的編排器。支援 8 種 agent runtime:Claude Code、Sapling、Pi、Copilot、Cursor、Codex、Gemini、OpenCode。使用 SQLite 郵件系統進行 agent 間通訊(8 種訊息類型、廣播定址)。Agent 層級:Orchestrator > Coordinator > Supervisor > Worker。三層 watchdog 系統。分層 merge queue,4 級衝突解決。

明確警告:「Agent swarm 不是萬能解法。」

最適合: 使用多種 agent runtime、需要 merge 協調作為一級功能的團隊。

MyClaude——2.5k 星

MyClaude | Go、Python、JS

雙 agent 模型:Claude Code 作為 Orchestrator(規劃 + 驗證),codeagent-wrapper 作為 Executor,跨 Codex、Claude、Gemini 和 OpenCode 後端。5 個工作流模組:

模組用途
do5 階段功能開發
omo多 agent 含路由
bmad企業敏捷,6 個 agent
essentials11 個核心指令
sparv指定-計畫-執行-審查-儲存

最適合: 想要實用、結構良好的工作流系統的開發者。


Category 4: Session Managers

These tools focus on managing multiple CLI agent sessions — no task decomposition, no dependency graphs, just clean multi-agent management.

cmux — 7.5k stars

cmux | AGPL-3.0

Native macOS terminal app (Swift/AppKit) using Ghostty's GPU-accelerated rendering. Agent notification system with blue rings on panes. Built-in scriptable WebKit browser. Premium terminal emulator with agent-aware features.

Claude Squad — 6.4k stars

Claude Squad | Go | AGPL-3.0

The most popular pure session manager. TUI for managing multiple Claude Code, Codex, Gemini, and Aider instances in isolated tmux sessions with git worktrees. Dead simple: n new session, o attach, s push. Installable via brew install claude-squad.

Best for: Developers who want multi-agent parallelism with zero orchestration overhead.

dmux — 1.2k stars

dmux | MIT

The widest agent support: Claude Code, Codex, OpenCode, Cline CLI, Gemini CLI, Qwen CLI, Amp CLI, Pi CLI, Cursor CLI, Copilot CLI, Crush CLI. Multi-select agent launches, AI-generated branch names, smart merge+cleanup, lifecycle hooks, macOS notifications.

Best for: Developers using many different agent CLIs who want a unified launcher.

Mux (Coder) — 1.4k stars

Mux | By Coder (remote dev environments)

Desktop + browser app for parallel agentic development. Custom agent loop with Plan/Exec mode. Supports Claude, GPT-5, Grok, Ollama, OpenRouter. VS Code extension. SSH-based remote execution.

Best for: Teams already using Coder for remote development environments.

類別 4:Session 管理器

這些工具專注於管理多個 CLI agent session——沒有任務分解、沒有相依圖,就是乾淨的多 agent 管理。

cmux——7.5k 星

cmux | AGPL-3.0

原生 macOS 終端應用程式(Swift/AppKit),使用 Ghostty 的 GPU 加速渲染。Agent 通知系統在面板上顯示藍色光圈。內建可腳本化的 WebKit 瀏覽器。具備 agent 感知功能的進階終端模擬器。

Claude Squad——6.4k 星

Claude Squad | Go | AGPL-3.0

最受歡迎的純 session 管理器。TUI 介面管理多個 Claude Code、Codex、Gemini 和 Aider 實例,在隔離的 tmux session 中搭配 git worktree。超級簡單:n 新 session、o 附加、s 推送。可透過 brew install claude-squad 安裝。

最適合: 想要多 agent 平行性、零編排開銷的開發者。

dmux——1.2k 星

dmux | MIT

支援最多 agent 的方案:Claude Code、Codex、OpenCode、Cline CLI、Gemini CLI、Qwen CLI、Amp CLI、Pi CLI、Cursor CLI、Copilot CLI、Crush CLI。多選 agent 啟動、AI 生成分支名稱、智慧 merge + 清理、lifecycle hook、macOS 通知。

最適合: 使用多種不同 agent CLI、想要統一啟動器的開發者。

Mux(Coder)——1.4k 星

Mux | 來自 Coder(遠端開發環境)

桌面 + 瀏覽器應用程式,用於平行 agentic 開發。自訂 agent 迴圈,Plan/Exec 模式。支援 Claude、GPT-5、Grok、Ollama、OpenRouter。VS Code 擴充套件。SSH 遠端執行。

最適合: 已經使用 Coder 遠端開發環境的團隊。


Category 5: Codex CLI & OpenAI Ecosystem

Symphony (OpenAI) — 13.1k stars

Symphony | Elixir | Engineering Preview

OpenAI's official answer to multi-agent orchestration. Monitors work boards (Linear), spawns autonomous agents, provides proof of work (CI status, complexity analysis, walkthrough videos), and lands PRs when approved. Spec-driven design (SPEC.md) with Elixir reference implementation.

Status: "Low-key engineering preview" — not production-ready but architecturally significant as OpenAI's vision for agentic coding.

Codex Orchestrator — 235 stars

Codex Orchestrator | MIT

Designed to work with Claude Code, not replace it. "Claude handles strategic thinking; Codex handles deep coding work." Delegates tasks to Codex agents via tmux sessions. Plugin pipeline: Ideation > Research > Synthesis > PRD > Implementation > Review > Testing.

Best for: Teams that want Claude Code for planning and Codex for execution.

類別 5:Codex CLI 與 OpenAI 生態系

Symphony(OpenAI)——13.1k 星

Symphony | Elixir | Engineering Preview

OpenAI 對多 agent 編排的官方回應。監控工作看板(Linear)、啟動自主 agent、提供工作證明(CI 狀態、複雜度分析、演練影片),在核准後合併 PR。Spec 驅動設計(SPEC.md),Elixir 參考實作。

狀態:「低調工程預覽」——尚未 production-ready,但作為 OpenAI 對 agentic coding 的願景具有架構意義。

Codex Orchestrator——235 星

Codex Orchestrator | MIT

設計上與 Claude Code 搭配使用,而非取代它。「Claude 處理策略思考;Codex 處理深度 coding 工作。」透過 tmux session 將任務委派給 Codex agent。Plugin 管線:構思 > 研究 > 綜合 > PRD > 實作 > 審查 > 測試。

最適合: 想用 Claude Code 做規劃、Codex 做執行的團隊。


Category 6: Desktop Orchestrators

Aperant — 13.3k stars

Aperant | AGPL-3.0 | Formerly "Auto-Claude"

Electron desktop app. Up to 12 concurrent agent terminals with git worktree isolation, self-validating QA loops, AI-powered conflict resolution. GitHub/GitLab/Linear integration. Memory layer retaining insights across sessions. Requires Claude Pro/Max subscription.

Best for: Developers who prefer GUI over TUI and want visual multi-agent management.

Dorothy — 211 stars

Dorothy

Desktop app with a unique "Super Agent" meta-agent that programmatically controls other agents via MCP tools. Automations poll GitHub/JIRA and spawn agents. Kanban task management with skill-based agent assignment. Scheduled tasks via cron. Telegram/Slack remote control.

Best for: Teams wanting GitHub/JIRA-driven agent automation with remote control.

Jean — 643 stars

Jean | Tauri (Rust)

Tauri desktop app managing Claude CLI, Codex CLI, and OpenCode across projects and worktrees. Integrated terminal, diff viewer, GitHub issue/PR investigation. Magic commands for code review, AI commit messages, PR generation. Built with React 19 + Rust.

Best for: Developers who want a lightweight, native desktop experience (Tauri is much lighter than Electron).

AI Maestro — 532 stars

AI Maestro | Dashboard

Features the Agent Messaging Protocol (AMP) for direct agent-to-agent communication with priority levels and cryptographic signatures. Peer mesh network for multi-machine support without a central server. Three-layer persistent intelligence (memory, code graph, auto-docs). Supports Claude Code, Aider, Cursor, Copilot.

Best for: Distributed teams running agents across multiple machines.

類別 6:桌面編排器

Aperant——13.3k 星

Aperant | AGPL-3.0 | 前身為「Auto-Claude」

Electron 桌面應用程式。最多 12 個並行 agent 終端搭配 git worktree 隔離、自驗證 QA 迴圈、AI 驅動的衝突解決。GitHub/GitLab/Linear 整合。跨 session 的記憶層。需要 Claude Pro/Max 訂閱。

最適合: 偏好 GUI 而非 TUI、想要視覺化多 agent 管理的開發者。

Dorothy——211 星

Dorothy

桌面應用程式,特色是「Super Agent」meta-agent,透過 MCP 工具程式化控制其他 agent。自動化機制輪詢 GitHub/JIRA 並啟動 agent。Kanban 任務管理,基於技能分配 agent。透過 cron 排程任務。Telegram/Slack 遠端控制。

最適合: 想要 GitHub/JIRA 驅動的 agent 自動化搭配遠端控制的團隊。

Jean——643 星

Jean | Tauri(Rust)

Tauri 桌面應用程式,管理 Claude CLI、Codex CLI 和 OpenCode 跨專案與 worktree。整合終端、diff 檢視器、GitHub issue/PR 調查。Magic command 用於 code review、AI commit 訊息、PR 生成。以 React 19 + Rust 建構。

最適合: 想要輕量原生桌面體驗的開發者(Tauri 比 Electron 輕量許多)。

AI Maestro——532 星

AI Maestro | Dashboard

特色是 Agent Messaging Protocol(AMP)——直接 agent 對 agent 通訊,有優先級和加密簽名。Peer mesh 網路支援多機器,不需中央伺服器。三層持久化智慧(記憶、程式碼圖、自動文件)。支援 Claude Code、Aider、Cursor、Copilot。

最適合: 在多台機器上運行 agent 的分散式團隊。


Category 7: Sandboxed Execution

TSK — 150 stars

TSK | Rust

The most security-focused tool in this space. Delegates tasks to agents in Docker containers with domain allowlist enforcement via Squid proxy. Auto-detects project toolchains and builds container images. SQLite-backed task queue with parallel workers. Supports Claude Code and Codex. Task chaining for sequential workflows.

Best for: Security-conscious teams that want agents sandboxed from the host system.

Agent Swarm (desplega-ai) — 267 stars

Agent Swarm

Lead/worker model. A lead agent receives tasks from Slack/GitHub/email/CLI, breaks them down, and delegates to workers in Docker containers. Compounding memory via embeddings. 9 agent templates. Persistent identity (SOUL.md, IDENTITY.md). Real-time monitoring dashboard.

Best for: Teams that want agents with persistent "personality" and multi-channel task intake.

類別 7:沙箱執行

TSK——150 星

TSK | Rust

這個領域中最注重安全的工具。在 Docker 容器中委派任務給 agent,透過 Squid proxy 強制執行網域白名單。自動偵測專案工具鏈並建構容器映像。SQLite 任務佇列搭配平行 worker。支援 Claude Code 和 Codex。任務串接用於循序工作流。

最適合: 注重安全、想讓 agent 與主機系統隔離的團隊。

Agent Swarm(desplega-ai)——267 星

Agent Swarm

Lead/Worker 模型。Lead agent 從 Slack/GitHub/email/CLI 接收任務,分解後委派給 Docker 容器中的 worker。透過 embeddings 的累積記憶。9 個 agent 模板。持久化身份(SOUL.md、IDENTITY.md)。即時監控儀表板。

最適合: 想要 agent 有持久化「個性」和多通道任務接收的團隊。


Category 8: Lightweight / Skill-Based

Swarms (am-will) — 144 stars

Swarms | Shell

Pure Shell-based orchestration skills for Claude Code and Codex. Two-phase: swarm-planner creates dependency-ordered task plans, parallel-task launches independent agents in waves. Minimal dependencies — just shell scripts.

Best for: Minimalists who want orchestration without installing anything.

Sudocode — 232 stars

Sudocode

Git-native agent memory system that lives in your repo. 4-tiered abstraction: Specs > Issues > Agents > Artifacts. Dual representation: human-readable Markdown + machine-optimized JSONL/SQLite. Topological ordering for issue execution. IDE integration for VS Code, Cursor, Windsurf, IntelliJ, Zed, Xcode.

Best for: Teams that want agent state to live in the repo, not a separate database.

Claude MPM — 85 stars

Claude MPM

47+ specialized agents with PM orchestration for automatic task routing. 44+ bundled skills. Session management with auto-pause at token thresholds. Extensive MCP integrations (Google Workspace, Notion, Confluence, Slack, Jira/Linear/GitHub).

Best for: Solo developers who want a project-manager-in-a-box with broad integrations.

類別 8:輕量 / Skill 型

Swarms(am-will)——144 星

Swarms | Shell

Shell 的 Claude Code 和 Codex 編排 skill。兩階段:swarm-planner 建立有相依順序的任務計畫,parallel-task 以波次啟動獨立 agent。最少相依——只有 shell 腳本。

最適合: 不想安裝任何東西就想要編排功能的極簡主義者。

Sudocode——232 星

Sudocode

住在你 repo 裡的 git 原生 agent 記憶系統。4 層抽象:Spec > Issue > Agent > Artifact。雙重表示:人類可讀的 Markdown + 機器最佳化的 JSONL/SQLite。拓撲排序決定 issue 執行順序。IDE 整合支援 VS Code、Cursor、Windsurf、IntelliJ、Zed、Xcode。

最適合: 想讓 agent 狀態住在 repo 裡而非獨立資料庫的團隊。

Claude MPM——85 星

Claude MPM

47+ 個專門 agent,PM 編排自動路由任務。44+ 個內建 skill。Session 管理在 token 門檻時自動暫停。廣泛的 MCP 整合(Google Workspace、Notion、Confluence、Slack、Jira/Linear/GitHub)。

最適合: 想要一站式專案經理、且有廣泛整合的單人開發者。


Category 9: Mobile & Remote Access

Happy Coder — 15.5k stars

Happy Coder | TypeScript

Mobile/web client for Claude Code and Codex with real-time voice, end-to-end encryption, and push notifications. Not an orchestrator but enables remote agent supervision from your phone. The happy agent command spawns and monitors remote sessions.

Best for: Developers who want to monitor and interact with agents while away from their desk.

類別 9:行動與遠端存取

Happy Coder——15.5k 星

Happy Coder | TypeScript

Claude Code 和 Codex 的行動/網頁客戶端,支援即時語音、端對端加密和推播通知。不是編排器,但能從手機遠端監督 agenthappy agent 指令啟動並監控遠端 session。

最適合: 離開桌面時仍想監控和互動 agent 的開發者。


Decision Matrix: Which Tool Should You Use?

Your SituationRecommended ToolWhy
I just want parallel Claude Code sessionsClaude SquadSimplest, zero config, brew install
I want parallel sessions with many different agentsdmuxWidest agent support (11+ CLIs)
I want full business orchestration with governancePaperclipOnly tool with board governance, budgets, org charts
I want git-native orchestration for a dev teamGas TownDeepest git integration, TOML workflows
I want clean plugin architecture I can extendAgent Orchestrator8 swappable slots, 3,288 tests
I want to supercharge Claude Code with pluginswshobson/agents112 agents, 146 skills, native integration
I want smart model routing to save costsoh-my-claudecode30-50% cost reduction via intelligent routing
I want maximum features and don't mind complexityRuflo60+ agents, Q-Learning, WASM, consensus
I want sandboxed agent executionTSKDocker isolation with network allowlists
I want a visual desktop appAperant12 concurrent terminals, Electron
I want lightweight native desktop (not Electron)JeanTauri/Rust, much lighter
I want agents on my phoneHappy CoderMobile client with push notifications
I want zero dependencies, just shell scriptsSwarmsPure shell, no install
I want the official OpenAI approachSymphonyOpenAI's spec-driven vision

決策矩陣:你該用哪個工具?

你的情況推薦工具原因
我只想要平行的 Claude Code sessionClaude Squad最簡單,零設定,brew install
我想用很多不同 agent 的平行 sessiondmux最廣 agent 支援(11+ 種 CLI)
我想要完整的商業編排與治理Paperclip唯一有董事會治理、預算、組織架構的工具
我想要 git 原生的開發團隊編排Gas Town最深的 git 整合,TOML workflow
我想要乾淨的 plugin 架構可以擴展Agent Orchestrator8 個可替換插槽,3,288 個測試
我想用 plugin 強化 Claude Codewshobson/agents112 個 agent、146 個 skill、原生整合
我想要智慧模型路由來省錢oh-my-claudecode透過智慧路由減少 30-50% 成本
我想要最多功能,不介意複雜度Ruflo60+ agent、Q-Learning、WASM、consensus
我想要沙箱化的 agent 執行TSKDocker 隔離搭配網路白名單
我想要視覺化桌面應用Aperant12 個並行終端,Electron
我想要輕量原生桌面(非 Electron)JeanTauri/Rust,輕量許多
我想在手機上操作 agentHappy Coder行動客戶端搭配推播通知
我想要零相依,只要 shell 腳本Swarms純 shell,不需安裝
我想要 OpenAI 官方方案SymphonyOpenAI 的 spec 驅動願景

The Maturity Warning

Every single project in this list is less than one year old. Most are less than three months old. The entire CLI-based agent orchestration category effectively did not exist before mid-2025.

What this means:

  • Expect breaking changes — pin versions, update intentionally
  • No security audits — none of these have undergone formal security review
  • Documentation is sparse — most projects have a README and little else
  • Bus factor is 1 — most are maintained by a single developer or small team
  • Stars ≠ maturity — 30k stars in two weeks means hype, not stability

Practical advice:

  1. Start with Claude Squad or dmux — they're simple, low-risk session managers
  2. Graduate to Gas Town or Agent Orchestrator when you need real task coordination
  3. Consider Paperclip only when you need business-level orchestration with governance
  4. Always keep your agents' work in git worktrees so you can easily discard bad outputs
  5. Set budget limits wherever the tool supports it — runaway agents are expensive

成熟度警告

這份清單中的每一個專案都不到一年。大多數不到三個月。整個 CLI 型 agent 編排類別在 2025 年中之前基本不存在。

這意味著:

  • 預期有破壞性變更 —— 鎖定版本,有意識地更新
  • 沒有安全審計 —— 這些專案沒有經過正式的安全審查
  • 文件稀疏 —— 大多數專案只有 README,其他很少
  • Bus factor 是 1 —— 大多數由單一開發者或小團隊維護
  • 星數 ≠ 成熟度 —— 兩週 30k 星意味著熱度,不是穩定性

實用建議:

  1. Claude Squaddmux 開始——簡單、低風險的 session 管理器
  2. 當你需要真正的任務協調時,升級到 Gas TownAgent Orchestrator
  3. 只在你需要有治理的商業層級編排時才考慮 Paperclip
  4. 永遠讓 agent 的工作在 git worktree 中,這樣你可以輕鬆丟棄不好的產出
  5. 在工具支援的地方設定預算限制——失控的 agent 很昂貴