Roadmap
Phased Roadmap¶
Phase 1 — Agent in a Box ✅ COMPLETE¶
Goal: Run moby in a Docker container with persistent memory, interact via CLI.
agents/moby/soul.yaml— Moby's personality, model, tools, behaviorDockerfile— Agent base image with cagentdocker-compose.yml— Compose manifestmobyclaw— CLI script (up, down, logs, status, run, chat)- Persistent memory in
~/.mobyclaw/(MEMORY.md, daily logs)
Phase 2 — Gateway + Messaging ✅ COMPLETE¶
Goal: Chat with moby through Telegram. Heartbeat, scheduling, and reminders working.
gateway/— Gateway container (message routing, sessions, scheduler)- Telegram adapter with streaming responses and tool status indicators
- Heartbeat system (periodic agent wake-ups during active hours)
- Scheduler with reminders, recurring tasks, and prompt-based schedules
- Single session architecture with FIFO queue and disk persistence
- Auto-backup to GitHub on a cron schedule
Phase 2.5 — Session & Queue UX ✅ COMPLETE¶
Goal: Polished session management inspired by OpenClaw patterns.
- Collect queue mode (coalesces rapid messages into one turn)
- Typing indicators and queue feedback in Telegram
- /stop, /new, /reset, /clear, /status commands
- Daily session reset at 4 AM, debounce, queue cap (20)
- SSE
queuedevent for programmatic clients
Phase 2.7 — Tool Gateway ✅ COMPLETE¶
Goal: Agent can access external web services and automate browser interactions.
- tool-gateway container: Stateless MCP Streamable HTTP server + admin API
- mcp-bridge: Node.js stdio↔HTTP bridge connecting cagent to tool-gateway
- 19 MCP tools total:
browser_fetch— clean text extraction from any URLbrowser_search— web search via DuckDuckGoweather_get— current weather + forecast (Open-Meteo, free)- 16 browser automation tools (Playwright + Chromium):
browser_navigate,browser_snapshot,browser_screenshot,browser_click,browser_type,browser_fill_form,browser_select_option,browser_hover,browser_press_key,browser_scroll,browser_back,browser_forward,browser_wait,browser_tabs,browser_close,browser_eval
- Accessibility snapshots with aria-ref element targeting
- Persistent browser context with 10min idle auto-close
- Clawnching $MOBYCLAW on https://clawn.ch/
Phase 3 — Workspace + Integrations¶
Goal: More messaging channels, integrations, and intelligence.
- More messaging adapters (WhatsApp, Discord, Slack)
- Notion integration via MCP (OAuth 2.0 + PKCE, chat-mediated auth)
- Vector memory search (semantic recall over memory files)
- Webhook ingress (GitHub events, etc.)
- Web UI for management and chat
Phase 4 — Production Hardening¶
Goal: Ready for real 24/7 workloads.
- Security hardening (seccomp, read-only root, network policy)
- Monitoring and observability (logs, metrics, health checks)
- Session compaction (summarize old context)
- Auto memory flush before compaction
- Plugin/skill system