AGENTS.md

Autonomous coding agent on Cloudflare Workers. Self-hostable, multi-tenant, sandboxed.

Focus

Multi-tenant architecture: per-user state via UserControl DO, global state via SharedIndex DO.
CodingAgent built on @cloudflare/think for message persistence, chat loop, and durable fibers.
Keep the implementation publishable and self-hostable.
Prioritize correctness and test coverage over feature expansion.

Commands

npm install
npm run typecheck
npm run lint
npm test
npm run dev
npm run deploy

Rules

Run npm test and npm run typecheck regularly while changing behavior.
Prefer small, typed modules over large all-in-one files.
Keep Durable Object state persistent and observable.
Preserve Cloudflare Access in production; local test bypass is allowed only in test/dev config.
Do not introduce container- or VM-specific assumptions.
All Think imports route through src/think-adapter.ts — never import @cloudflare/think directly elsewhere.

Linting

Two layers:

In-isolate — the typecheck tool with extraStrict: true. Catches unused locals/parameters, missing returns, and switch fall-through via TypeScript's own diagnostics. No extra bundle cost, sub-second feedback inside a session.
External (CI) — Biome runs in .github/workflows/dodo-verify.yml for every PR and dispatched verify run. Catches the wider set: unused imports, suspicious patterns, double-equals, etc. Configured in biome.jsonc (correctness + suspicious rules only — formatting and style are intentionally off).

Why two layers: Biome's wasm bundle is ~8 MB gzipped, which would push Dodo past the Workers 10 MB compressed script limit if loaded in-isolate. The in-isolate check is the agent's fast feedback loop; the CI check is the thorough gate before merge.

Run npm run lint locally before pushing if you want the same signal CI will give you. npm run lint:fix applies safe auto-fixes.

Deploying

Deploys are manual. Run npm run deploy:safe locally after merging to main — it builds, deploys, then runs scripts/post-deploy-smoke.sh against the deployed Worker (asserting /health, /version.json matches HEAD, codemode execute round-trips through globalOutbound, and a real git clone lands in the workspace). Use plain npm run deploy only when you intentionally want to skip the probe (e.g. you're about to run it manually with extra args).

Workers Builds CI is disabled because the "experimental" compat flag (required by @cloudflare/think and the Agents SDK subAgent() facet API) blocks non-local deploys by design — see issue #46. When Think graduates out of experimental, this can be re-enabled.

Git Discipline — Pick the Workflow That Matches Your Runtime

Never commit directly to main. All changes go through feature branches. The exact mechanics differ depending on where you're running.

If you're a Dodo session (sandboxed clone in a Durable Object)

This is the case when the agent reading this file is running inside Dodo itself — the workspace is an ephemeral clone in a session DO, you have git tools (not a shell), and there is no ~/dev/dodo parent checkout to share with.

Do not use git worktree. It has no meaning here — there's nothing to share with, no concurrent local agents to collide with, and no .. parent dir. The session's clone is the workspace.

Git is split across two surfaces:

Top-level tools (call directly): git_status, git_add, git_commit, git_diff. The hot path.
Inside codemode (call as git.<name>): git_clone_known, git_clone, git_push, git_push_checked, git_pull, git_branch, git_checkout, git_log, git_verify_remote_branch, pr_create. Wrap them in a codemode({ code: ... }) JS block.

Workflow:

After cloning, create a branch directly inside codemode: await git.git_branch({ name: "feat/x" }) then await git.git_checkout({ branch: "feat/x" }). Branch off main.
Make changes. Use top-level git_status / git_add / git_commit for the inner loop.
Push with git.git_push_checked from inside codemode — pass an explicit branch ref, never push to main.
Open the draft PR with git.pr_create before replying to the user. It auto-detects GitHub vs GitLab, auto-fills title and body from your latest commit, and returns the PR/MR URL. Quote that URL in your reply. If git.pr_create fails (e.g. missing per-user token), fall back to a compare URL and tell the user what's missing — but don't just leave them with a pushed branch and no PR.
Branch naming: fix/sse-serialization, feat/per-user-auth, docs/update-readme, chore/<scope>.

If you're running locally via OpenCode CLI in `~/dev/dodo`

This is the case when the agent reading this file is running on a developer laptop, has shell access, and shares ~/dev/dodo with potentially-concurrent agents. Use the worktree workflow:

Create a worktree branch before making any changes:

git worktree add ../dodo-<short-description> -b <branch-name>

Work in the worktree directory, not in the main checkout.
Commit and push the branch, then open a PR on GitHub.
Do not merge your own PRs — let the repo owner review and merge.

Clean up the worktree after the PR is merged:

git worktree remove ../dodo-<short-description>
git branch -d <branch-name>

Why the split

Without worktrees in the local case, concurrent agents create divergent histories on main that require merge commits and conflict resolution. This has already caused confusing commit graphs and near-loss of work.

In the Dodo-session case, worktrees solve a problem that doesn't exist — there's no shared checkout. Trying to use them wastes turns and confuses the agent (it tried, the tool didn't exist, then it improvised badly).

File Map

src/index.ts — Worker router (Hono), all HTTP routes, auth middleware, session fork, admin routes
src/coding-agent.ts — per-session agent DO (extends Think, chat via Think.chat(), fibers, workspace, git, cron, prompts, snapshots, SSE). Contains the system prompt.
src/think-adapter.ts — Think integration boundary: re-exports, types (DodoConfig, MessageMetadata, SnapshotV2), adapter functions
src/agentic.ts — LLM provider construction (buildProvider), tool composition (buildToolsForThink), git tools
src/user-control.ts — per-user DO (config, sessions, memory, tasks, skills, key envelope, encrypted secrets, fork snapshots)
src/skill-registry.ts — Claude/OpenCode-compatible SKILL.md loader (parser, manifest renderer, workspace scanner, R2 asset helpers)
src/builtin-skills.ts — built-in SKILL.md content shipped with Dodo
src/shared-index.ts — global singleton DO (user allowlist, host allowlist, models cache, session shares/permissions)
src/executor.ts — DynamicWorkerExecutor wrapper for sandboxed code execution (direct API route)
src/git.ts — git helpers via @cloudflare/shell, multi-host token injection (GitHub + GitLab)
src/mcp.ts — MCP server exposing all Dodo capabilities as tools
src/crypto.ts — hybrid envelope encryption (PBKDF2 + HKDF + AES-GCM) for per-user secrets
src/auth.ts — Cloudflare Access JWT verification, user allowlist check, admin guard
src/notify.ts — push notifications via ntfy.sh (per-user topic from encrypted secrets)
src/outbound.ts — AllowlistOutbound WorkerEntrypoint for gated sandbox fetch
src/presence.ts — WebSocket presence tracking
src/sql-helpers.ts — lightweight SQLite query helpers
src/health-check.ts — health endpoint handler
src/logger.ts — structured logging helpers
src/mcp-catalog.ts — curated catalog of recommended MCP servers
src/mcp-codemode.ts — code-mode MCP endpoint (2 tools, minimal context)
src/mcp-gatekeeper.ts — MCP server auth and rate limiting
src/notify.ts — push notifications via ntfy.sh
src/onboarding.ts — guided passkey and secrets setup
src/rate-limit.ts — per-user rate limiting
src/repos.ts — known repository registry for orchestration
src/rpc-api.ts — JSON-RPC API surface
src/rpc-transport.ts — JSON-RPC transport layer
src/share.ts — session sharing (tokens, permissions, cookies)
src/types.ts — shared TypeScript types
test/dodo.test.ts — integration tests via vitest-pool-workers
public/index.html — three-panel web UI (mobile-responsive)
public/docs.html — architecture documentation page
public/howto.html — task-oriented how-to guides

Architecture

Worker (Hono router + CF Access auth)
+-- SharedIndex DO (global singleton)
|   +-- users, host_allowlist, models_cache, session_shares, session_permissions
+-- UserControl DO (one per user, idFromName(email))
|   +-- user_config, sessions, memory_entries, tasks, key_envelope, encrypted_secrets, fork_snapshots
+-- CodingAgent DO (one per session, extends Think<Env, DodoConfig>)
|   +-- Think tables: assistant_sessions, assistant_messages, _think_config, cf_agents_fibers
|   +-- Dodo tables: metadata, message_metadata, prompts, cron_jobs
|   +-- Workspace (@cloudflare/shell, SQLite + optional R2 spill)
|   +-- createExecuteTool (codemode with workspace + git providers, gated outbound)
|   +-- One Think session per DO — single-session invariant
|   +-- Durable fibers — async prompts survive DO eviction
+-- AllowlistOutbound (WorkerEntrypoint, self-referencing service binding)

Key Invariants

One Think session per Dodo DO — never more
All Think imports through src/think-adapter.ts
No cf_agent_chat_* WebSocket messages trigger Think chat
Fiber methods use stashFiber() checkpoints — never assume resume-mid-execution
Snapshot import handles both v1 and v2
Git auth flows through Dodo's resolveRemoteToken()
SSE/WS event protocol unchanged from client perspective

System Prompt Design

The system prompt lives in src/coding-agent.ts as const SYSTEM_PROMPT. It's a static string — not dynamically assembled from config or user preferences. The CodingAgent class returns it via getSystemPrompt().

Design principles

Teach the tools, not the model. The prompt documents what tools exist and when to use each one. It doesn't teach the model how to code — that's what the model already knows. The prompt bridges the gap between the model's general capabilities and Dodo's specific tool surface.

Match reality. Every tool mentioned in the prompt must actually exist. Every constraint mentioned (30s timeout, 10-step limit, sandboxed network) must be accurate. The prompt is a contract with the model.

Concise over exhaustive. A Sonnet-class model doesn't need 1500 lines of instruction. Cover identity, tool surface, key behaviors, safety rules, and limits. Trust the model for everything else.

No sycophancy instructions. The model should be direct and technically accurate. No "Great question!" or "I'd be happy to help." Focus on the work.

Sections

Section	Purpose
Identity	"You are Dodo" — establishes the agent's name and platform
Tone and style	Concise, markdown, no emojis, prefer edits over new files
Doing tasks	Todo discipline: explicit lists of task shapes that always need todos (cloned-repo work, multi-file edits, review/audit/investigate tasks) vs. shapes that can skip them; post-compaction `todo_list` re-grounding hint; when to delegate with `task`
Workspace tools	Documents read_file, write_file, search_files, replace_in_file
Code execution	Documents codemode: sandboxed JS, fetch with auto-auth, 30s timeout
Git	Documents all git tools, auto-auth for GitHub/GitLab
Git safety	Stage specific files, clear commit messages, no force-push
Working with errors	State what failed, fix it, move on
Limits	10-step cap, ephemeral workspace, no shell

Changing the prompt

When modifying the system prompt:

Keep it factual. If you add a section about a tool, verify the tool exists in buildToolsForThink().
Test with real prompts. The prompt shapes every interaction — small wording changes can have outsized effects on behavior.
Don't duplicate tool descriptions. The AI SDK sends tool schemas automatically. The prompt should explain when and why to use tools, not re-document their parameters.
The max steps limit (10) is set in getMaxSteps() on the CodingAgent class. If you change it, update the prompt too.

Tool surface

These tools are available to the agent at runtime (built in src/agentic.ts):

Workspace tools (from @cloudflare/think/tools/workspace):

read_file — read file contents
write_file — create or overwrite a file
search_files — glob + content search
replace_in_file — find-and-replace within a file

Git tools (built in buildGitTools()):

Top-level (hot path): git_status, git_add, git_commit, git_diff
Inside codemode only (call as git.<name>): git_clone_known, git_clone, git_push, git_push_checked, git_pull, git_branch, git_checkout, git_log, git_verify_remote_branch, pr_create

The lower-frequency git tools are reachable via codemode's git provider namespace rather than as individual top-level tools. This saves roughly 1k tokens of tool-schema budget per turn without losing any capability — see buildTools() in src/agentic.ts for the split.

Code execution (from @cloudflare/think/tools/execute):

codemode — sandboxed JS execution with workspace filesystem and git access, gated outbound fetch

Typecheck (from src/typecheck.ts):

typecheck — runs tsc --noEmit against the workspace inside the CodingAgent DO. Bundles typescript and every lib.*.d.ts into the Worker so the check happens without a subprocess. Honours user tsconfig.json; refuses oversized projects (> 50 .ts/.tsx files or > 5 MB) with a structured skipped payload. Pass extraStrict: true to layer noUnusedLocals + noUnusedParameters + noImplicitReturns + noFallthroughCasesInSwitch on top of the user's tsconfig — a cheap stand-in for a real linter at zero extra bundle cost. Lib map (src/typecheck-libs.generated.ts) is checked in and regenerated by scripts/generate-typecheck-libs.mjs during npm run build. Manual validation: npm run test:typecheck-smoke.

Skill loader (src/skill-registry.ts):

skill — load a SKILL.md body on demand. The system prompt's <available_skills> block lists name + description per skill; this tool returns the full body when the model picks one. Two-stage progressive disclosure mirrors Claude Code / OpenCode.

The hot-path tools (workspace primitives, the four top-level git tools, codemode, the subagent tools, and skill) are top-level. The rest of the git surface is exposed only through codemode's provider namespaces.

Skills

Dodo supports SKILL.md files compatible with both Claude Code and OpenCode. Three sources merged into one deduplicated list (precedence: personal > workspace > builtin):

Personal — per-user, stored in UserControl DO SQLite. Created via the skill_write MCP tool or POST /api/skills. Bundled assets live in R2 under skills/{userId}/{skillName}/....
Workspace — scanned from the cloned repo's .dodo/skills/, .claude/skills/, .agents/skills/, .opencode/skill/, .opencode/skills/ directories. Read-only — promote to personal to edit.
Builtin — shipped with Dodo via src/builtin-skills.ts.

Loading model

Two-stage progressive disclosure (matches Claude Code / OpenCode):

Session start: getSystemPrompt() injects <available_skills> with name + description per enabled skill (~150 tokens each, capped at 4 KB total).
On demand: the skill tool returns the full SKILL.md body and a sampled list of bundled file paths. Bundled files are NOT auto-loaded — the model uses read to fetch.

MCP tools

skill_list — list personal skills
skill_read — get full body of a personal skill
skill_write — create/update a personal skill
skill_enable — toggle enabled flag
skill_delete — remove a personal skill
skill_import_url — fetch a SKILL.md from a URL and store it as personal

Workspace and built-in skills are visible from inside the chat (via the skill tool) but cannot be modified through the MCP CRUD surface. To edit a workspace skill, copy its body into a personal skill via skill_write.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AGENTS.md

Focus

Commands

Rules

Linting

Deploying

Git Discipline — Pick the Workflow That Matches Your Runtime

If you're a Dodo session (sandboxed clone in a Durable Object)

If you're running locally via OpenCode CLI in `~/dev/dodo`

Why the split

File Map

Architecture

Key Invariants

System Prompt Design

Design principles

Sections

Changing the prompt

Tool surface

Skills

Loading model

MCP tools

FilesExpand file tree

AGENTS.md

Latest commit

History

AGENTS.md

File metadata and controls

AGENTS.md

Focus

Commands

Rules

Linting

Deploying

Git Discipline — Pick the Workflow That Matches Your Runtime

If you're a Dodo session (sandboxed clone in a Durable Object)

If you're running locally via OpenCode CLI in ~/dev/dodo

Why the split

File Map

Architecture

Key Invariants

System Prompt Design

Design principles

Sections

Changing the prompt

Tool surface

Skills

Loading model

MCP tools

If you're running locally via OpenCode CLI in `~/dev/dodo`