Execution Flow

Running a hank involves a complex sequence of events: state machines transition, processes spawn, files are watched, and checkpoints are created. This guide explains the entire execution flow, helping you debug problems, design better hanks, and know what to expect at each stage.

The Big Picture

Execution flows through four phases: Server Startup, Execution Planning, Codon Execution, and Completion. Each codon goes through its own lifecycle within the broader run.

Execution Flow Overview

Server Startup

When you run hankweave run, the runtime initializes through a sequence of steps. Each one must succeed before the first codon can execute.

Initialization Sequence

Lock File Check — Prevents multiple instances from running on the same project.
Checkpoint System — Initializes git-based checkpointing in .hankweave/.
State Manager — Loads or creates state.json for run tracking.
Event Journal — Opens event storage for WebSocket replay.
Sentinel Manager — Prepares the parallel observer system.
WebSocket Server — Starts listening for client connections.

Startup Logic Flow

Crash Recovery

Sometimes things go wrong. The process gets killed, the machine reboots, or your cat walks across the keyboard. When Hankweave detects a crashed run (a stale lock file or dead process), it marks the run as crashed and offers recovery options:

Rollback to last success — Restore files to the last good checkpoint.
Continue from crash point — Pick up where things left off. If a codon was mid-execution, it may need manual intervention.

How Crash Detection Works: The lock file contains a process ID (PID) and a heartbeat timestamp. If the PID doesn't exist or the heartbeat is stale (>2 minutes), the run is considered crashed.

Execution Planning

Before any codon runs, Hankweave builds an execution plan—a flattened sequence of every codon that will execute, including any loop expansions. Think of it as a flight plan: the runtime needs to know where it's going before it takes off.

Plan Building

The execution planner reads your hank.json and produces a concrete sequence:

Loop Structure

Regular codons map directly to the plan. Loops are expanded with iteration suffixes (#0, #1, #2) to give each execution a unique ID for logging, checkpointing, and rollbacks.

Lazy Loop Expansion

Here's a subtlety: loops don't expand all at once. Hankweave expands one iteration at a time.

The first iteration (#0) is added to the plan.
After it completes, the runtime checks the loop's termination conditions.
If the loop hasn't terminated, the next iteration (#1) is added to the plan.
This repeats until the termination condition is met.

This is important for contextExceeded termination, as it allows the loop to stop mid-stream if an agent hits its context limit. It also keeps the plan manageable for loops that might run for dozens of iterations.

Codon Execution Lifecycle

Each codon progresses through a state machine with well-defined transitions. Understanding these states is key to debugging when a run gets stuck or fails unexpectedly.

State Machine

Codon State Machine

State Descriptions

State	What's Happening	Duration
preparing	Running rig setup (copying files, running commands).	Seconds
starting	Spawning the agent process (e.g., Claude Code).	Seconds
initializing	Waiting for the agent process to start and send its session ID.	Seconds
running	The agent is actively working—this is where the magic happens.	Minutes to hours
completing_sentinels	The agent has finished; waiting for background sentinel tasks to complete.	Seconds
completed	The codon succeeded, and a checkpoint has been created.	Terminal
failed	An error occurred; see the failure reason for details.	Terminal
skipped	The user chose to skip this codon.	Terminal

Key Insight: The running state is where your agent does its work. Everything else is setup and teardown. If a codon gets stuck in initializing, the agent process likely failed to start—check its logs.

What Happens in Each State

Preparing

This is the rig setup phase. If your codon has rigSetup operations, they run here:

Text

{
  "rigSetup": [
    {
      "type": "command",
      "command": { "run": "mkdir -p src/schema", "workingDirectory": "project" }
    },
    {
      "type": "copy",
      "copy": { "from": "./templates", "to": "src" }
    }
  ]
}

Commands execute in order. If any fails, the codon transitions to failed with a clear error message.

Starting

Hankweave spawns the agent process. For Claude Code, this means running the claude command with the appropriate flags:

--model — The model to use
--resume — The session ID, if continuing a previous session
--append-system-prompt — Any additions to the system prompt
Environment variables from the env field

Initializing

The agent process is running but has not yet reported its session ID. Hankweave watches the agent's log file for this initialization message, which usually appears within a few seconds.

Running

The agent is executing your prompt. During this state:

Events stream via WebSocket (assistant actions, tool results, file updates).
Sentinels observe the event stream in parallel.
File changes are tracked and reported.
Costs accumulate as tokens are consumed.

Completing Sentinels

After the agent's main work is done, Hankweave waits for any pending sentinel tasks to complete. This includes:

Draining sentinel event queues
Finishing final LLM calls
Writing any output files

Terminal States

Once a codon reaches completed, failed, or skipped, it is finished and will not transition further.

completed: A checkpoint is created, files are committed, and costs are finalized.
failed: Error information is stored, and partial costs are tracked.
skipped: The user chose to skip; a skip checkpoint is created to record this.

Checkpointing

Every successful codon creates a git checkpoint—a commit capturing the exact file state. This is your time machine.

Checkpoint Types

Type	When Created	Purpose
`rig-setup`	After rig commands complete	Rollback to pre-agent state
`completed`	After successful codon	Primary rollback target
`error`	On codon failure	Capture error state for debugging
`skipped`	After user skip	Track the skip in the run history

Checkpoint Flow

Checkpoint Creation Sequence

What Gets Checkpointed

Files matching your checkpointedFiles patterns are committed. See Checkpointed Files in Codons for the complete reference on glob patterns and tracking behavior.

Event System

During execution, Hankweave emits events over a WebSocket connection. These events power the UI, enable custom integrations, and provide the raw data for sentinels.

Event Categories

Events fall into four categories:

Category	Events	Journaled	Broadcast
Server State	`codon.started`, `codon.completed`, `error`, `state.transition`	Yes	Yes
Agentic Backbone	`assistant.action`, `tool.result`, `file.updated`	Yes	Yes
Sentinel	`sentinel.loaded`, `sentinel.output`, `sentinel.triggered`	Yes	Yes
Connection	`server.ready`, `pong`, `history.batch`	No	To sender only

Server State, Agentic Backbone, and Sentinel events are journaled (saved to disk) and broadcast to all connected clients. Connection events are ephemeral and exist only for the current session.

Event Flow

Event Types

Key Events

codon.started — A codon began execution.

Text

{
  "type": "codon.started",
  "data": {
    "codonId": "generate-schema",
    "codonName": "Generate Schema",
    "sessionId": "abc-123",
    "startTime": "2024-01-15T10:30:00Z"
  }
}

assistant.action — The agent performed an action.

Text

{
  "type": "assistant.action",
  "data": {
    "codonId": "generate-schema",
    "action": "tool_use",
    "toolName": "Write",
    "content": "Writing schema file..."
  }
}

codon.completed — A codon finished.

Text

{
  "type": "codon.completed",
  "data": {
    "codonId": "generate-schema",
    "success": true,
    "cost": 0.0234,
    "duration": 45000
  }
}

Run Lifecycle

A run is the top-level container for a sequence of codon executions. A new run is created each time you start hankweave run.

Run Types

Type	Description	When Used
`fresh`	A new run from scratch.	The first execution, or an explicit fresh start.
`continuation`	A run that resumes from a checkpoint.	After a rollback, or when continuing previous work.

Run States

Codon 8 States

A run is running while its codons execute. It transitions to completed if all codons finish successfully, failed if a codon fails, or crashed if a crash is detected during recovery.

Continuation Runs

When you roll back to a previous checkpoint and restart, Hankweave creates a continuation run. This lets you fix a problem and try again without losing your history.

Branches Per Run

In this example, Run 2 starts from the checkpoint created after codon-2 in Run 1. It re-executes codon-3 and then proceeds to codon-4.

Autostart and Manual Mode

Hankweave can run codons automatically in sequence or wait for your command.

Autostart Mode (Default)

With autostart: true, Hankweave automatically starts the next codon as soon as the previous one completes.

Continuation Modes

Manual Mode

With autostart: false, Hankweave idles after each codon and waits for an explicit codon.next command from a client. This is useful for debugging, step-through execution, or inspecting results between codons.

Cost Tracking

Hankweave tracks LLM costs throughout execution, so you always know what you're spending.

Cost Accumulation

Costs are tracked at multiple levels:

Per codon — currentCost while running, finalCost when complete.
Per run — The sum of all codon costs in the run.
All time — The sum of costs across all runs.

Cost tracking flow

Cost Events

Cost updates stream via WebSocket as tokens are used.

Text

{
  "type": "token.usage",
  "data": {
    "codonId": "generate-schema",
    "inputTokens": 1500,
    "outputTokens": 800,
    "totalCost": 0.0234
  }
}

Common Issues

Codon Stuck in "initializing"

The agent process started but never sent a session ID, which usually means it crashed during startup.

Causes:

The agent crashed immediately after launch.
Authentication issues (e.g., an invalid or missing API key).
Network problems preventing the agent from initializing.

Debug:

Check the agent's log file (.hankweave/runs/<run-id>/<codon>-claude.log).
Verify that API keys are set correctly.
Try running the agent's command manually with the same parameters.

Lock File Errors

You see a "Server already running" error when you know it isn't.

Cause: A previous run crashed without cleaning up the lock file.

Fix: The lock file is located at .hankweave/server.lock. Hankweave should detect stale locks automatically (via its heartbeat mechanism), but if it fails to, you can remove this file manually.

Checkpoint Failures

The codon completes, but the git checkpoint fails.

Causes:

The git repository in .hankweave/ is missing or corrupt.
Uncommitted changes in tracked files are causing merge conflicts.
Disk space issues.

Debug:

Check that .hankweave/ exists and contains a valid .git directory.
Run git status in the execution directory to check for conflicts.
Check your available disk space.

Codons — The atomic unit of work
Hanks — How codons sequence into programs
Checkpoints — Git-based time travel
State File Reference — Complete state.json documentation
WebSocket Protocol — Event schemas and API specification

Next Steps

Now that you understand the execution flow:

Getting Started — Run your first hank.
Debugging — Learn what to do when things go wrong.
WebSocket Protocol — Build custom integrations on the event stream.

Sentinels State Machine

Execution Flow

The Big Picture

Server Startup

Initialization Sequence

Crash Recovery

Execution Planning

Plan Building

Lazy Loop Expansion

Codon Execution Lifecycle

State Machine

State Descriptions

What Happens in Each State

Preparing

Starting

Initializing

Running

Completing Sentinels

Terminal States

Checkpointing

Checkpoint Types

Checkpoint Flow

What Gets Checkpointed

Event System

Event Categories

Event Flow

Key Events

Run Lifecycle

Run Types

Run States

Continuation Runs

Autostart and Manual Mode

Autostart Mode (Default)

Manual Mode

Cost Tracking

Cost Accumulation

Cost Events

Common Issues

Codon Stuck in "initializing"

Lock File Errors

Checkpoint Failures

Related Pages

Next Steps