neutral

Phase 1: Durable Single-Agent Loop (MVP)

Status: Completed (2026-02-09)

Goal: A minimal agent that survives crashes, retries failures, and runs on Temporal.

Scope:

Tasks and subtasks:

Repository and runtime discovery
- Identify language, build system, and Temporal SDK version
- Locate existing app skeleton and config
- Confirm how workers are started and deployed
Core workflow
- Define AgentRunInput and AgentConfig
- Implement deterministic state machine loop (decide -> act -> observe -> repeat)
- Implement stop conditions (max steps, goal achieved, timeouts, explicit halt)
- Isolate non-determinism in activities
Activities
- LLMDecideActivity: accept current state, return structured decision
- ToolExecuteActivity: strict schema validation, per-tool execution
- ObserveActivity: update state from tool results
Tool registry
- Define ToolRegistry, ToolDefinition, versioning, content hash
- Implement registry storage and lookup
- Enforce immutability for published versions
Observability
- Workflow-level attributes (run ID, step count, state hash)
- Activity-level metrics (latency, tokens, retries)
- OTEL spans per decision cycle and tool call
Example tools and sample agent
- Implement http_get, calculator, key_value_store
- Implement sample “research assistant” agent
- Demonstrate retries, timeouts, clean shutdown
Validation and test harness
- Unit tests for decision loop and schema validation
- Integration test that survives worker restart
- Trace verification in dev environment

Deliverables:

Dev environment checks:

Run instructions:

Dependencies:

Files to add/change (TBD after repo discovery) Initial scaffolding created: