Agent Runtime β The Execution Layer Nobody Is Building
What the runtime includes
The agent runtime is the union of six components:
- Compute substrate β container, microVM, or full VM
- Filesystem with snapshot and rollback semantics
- Tools β shell, code interpreter, browser, MCP servers β exposed as callable interfaces
- Network boundary defining reachability
- State model governing persistence across turns, episodes, and users
- Lifecycle controller handling start, suspend, snapshot, resume, teardown
This maps to the H (harness) and S (state) components in the RL environments taxonomy. In production, the runtime determines latency, isolation, and security posture.
Why it matters
The agent is not the model. The agent is the harness plus the model, running inside the runtime. Sculley et al. showed in 2015 that ML model code is the smallest box in a real ML system. The same pattern holds for agents: the model call is the small box, the runtime is the large one driving spend and shaping architecture.
Teams that pick a runtime as a quick integration decision at prototype time accumulate runtime debt β the agentβs behavior entangles with that runtimeβs quirks. A year later, switching runtimes becomes a six-month migration because nobody can predict which behaviors will break.
The experimentation/production split compounds this. Training rollouts want 200ms cold starts, thirty-second lifetimes, and frozen-world snapshots. Production sessions want two-second starts, forty-minute lifetimes, live internet access, and persistence across async gaps (CI waits, review cycles, multi-day workflows). Optimizing the same runtime for both produces something too slow for training and too brittle for production.
Source: Hidden Technical Debt of AI Systems: Agent Runtime β Lee Hanchung, April 2026
Related Notes
- Agent Harnesses β the orchestration layer the runtime hosts; harness handles tool execution and context, runtime handles compute and isolation
- AI Infrastructure Investment Thesis - Mid-Post Training Layer β RL environments and eval stacks as the upstream investment thesis; agent runtimes are the serving-side complement
- Claws - Persistent Looping Agents as App Replacement β persistent agent loops depend on runtime lifecycle management for long-running sessions
- Hidden Technical Debt of AI Systems - Agent Runtime β source clipping