Context Continuity

Holon agents are meant to continue work across many turns, wakeups, tasks, and model calls. Context continuity is the product contract that lets an agent keep the current objective, important results, and provenance visible without replaying the entire transcript.

Holon does this by treating context as a runtime projection, not as a chat log. The durable record remains available for audit and recovery, while each model turn receives a bounded selection of the information that matters now.

What the Model Sees

A Holon prompt is assembled from several sections with different jobs:

| Section | Purpose |
| --- | --- |
| Current input and continuation anchor | The event that woke the agent and the trusted operator intent it continues. |
| Current work item | The active objective, durable plan, todo list, and any waiting state. |
| Working memory | A compact snapshot of current state and follow-ups derived from runtime records. |
| Relevant episode memory | Archived completed work selected by relevance and budget. |
| Recent runtime evidence | Recent messages, result briefs, task results, tool executions, and wakeups when they are directly useful. |
| Execution environment | The current workspace, runtime capabilities, and scoped guidance. |

These sections are not separate sources of truth competing with each other. They are projections of runtime evidence. The durable ledger remains the audit trail; the prompt is the budgeted view used for the next model decision.
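
To make the idea of a budgeted view concrete, here is a minimal sketch of assembling a prompt from sections. It is not Holon's implementation: `PromptSection`, `estimate_tokens`, `assemble_prompt`, the `pinned` flag, and the four-characters-per-token estimate are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PromptSection:
    """Hypothetical container for one rendered prompt section."""
    name: str
    body: str
    pinned: bool = False  # assumption: pinned sections are always rendered


def estimate_tokens(text: str) -> int:
    # Crude stand-in for a real tokenizer: roughly four characters per token.
    return max(1, len(text) // 4)


def assemble_prompt(sections: list[PromptSection], budget: int) -> str:
    """Render sections in order; optional sections are skipped if they do not fit."""
    # Reserve room for pinned sections first so optional ones cannot crowd them out.
    remaining = budget - sum(estimate_tokens(s.body) for s in sections if s.pinned)
    parts = []
    for section in sections:
        if not section.pinned:
            cost = estimate_tokens(section.body)
            if cost > remaining:
                continue  # left out of this prompt; the durable record is untouched
            remaining -= cost
        parts.append(f"## {section.name}\n{section.body}")
    return "\n\n".join(parts)
```

In this sketch, skipping a section only changes what the next model call sees. Nothing is deleted from the underlying record, which matches the distinction between the ledger and the prompt described above.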

Turns Are the Causal Unit

Holon uses runtime turns as the main way to preserve continuity. A turn is one activation of an agent, triggered by an operator message, a task result, an external wake, a scheduler tick, or a provider recovery attempt.

The important part is not only the trigger. A turn also links what happened:

This keeps related facts together. For example, a CI wake can be shown as a continuation of the original "fix this issue" request instead of replacing that request as the newest intent. A completion report can stay linked to the work item it completed instead of becoming an unrelated summary.
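
The CI example can be sketched as a chain of linked turns. The `Turn` type, its field names, the trigger strings, and `continuation_anchor` below are hypothetical and chosen only to illustrate the linking; Holon's actual turn schema is not shown in this document.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Turn:
    """Hypothetical turn record; field names are assumptions."""
    turn_id: str
    trigger: str                     # e.g. "operator_message", "task_result", "external_wake"
    summary: str
    continues: Optional[str] = None  # id of the turn this one continues, if any


def continuation_anchor(turns: dict[str, Turn], turn_id: str) -> Turn:
    """Follow `continues` links back to the turn that started the chain."""
    seen: set[str] = set()
    current = turns[turn_id]
    while current.continues is not None and current.continues in turns:
        if current.turn_id in seen:
            raise ValueError(f"continuation cycle at turn {current.turn_id}")
        seen.add(current.turn_id)
        current = turns[current.continues]
    return current


turns = {
    "t1": Turn("t1", "operator_message", "fix this issue"),
    "t2": Turn("t2", "task_result", "patch applied", continues="t1"),
    "t3": Turn("t3", "external_wake", "CI finished", continues="t2"),
}
anchor = continuation_anchor(turns, "t3")
```

Here the CI wake `t3` resolves back to the operator request `t1`, so the newest event is presented as a continuation of the original intent rather than as a new one.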

Projection Is Not Transcript Truncation

Simple truncation drops the oldest text once a context window fills. That is not enough for long-lived agents: old text may contain the operator's original intent, a completed decision, a task-result fact, or the reason an agent is waiting.

Holon uses projection instead:

  1. Pin what must remain authoritative, such as trusted operator intent, active work item state, waits, and lifecycle transitions.
  2. Select useful recent turns by continuation chain, retention priority, and budget.
  3. Fold low-value runtime noise, such as duplicate wakeups, retries, fallback attempts, no-op scheduler ticks, or repeated pending polls.
  4. Render older completed work as structured episodes with source refs and authority boundaries.

The result is a shorter prompt that still preserves why the agent is acting, what it has already done, and which evidence supports the current state. This turn-based context projection is Holon's intended direction for preserving continuity without rendering an unbounded transcript.
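
The four steps above can be sketched as a single pass over turn records. This is an illustration under stated assumptions, not Holon's algorithm: `TurnRecord`, its flags, the integer `cost` and `priority` fields, and the output line format are invented here, and selection uses priority plus recency as a simple stand-in for continuation chains and retention scoring.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class TurnRecord:
    """Hypothetical turn record; all fields are assumptions for this sketch."""
    turn_id: int
    kind: str                # e.g. "operator_intent", "task_result", "scheduler_tick"
    text: str
    cost: int                # estimated prompt cost of rendering this turn in full
    priority: int = 0        # stand-in for a retention priority
    pinned: bool = False     # must remain authoritative
    noise: bool = False      # low-value runtime noise
    completed: bool = False  # older, completed work


def project(turns: list[TurnRecord], budget: int) -> list[str]:
    pinned = [t for t in turns if t.pinned]
    noise = [t for t in turns if t.noise and not t.pinned]
    completed = [t for t in turns if t.completed and not (t.pinned or t.noise)]
    candidates = [t for t in turns if not (t.pinned or t.noise or t.completed)]

    # 1. Pin: always rendered, and charged against the budget first.
    lines = [f"[pinned] {t.text}" for t in pinned]
    remaining = budget - sum(t.cost for t in pinned)

    # 2. Select: highest priority, then newest, while the budget allows.
    selected = []
    for t in sorted(candidates, key=lambda t: (t.priority, t.turn_id), reverse=True):
        if t.cost <= remaining:
            selected.append(t)
            remaining -= t.cost
    lines += [f"[turn {t.turn_id}] {t.text}" for t in sorted(selected, key=lambda t: t.turn_id)]

    # 3. Fold: collapse noise into one count per kind (not budgeted in this sketch).
    if noise:
        counts = Counter(t.kind for t in noise)
        lines.append("[folded] " + ", ".join(f"{n} x {kind}" for kind, n in sorted(counts.items())))

    # 4. Render: older completed work becomes an episode line that keeps its source refs.
    if completed:
        refs = ", ".join(str(t.turn_id) for t in completed)
        lines.append(f"[episode] {len(completed)} completed turns (source turns: {refs})")
    return lines
```

Unlike truncation, the oldest turn survives here if it is pinned, and a large recent turn can be left out in favor of smaller, more useful ones.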

Compaction Preserves Provenance

Compaction is a way to keep context bounded; it is not permission to rewrite history.

When older turn ranges age out of the hot prompt budget, Holon can archive them as structured episodes. An episode records the covered turn range, source turn IDs, source references, decisions, results, verification, unresolved items, operator intents, and model inferences.

Model-generated summaries may help describe an episode, but they are evidence, not authority. They do not change the trust level of an input, overwrite operator intent, decide work item state, or determine what may be discarded. The source refs remain available for audit and recovery.
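
A minimal sketch of archiving a turn range as an episode follows. The `SourceTurn` and `Episode` types, the `trust` labels, and the `ledger:turn/...` reference format are hypothetical. The point it illustrates is that a model-written summary is stored next to the provenance fields and never replaces them.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SourceTurn:
    """Hypothetical archived turn; `trust` and `source_ref` formats are assumptions."""
    turn_id: int
    source_ref: str   # e.g. "ledger:turn/12" (illustrative format only)
    trust: str        # "operator" or "model" in this sketch
    text: str


@dataclass(frozen=True)
class Episode:
    first_turn: int
    last_turn: int
    source_turn_ids: tuple[int, ...]
    source_refs: tuple[str, ...]
    operator_intents: tuple[str, ...]
    model_inferences: tuple[str, ...]
    model_summary: Optional[str] = None  # evidence only, never authority


def archive_episode(turns: list[SourceTurn], model_summary: Optional[str] = None) -> Episode:
    if not turns:
        raise ValueError("an episode must cover at least one turn")
    ordered = sorted(turns, key=lambda t: t.turn_id)
    return Episode(
        first_turn=ordered[0].turn_id,
        last_turn=ordered[-1].turn_id,
        source_turn_ids=tuple(t.turn_id for t in ordered),
        source_refs=tuple(t.source_ref for t in ordered),
        # Intents and inferences are copied from the source turns by trust level;
        # the summary text is never parsed to derive or override them.
        operator_intents=tuple(t.text for t in ordered if t.trust == "operator"),
        model_inferences=tuple(t.text for t in ordered if t.trust == "model"),
        model_summary=model_summary,
    )
```

Because the episode is frozen and built only from source turns, a summary that misstates the request cannot alter the recorded operator intent, and the source refs still point back to the original records.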

Work Items, Tasks, Briefs, and Final Answers

Continuity depends on keeping runtime objects distinct:

Because these objects are linked through turns, the agent can answer questions such as "what did we do for that issue?" or "why are you waiting?" without searching an unstructured transcript.

The User-Facing Contract

Users should be able to rely on these behaviors:

The exact internal schemas, token budgets, and retention scores are implementation details. The visible contract is that Holon preserves continuity through typed runtime evidence rather than by treating the chat transcript as the only source of memory.

See Also