Inference as Semantic Execution
by Nick Clark | Published March 27, 2026
Inference does not unfold as a free generative trajectory whose outputs are filtered after the fact. It proceeds along a semantic graph in which each step's admissibility is checked against the current governance class before the step is committed. Non-execution at any step halts the chain. The agent does not produce a result and then ask whether the result was admissible; it produces each transition only after the transition has been admitted, and a rejection at any point ends the inference without producing the disallowed step. This article describes the semantic graph, the per-step admissibility gate, the governance classes that the gate consults, and the halt semantics that bind them together.
Mechanism
Semantic execution represents inference as traversal of a semantic graph whose nodes are semantic states and whose edges are admissible transitions between states. A semantic state encapsulates the structured content the agent has committed to so far in the inference chain: the propositions it has accepted, the references it has bound, the obligations it has incurred, and the lineage that records how each component arrived. A transition is a candidate change to the semantic state that, if admitted, replaces the current state with the successor.
The admissibility gate evaluates every candidate transition before it is committed. The gate's evaluation function takes as inputs the current semantic state, the candidate transition, the governance class assigned to the inference chain, the agent's policy reference, and the evidence vector held by the confidence governor. The function produces one of three outcomes: admit, in which case the transition is committed and the successor state replaces the current state; reject, in which case the transition is recorded as a rejection event without affecting the state; or decompose, in which case the transition is replaced by a sequence of finer-grained candidate transitions each of which is itself subject to evaluation.
Halt semantics are structural. When a candidate transition is rejected and no alternative candidate is available at the current state, the inference chain halts. Halting does not produce the rejected transition's output; it produces a halt record that names the state at which inference ceased, the candidate transition that was rejected, and the rule under which the rejection was issued. The agent's downstream consumers receive the halt record rather than a partial or substitute output. There is no fallback to a heuristic completion path.
Trust slope validation operates alongside the admissibility gate. Trust slope is the trajectory of confidence and evidence quality across the steps already committed to the chain. A negative slope indicates that successive transitions have been admitted under progressively weaker evidence, and the gate consults the slope as a structural input. A slope that violates the policy-declared envelope is itself sufficient to reject a candidate transition, even if every other admissibility predicate would have admitted it. The slope is a chain-level property, not a step-level property, and it makes within-chain degradation a structural rejection condition.
Operating Parameters
The mechanism is configured by parameters declared in the policy reference. The semantic-state schema declares the structured fields that constitute a state, the canonicalization rules that determine state equality, and the integrity invariants that every state must satisfy. The transition grammar declares the structure of candidate transitions, the action classes they may represent, and the side-effect classes they may emit. The governance-class catalog declares the named classes under which an inference chain may execute, each with its own admissibility predicate, trust-slope envelope, and halt rule.
The admissibility predicate associated with each governance class is a deterministic function over the structured inputs declared in policy. The predicate is not a soft score; it is a boolean over field-level conditions whose evaluation is fully reproducible. The trust-slope envelope is a pair of bounds on the rate of change of confidence and evidence quality across the chain, expressed in the same units as the values being bounded. Violation of either bound is a rejection condition.
Halt parameters bound the chain-level behavior of the mechanism. A maximum chain length bounds the depth to which inference may recurse before a halt is forced. A minimum admissibility margin requires that admitted transitions clear their predicate by a configured margin rather than satisfying it at the boundary. A halt-record schema declares the structured fields produced by the halt, ensuring that downstream consumers can parse halt records uniformly across governance classes.
Alternative Embodiments
The mechanism accommodates multiple embodiments without altering its structural commitments. In a serial embodiment, candidate transitions are evaluated one at a time and the chain advances only on admission. In a parallel embodiment, multiple candidates are evaluated concurrently and the gate selects the highest-margin admissible candidate, recording the rejected candidates as evidence of the decision. In a speculative embodiment, candidate transitions are evaluated and provisionally committed in parallel with subsequent candidates, with rollback occurring if a predecessor is later rejected on a slower-arriving signal.
The decomposition outcome admits multiple realizations. In a planner-driven embodiment, decomposition consults a planning subsystem that proposes a sequence of finer-grained candidates whose composition matches the rejected candidate's intent. In a grammar-driven embodiment, decomposition rewrites the rejected candidate using the transition grammar's productions until each leaf candidate is individually admissible. In a model-driven embodiment, a learned model proposes the decomposition and the gate evaluates each leaf independently. The choice of decomposition realization is a deployment parameter, not a structural commitment.
The governance-class assignment to a chain admits multiple embodiments. In a static embodiment, a chain's class is fixed at the chain's origin and cannot change during execution. In a dynamic embodiment, the class may be updated by a separate authority during execution, with the update itself being a recordable event subject to its own admissibility. In a layered embodiment, multiple classes apply concurrently and the effective predicate is their conjunction.
Composition with Adjacent Mechanisms
Semantic execution composes with the revocable-permission mechanism by sharing the gate. The action class that a candidate transition represents is checked against the current scope held by the confidence governor. Scope contraction during a chain therefore halts the chain at the next transition whose class falls outside the contracted scope, without requiring a separate halt mechanism. The composition makes scope changes effective within a chain rather than only across chains.
Semantic execution composes with the skill-gating pipeline by treating the activation status of each skill referenced in a candidate transition as an admissibility input. A revoked skill renders any transition that references it inadmissible, and the chain halts at the first such transition. The composition makes skill revocation effective within a running chain, propagating immediately rather than waiting for the chain to complete.
Semantic execution composes with the lineage subsystem by appending every admitted transition, every rejection event, every decomposition, and every halt record to the agent's structured lineage. Forensic review of an inference chain can therefore reconstruct, for any output produced, the precise sequence of candidates considered, the rules under which each was admitted or rejected, the trust slope at each step, and the policy version under which the chain executed. The reconstruction is exact because the gate is deterministic and the policy version is recorded.
Prior-Art Distinction
Conventional inference architectures, including autoregressive language models and chain-of-thought systems, treat inference as an unconstrained generative trajectory whose outputs may be filtered or scored after generation. Filtering is post-hoc: the model produces a candidate output, and a separate classifier or rule decides whether the output is acceptable. The disallowed output is generated before it is suppressed, the resources to produce it are consumed, and any side effects emitted during generation, such as cache contamination or adversarial state shifts, have already occurred.
Conventional planning systems with admissibility checks operate at the plan level rather than at the inference-step level. A plan is constructed in full and then evaluated for admissibility, with rejection forcing replanning. The granularity is coarser than per-step, and within-plan trust degradation cannot trigger early halt because the plan is evaluated only at the boundary.
The mechanism described here departs from both patterns by evaluating admissibility at every transition, by halting the chain structurally on rejection, and by treating chain-level trust slope as a per-step rejection condition. Generation does not occur before admission; the gate is the precondition for the transition, not its post-hoc reviewer. The result is a system in which problematic trajectories are prevented from forming, not merely suppressed after they have formed.
Disclosure Scope
This article describes the semantic-execution mechanism as defined in Chapter 8 of the cognition patent. The mechanism is disclosed at a level sufficient for a person of ordinary skill in the art of governed autonomous systems to construct an embodiment without undue experimentation. The semantic graph, admissibility gate, governance-class catalog, trust-slope envelope, and halt semantics are declared as policy-bound rather than implementation-bound, and the disclosure encompasses any embodiment in which inference proceeds as graph traversal subject to per-step admissibility evaluation against a governance class.
The disclosure scope extends to embodiments in which evaluation is serial, parallel, or speculative; in which decomposition is planner-driven, grammar-driven, or model-driven; and in which governance-class assignment is static, dynamic, or layered. It extends to embodiments in which the trust-slope envelope is expressed as bounds on first-order rate of change, as bounds on higher-order trajectory features, or as a composition of both. It extends to embodiments in which the gate's evaluation function is realized as a tabulated predicate, a compiled rule set, a deterministic interpretation of a learned model, or any composition thereof, provided the function is deterministic and its inputs are drawn from the canonical fields declared in policy.
The disclosure does not encompass embodiments in which inference proceeds without per-step admissibility evaluation, in which rejection is realized by post-hoc filtering rather than by halt at the gate, or in which the policy reference does not declare the governance-class catalog and the trust-slope envelope. Such embodiments lack the structural commitments that make inference governable within the loop and are outside the scope of the present disclosure.
The disclosure further encompasses embodiments in which the semantic graph is materialized lazily, with successor states constructed only on admission of the transition leading to them, and embodiments in which the graph is materialized eagerly to support look-ahead evaluation. The structural commitment that admission precedes commitment is preserved across both, because eager construction of successor states is permitted only when their construction is itself governed by the same admissibility predicate. The look-ahead is therefore not a bypass of the gate but a precomputation under the gate, and any state whose construction was admitted may be subsequently revoked if the chain's trust slope deteriorates before the state is reached in execution order.