Confidence History Calibration

Nick Clark

Confidence History Calibration

by Nick Clark | Published March 27, 2026 | PDF

Confidence values produced by the agent's evaluation function are continuously calibrated against a rolling history of observed-versus-claimed deviation. When the agent's claimed confidence systematically exceeds its observed performance, future confidence is tightened by a bounded calibration factor, and every calibration adjustment is recorded in an audit-required log that cannot be elided.

Mechanism

History calibration is the structural mechanism by which the agent's confidence-evaluation function is continuously aligned with its observed behavioral record. The mechanism is defined in Chapter 5 of the cognition patent as a deterministic update rule whose inputs are a rolling window of confidence-claim and outcome-observation pairs and whose output is a calibration factor applied to all subsequent confidence values. The factor is bounded, the update rule is auditable, and every adjustment is recorded in the agent's lineage as an immutable calibration event.

At each evaluation cycle, the agent records two structured fields: the confidence the agent claimed at the moment of action, and the outcome observed after the action resolved. The outcome is reduced to a canonical observation in the agent's outcome vocabulary, which classifies the result as a success, a partial success, a failure, or an indeterminate outcome. The pair is appended to the rolling history window, which is bounded in length by a policy-reference parameter and is content-addressed so that no claim or observation can be modified after the fact.

The calibration update rule operates on the contents of the rolling window. It computes the empirical reliability of the agent's claims at each confidence band, comparing the rate of successful outcomes within the band to the claimed confidence at the band. Where the empirical reliability falls below the claimed confidence by more than a structural margin, the calibration factor is tightened, reducing all subsequent confidence values in that band. Where the empirical reliability matches or exceeds the claim, the factor is relaxed toward unity but not above it.

Tightening is asymmetric and bounded. The structural property of the rule is that the factor can move monotonically toward greater conservatism quickly, but can relax toward less conservatism only slowly and only within a bound set by the policy reference. This asymmetry is the mechanism by which the agent rapidly responds to evidence of overconfidence while resisting spurious relaxation driven by short-window noise. The bound on the factor is a structural floor below which the agent's confidence cannot be inflated regardless of recent positive outcomes.

Audit requirement is structural rather than policy-imposed. Every application of the calibration factor to a confidence value is recorded as a calibration event in the agent's lineage, with the event identifying the rolling-window contents that motivated the factor, the band to which the factor applies, and the resulting adjustment. The lineage is append-only and content-addressed, so a calibration event cannot be elided or rewritten. Downstream auditors can reconstruct, from the lineage alone, the full calibration trajectory that produced any confidence value the agent ever emitted.

Operating Parameters

The calibration mechanism is parameterized by the rolling-window length, the band partition, the structural margin, the tightening rate, the relaxation rate, and the structural floor. Each parameter is specified declaratively in the policy reference and is auditable without inspection of the underlying confidence model.

Rolling-window length governs the number of claim-observation pairs retained for calibration. Short windows respond quickly to changing conditions but are noisy; long windows are stable but slow. The window length is selected per deployment according to the volatility of the operating environment and the consequence class of the actions being gated.

The band partition divides the confidence range into intervals against which empirical reliability is computed independently. Fine partitions support precise per-band calibration but require more data to estimate reliably; coarse partitions are robust but less expressive. The structural margin specifies how far observed reliability must fall below claimed confidence before tightening is triggered, and is set to the smallest deviation that the policy reference treats as actionable.

Tightening and relaxation rates govern how rapidly the calibration factor moves in response to evidence. The asymmetry between the rates is structural: tightening rate exceeds relaxation rate by a factor specified in the policy reference, and this asymmetry cannot be reduced without a policy-reference amendment that is itself recorded in the lineage. The structural floor sets the maximum confidence the agent is permitted to claim regardless of recent evidence and is the ultimate ceiling on the calibration factor.

Alternative Embodiments

Embodiments differ in the choice of empirical reliability estimator. A frequentist embodiment computes reliability as the success rate within the band. A Bayesian embodiment treats reliability as a posterior over a Beta prior whose parameters are set by the policy reference. A robust embodiment uses a trimmed mean to discount outlier outcomes. The embodiments are structurally equivalent under the patent's claims; the distinction is operational and concerns sensitivity to noise.

Embodiments differ in the locus of calibration. In a global embodiment, a single calibration factor applies to all confidence values. In a per-skill embodiment, each skill in the capability registry maintains its own factor calibrated against its own outcome history. In a per-context embodiment, factors are partitioned by environmental context, supporting deployments in which the agent's reliability varies across operating regimes.

Embodiments differ in the application of the factor. A multiplicative embodiment scales the raw confidence value by the factor. A clipping embodiment caps the confidence at a threshold derived from the factor. A monotonic-mapping embodiment transforms the confidence through a band-specific mapping function whose shape is determined by the factor. The patent's claims govern all embodiments in which the factor is bounded, the update rule is asymmetric, and every application is recorded in an audit-required log.

A federated embodiment supports deployments in which the rolling history is distributed across multiple agent instances, with each instance contributing claim-observation pairs to a shared calibration store. The federated embodiment requires that the store be content-addressed and that contributions be signed, preserving the structural property that the lineage cannot be elided.

Composition With Adjacent Mechanisms

The calibration mechanism composes with the broader confidence-governance subsystem by intercepting every confidence value before it is consumed by downstream gates. The raw confidence emitted by the evaluation function passes through the calibration factor before being compared against authorization thresholds, so all downstream consumers see calibrated values exclusively.

Composition with the capability-genealogy subsystem is structural. When a derived capability is admitted, its initial calibration factor is inherited as the meet of its parents' factors, mirroring the bound-inheritance property. Composition with the multimodal evaluation pipeline is direct: the per-mode evidence vector that justified a skill's admission is used as the initial outcome record for the skill's calibration history, providing a non-empty starting point for subsequent calibration.

Distinction From Prior Art

Prior art in confidence calibration typically uses isotonic regression or temperature scaling applied at training time and held fixed at inference. Such systems cannot adapt to runtime distribution shift and produce no audit trail of calibration adjustments. Prior art in online learning adapts at runtime but does not enforce asymmetric bounds and does not produce immutable calibration events.

The mechanism's distinguishing structural properties are the asymmetric bounded update rule, the structural floor that no calibration can pierce, and the audit-required lineage of calibration events. These properties together support both safety guarantees and external auditability and are not present in prior calibration systems.

Implementation Considerations

A faithful implementation must keep the rolling history immutable from the moment of write. Implementations that overwrite or summarize history entries in place destroy the audit trail and undermine the structural floor; the reference implementation accordingly content-addresses every entry and writes only append-only references into the calibration index. Aggregations such as per-band success rates are recomputed from the immutable entries on each calibration cycle rather than maintained as mutable counters.

Outcome classification is a structural rather than statistical concern. The mapping from raw outcomes to canonical observations must be stable across calibration cycles; reclassification of historical outcomes invalidates prior calibration events and requires that the affected events be re-emitted as corrections rather than silently overwritten. The policy reference specifies the outcome vocabulary and its mapping function, and amendments to the vocabulary are themselves recorded as lineage events.

Cold-start handling is governed by the structural floor and the inherited factor from the capability-genealogy subsystem. A newly admitted skill begins with an empty rolling history, and its initial calibration factor is set to the floor or to the inherited meet, whichever is more conservative. As outcomes accumulate, the factor is permitted to relax toward unity, but only at the policy-reference relaxation rate and never above the floor. This treatment prevents an attacker from exploiting the cold-start regime to inflate confidence on a freshly admitted skill.

Performance considerations favor a per-skill calibration store partitioned by consequence class. High-consequence skills carry small windows with aggressive tightening rates and slow relaxation, supporting rapid response to evidence of overconfidence. Low-consequence skills carry larger windows with gentler updates, supporting stable confidence in the face of normal noise. The partition boundaries and per-partition parameters are declared in the policy reference and are themselves audit-required, so an operator cannot quietly reclassify a high-consequence skill into a low-consequence partition.

Failure Modes And Mitigations

The calibration mechanism fails safely under three structural failure modes. History tampering, in which an adversary attempts to modify or remove claim-observation pairs after they have been recorded, is mitigated by the content-addressed append-only history; any tampering produces a different content address and is detectable by lineage verification. Outcome misclassification, in which an adversary attempts to label genuine failures as successes to relax the calibration factor, is mitigated by requiring that the outcome classifier be specified in the policy reference and that reclassifications be recorded as corrections.

Window-flooding, in which an adversary emits a high volume of trivially successful outcomes to displace genuine failure evidence from the rolling window, is mitigated by the consequence-class partition: high-consequence skills carry small windows that cannot be flooded without observable volume anomalies, and the floor prevents the calibration factor from rising above the structural ceiling regardless of recent positive evidence. The combination of the small window, the asymmetric rates, and the floor produces a structural property that the agent's effective confidence on high-consequence skills cannot be manipulated upward by short-term outcome injection.

Disclosure Scope

This article discloses the structural mechanism, operating parameters, and alternative embodiments of confidence history calibration as defined in Chapter 5 of the cognition patent. The disclosure is sufficient to enable a person of ordinary skill in the art to construct an embodiment without reference to the patent's underlying implementation. The patent's claims govern all embodiments that incorporate the asymmetric bounded update, the structural floor, and the audit-required calibration lineage, regardless of the specific reliability estimator, band partition, or factor application adopted.