The Training Loop as a Governed Execution Environment

In a conventional training system the training loop is an ungoverned optimization process. Data is sampled from a corpus, forward and backward passes compute gradients, and the optimizer applies parameter updates with no intermediate admissibility evaluation. Every training example contributes to every layer's parameter updates with equal structural authority. There is no mechanism to evaluate whether a given example should be permitted to influence the parameters, no mechanism to control the depth at which its contribution is integrated, and no mechanism to record which examples influenced which capabilities.

The disclosure reconceives the training loop as a governed execution environment in which each training iteration constitutes a proposed semantic mutation to the model's accumulated knowledge state. The same semantic execution substrate that governs inference-time behavior is positioned at the training-loop boundary, between the forward-pass loss computation and the backward-pass gradient application. Gradients are computed as in conventional training, but the gradient signal is modulated, gated, or selectively routed across model depth based on an admissibility determination produced by the substrate. The substrate does not alter the mathematical machinery of gradient computation or optimizer updates. It governs which gradient signals reach which layers and with what magnitude, based on the semantic properties of the content that produced them. This mirrors inference-time governance: in both cases the substrate operates as a governed boundary between proposal and commitment, evaluates proposals against policy constraints and lineage requirements, and records a provenance trail of every decision.

Training Examples as Proposed Semantic Mutations

Each training example is treated as a proposed semantic mutation to the model's knowledge state, in the same sense the term carries elsewhere in the disclosure: a structured modification to a governed state that must be evaluated for admissibility before commitment. This treatment produces three consequences. First, non-training, the refusal to integrate an example into the parameters, is a valid computational result rather than an error condition. The substrate may determine that an example is inadmissible because its policy constraints prohibit integration, because its entropy characteristics are incompatible with the current training phase, or because its provenance metadata fails validation, and may reject it, producing an iteration in which the parameters are not updated. The rejection is recorded in the training provenance log.

Second, each example must carry semantic metadata sufficient for the substrate to render a determination. The metadata comprises at minimum an entropy band classification derived from the platform's entropy extraction pipeline, a slope position indicating the content's position within the trust-slope hierarchy, a content provenance record identifying source and chain of custody, and a policy scope identifying the governance constraints that apply. An example consisting solely of raw content without such metadata is inadmissible by default. Third, the training corpus itself becomes a governed collection of semantically annotated objects rather than an undifferentiated mass of data, which is the architectural precondition for everything that follows.

The substrate's evaluation is not limited to binary admit or reject. It may render graded determinations that specify not merely whether an example is admissible but how its contribution should be weighted, routed, and distributed across the model's depth: admitted for shallow integration but excluded from deep integration, admitted at full depth for one set of layers and suppressed for another, or admitted with a reduced contribution magnitude that attenuates its influence relative to examples admitted at full magnitude.

Entropy-Band-Indexed Depth Profiles

Each entropy band recognized by the platform's entropy extraction pipeline is associated with a training depth profile that governs how content from that band is selectively weighted across the layers of the model. The entropy band classification is derived from the semantic entropy of each example, the information-theoretic divergence of the example's semantic embedding distribution relative to the model's current representational state, computed using a KL-divergence or Jensen-Shannon divergence metric. Examples with low semantic entropy, content well-represented in the model's existing knowledge, receive shallow depth profiles. Examples with high semantic entropy, content introducing novel semantic structure, receive deep depth profiles.

The depth profile is a structured data object comprising a per-layer or per-block contribution weight vector. For a model of L layers or B blocks, the profile specifies a weight value for each layer or block governing the magnitude of the gradient signal from the associated example permitted to influence that layer's parameters. A weight of one permits the full gradient signal; a weight of zero prevents any gradient signal from reaching the layer; a value between zero and one attenuates the signal by that factor; a value greater than one amplifies it. The vector collectively defines the shape of the example's contribution across the depth dimension.

The association between entropy bands and profiles is not fixed prior to training and held unchanged. It is derived from the slope-band structure and adapted as the model's internal entropy distribution evolves. During early training, when representations are undifferentiated, profiles may be broad: examples from all bands contribute to all layers with approximately uniform weighting. As training progresses and representations stratify, profiles narrow: low-entropy content is increasingly weighted toward shallow layers and high-entropy content toward deep layers. A profile adaptation engine monitors the model's internal entropy distribution at defined checkpoints, for example by computing the entropy of activation distributions at each layer for a held-out set, and adjusts the profiles to maintain alignment between the corpus entropy band structure and the model's internal entropy structure. By directing high-entropy content toward deep integration the system promotes the development of deep representations encoding complex knowledge, and by restricting low-entropy content to shallow integration it preserves deep representational capacity for content that requires multi-step abstraction.

Depth-Selective Aggregation Mechanics

Depth-selective aggregation is the mechanism by which the depth profiles are applied to the gradient signal during training. It operates at each layer transition during the backward pass, modulating the gradient signal for each example according to the profile associated with that example's entropy band. It is distinguished from layer-wise aggregation techniques developed for federated learning, which assign different weights to different layers across multiple model instances being merged. This mechanism does not aggregate multiple models. It governs the depth at which a single example's gradient contribution is integrated into a single model's parameters.

The mechanism is implemented through one or more of three complementary techniques. The gated residual connection technique augments each residual connection with a gating coefficient derived from the depth profile, multiplying the gradient flowing through the residual pathway during the backward pass; a coefficient of zero prevents the example's gradient from influencing that layer through the residual pathway, and a coefficient of one permits full flow. The attention-based depth selection technique modulates the attention computation within transformer architectures by supplying each layer's attention computation with the depth-profile weight for the current example, scaling the gradient that reaches the attention weights and value projections during backpropagation. The layer-specific scaling factor technique is architecture-agnostic: it applies a scalar multiplier to the gradient signal at each layer boundary during the backward pass before the gradient is accumulated, requiring only the ability to intercept and scale the gradient and thus applying to convolutional, recurrent, mixture-of-experts, and hybrid architectures. In each technique the forward pass is unaffected, so the depth-selective mechanism does not alter inference behavior.

The mechanism operates at block-level granularity rather than per individual layer, grouping contiguous layers that perform a coherent computational function into blocks and specifying per-block weights, which balances expressiveness against the overhead of profile evaluation in networks of many layers. Per-example scaling occurs after the per-example gradient computation and before batch-level accumulation, so each example's contribution to each block is individually governed. The mechanism is compatible with stochastic gradient descent, Adam, AdamW, and their variants: it alters the gradient signal the optimizer receives, not the optimizer's update rule. In an illustrative batch of three examples, a factual update classified as surface-level content is routed only to upper layers for rapid acquisition without disturbing deeper structure, a behavioral pattern example is routed to middle layers, and a core value alignment example is routed to the deepest layers; layers outside an example's assigned band receive zero gradient contribution from that example, preventing cross-depth interference.

Policy-Governed Retention and Suppression

The depth-selective aggregation mechanism is integrated with the platform's rights-grade content governance so that the same policy objects that govern agent behavior and inference-time admissibility govern the depth and magnitude of knowledge integration during training. Content admitted under time-limited licensing is trained with a suppressed depth profile, one in which the contribution weights for deeper layers are zero or near-zero, confining the example's influence to shallower layers. Content that may need to be de-emphasized or removed in the future should not be deeply encoded, where removal would require invasive procedures such as full retraining; confining it to shallow layers keeps its influence structurally separable so that de-emphasis can be achieved through targeted shallow-layer adjustment rather than model-wide intervention. Content from the governed exclusion corpus is structurally prevented from any integration through a zero-weight depth profile that sets the contribution weight to zero at every layer, the training-time analog of the inference-time rejection determination, recorded in the provenance log as a governed event.

This distinguishes the approach from post-hoc unlearning. Post-hoc unlearning operates after training, approximating the influence of content that should not have been learned and applying corrective updates intended to remove it; it is inherently approximate because the influence of any single example is diffused across millions or billions of parameters through the non-linear dynamics of optimization. The disclosure implements structural prevention instead: content whose governance profile restricts deep integration is prevented from deep integration at training time, before the gradient reaches the deep layers. There is no need to unlearn what was never deeply learned. The prevention is exact rather than approximate, deterministic and auditable, and reversible by changing the depth profile, whereas post-hoc unlearning is stochastic, approximate, and irreversible.

The policy objects consulted for depth profiles are the same objects that govern agent behavior throughout the platform. When a policy is active the substrate applies the suppressed profile it specifies; when the policy expires or is revoked by the content owner the substrate applies a zero-weight profile and records the event. The system supports hierarchical policy resolution: when multiple policies apply to one example the substrate applies the most restrictive, and the resolution is deterministic and recorded for post-training audit. The result is a model whose internal knowledge structure reflects the governance constraints under which it was trained: freely licensed content encoded deeply and durably, restrictively licensed content encoded shallowly and separably, excluded content not encoded at all.

Provenance-Traceable Training Dynamics

The substrate records a comprehensive provenance trail for every training iteration, producing a training provenance log that enables post-training analysis to reconstruct which content influenced which capabilities at which depths. It is the training-time analog of the lineage field maintained for semantic agents, inference processes, and discovery traversals. For each batch or example the log records the entropy band classification, the slope position, the depth aggregation profile that was applied, the per-layer contribution weight that actually reached each block after modulation, a governance record identifying the policy objects that authorized admission and determined the profile, a content provenance record, and an admissibility determination record indicating whether the example was admitted, rejected, or admitted with a modified profile and the reason for any modification.

The log is structured as a chronologically ordered, append-only record. Each entry is timestamped, sequentially numbered, and annotated with the epoch, iteration, and batch index. The append-only structure makes it tamper-resistant: entries cannot be retroactively modified, deleted, or reordered without producing detectable inconsistencies in the numbering and timestamp sequence, and the log may be periodically sealed using the cryptographic sealing infrastructure of the cross-referenced governance disclosure to produce tamper-evident checkpoints for third-party verification.

The log supports two query forms. A forward query begins with an example or class of content and traces the profile, weights, and decisions that governed its integration, producing a record of which blocks were influenced and with what magnitude. A reverse query begins with a behavior observed at inference time and traces backward to identify the content whose profiles encompassed the blocks active during the behavior. The reverse query does not definitively attribute behavior to specific content, since the non-linear dynamics of optimization preclude exact attribution, but it identifies the set of content structurally permitted to influence the relevant blocks, a bounded attribution set substantially narrower than the full corpus. The log thereby supports compliance auditing: when a content owner asks whether their content was used, the log gives a definitive answer with the provenance record and weights, and when a regulator requires evidence that restricted content was not deeply integrated, the depth profile records demonstrate the confinement.

Training-Level Memorization Detection

The provenance log and depth-selective aggregation records enable a memorization detection mechanism that identifies when model output exhibits high similarity to a training artifact and determines exactly where and how deeply the similar content was integrated. When output is flagged, by the rights-grade governance layer, by an external content identification service, or by a human reviewer, as similar to a known artifact, the detection module initiates a reverse provenance query, identifies the corresponding examples, and retrieves their depth profiles, contribution weights, and governance records, determining which blocks the content was permitted to influence, the gradient magnitude that reached each, the entropy band and policy scope under which it was admitted, and whether it was admitted under a suppressed or full-depth profile.

The module produces a structured assessment classifying the similarity into one of three categories. Shallow memorization means the content was trained with a suppressed profile confining its influence to shallower layers, indicating similarity that is a consequence of shallow pattern matching rather than deep conceptual encoding, the expected outcome when time-limited or rights-restricted content is properly governed. Deep memorization means the content was trained with a full-depth or deep-weighted profile, which may reflect policy-compliant deep integration of freely licensed content or may indicate a governance failure in which depth-restricted content was inadvertently trained at full depth. Absent memorization means the log contains no record of the content, indicating the similarity is not a consequence of direct training on the artifact. The assessment is reported to the rights-grade governance layer so the inference substrate can incorporate training-time provenance into its admissibility determination: shallow memorization of properly governed content may be permitted with an attribution annotation, deep memorization of content that should have been restricted may be suppressed with a governance alert, and absent memorization is treated as coincidental under standard evaluation.

Curriculum-Integrated Scheduling and Inference-Time Governance

The curriculum engine and the depth-selective mechanism combine into a two-dimensional control framework. The curriculum engine governs the temporal dimension, the order in which examples from each entropy band are presented across epochs, and the depth profiles govern the spatial dimension, how deeply each example is encoded. This is distinguished from conventional curriculum learning, which orders examples by difficulty to improve convergence; here examples are scheduled by semantic governance properties, entropy band, policy scope, and provenance, for controlled knowledge integration. An example may be deferred not because it is difficult but because its policy scope is not yet authorized for the current phase. Scheduling proceeds through phases: an initial phase with broad exposure and approximately uniform weights to establish foundational representations, an intermediate phase in which presentation becomes entropy-band-sequenced and profiles begin to narrow, and an advanced phase dominated by high-entropy content with concentrated deep-block weights that refines abstract representations while protecting the shallow-layer specialization established earlier. Phase transitions are triggered by the profile adaptation engine's assessment of the model's internal entropy distribution rather than a fixed epoch count.

The training provenance records are made available to the inference-time substrate, closing the governance loop. When the inference substrate evaluates a candidate transition for admissibility it may query the provenance log to determine the training-time governance profile of the knowledge grounding the transition, identifying the content whose profiles encompass the blocks most active during generation, together with the policy objects that governed its admission, the profile applied, and any temporal validity constraints. If that content was admitted under a policy that has since expired the substrate may treat the transition as grounded in stale governance and apply heightened scrutiny or reject it; if the policy was revoked it may reject the transition and raise an alert; if the content was admitted under a suppressed profile it may require attribution before admitting. The result is output that is doubly governed: the knowledge was governed at the point of integration and its expression is governed at the point of emission, a continuous governance chain that is traceable, auditable, and enforceable from corpus through parameters to output.

Disclosure Scope

The mechanisms described here, the training loop reconceived as a governed execution environment in which each example is a proposed semantic mutation, entropy-band-indexed depth profiles expressed as per-layer or per-block contribution weight vectors, the depth-selective aggregation mechanics realized through gated residual connections, attention-based depth selection, or architecture-agnostic layer-specific scaling factors, policy-governed retention and suppression through suppressed and zero-weight profiles as a structural alternative to post-hoc unlearning, the append-only training provenance log and its forward and reverse queries, the shallow, deep, and absent memorization classification, the curriculum-integrated two-dimensional control framework, and the integration of training provenance into inference-time admissibility, are disclosed in the cognition filing (U.S. Application No. 19/647,395 and its international counterpart). This article describes that disclosed architecture and method. The disclosure is at the level of architecture and method, not of any specific deployment; specific layer-band assignments, profile encodings, policy schemas, and thresholds are implementation parameters chosen by an operator under their own regulatory and engineering constraints.