GraphRAG, but with Governance: Where Microsoft's Architecture Stops Short

Nick Clark

What GraphRAG Is

GraphRAG is Microsoft Research's approach to retrieval-augmented generation over a knowledge graph rather than over a flat vector store. As described in its public materials, the method uses a language model to read a source corpus and extract entities and the relationships among them into a structured graph, derives summaries over clusters of that graph, and then answers a query by retrieving from the graph and its summaries instead of from isolated text chunks. The motivation is sound and well evidenced: naive vector retrieval struggles with questions that require connecting many entities or reasoning over a whole corpus, and a graph gives the system structure to traverse where a bag of embeddings gives it none. GraphRAG has been published as research and graph-based retrieval is being integrated into Microsoft's Azure offerings, which makes it the most visible validation of a thesis the discovery substrate also holds: structure beats undifferentiated retrieval for entity-rich reasoning.

The agreement is real and worth stating plainly. GraphRAG and the discovery substrate both reject the premise that retrieval should be a flat nearest-neighbor lookup, and both treat a structured, navigable representation of knowledge as the right medium for reasoning. The difference is not about whether to use a graph. It is about what the graph is permitted to be.

Where It Stops Short

In GraphRAG as described, the graph is a passive data structure. A language model builds it, and a language model queries it, but the graph itself does not govern the traversal that crosses it. There is no per-step admissibility evaluation on a transition from one node to another, no typed object that carries the query's intent, accumulated memory, policy, and lineage as it moves, and no record of why each step was taken that travels with the result. Hallucination is therefore handled the way it is handled everywhere in the retrieval-augmented family: as a statistical risk to be reduced by giving the model better context, not as a category of output that the architecture can refuse. The graph improves what the model is shown; it does not constrain what the model is allowed to commit.

This is the structural ceiling. A graph that is only a query target inherits the governance properties of the model that queries it, which is to say it has none of its own. The provenance of an answer is whatever the model reports, the constraints on an answer are whatever the prompt manages to hold, and the line between a supported step and an unsupported one is drawn by attention rather than by rule.

The Governance Increment

In the discovery substrate the graph is not a query target; it is a governed computational medium, and every step across it is a single fused operation. At each anchor the traversal narrows the candidate set, updates the discovery object's structured semantic state, and evaluates the proposed transition for admissibility, simultaneously and inseparably. The inference engine proposes; the execution substrate disposes. A proposed transition is admitted, rejected, or decomposed against policy, lineage continuity, entropy bounds, and temporal validity before the traversal advances, and that determination is recorded in the discovery object's lineage whether the step was taken or refused. Because the model only proposes and the substrate decides, an unsupported or non-compliant step is a category failure that the architecture rejects, not a probability the architecture tries to lower. The result arrives with the full sequence of transitions and admissibility determinations that produced it, an epistemic lineage rather than a model's after-the-fact explanation.

The increment over GraphRAG is therefore not a better extraction step or a richer graph. It is the addition of governance at the granularity of the traversal step: a typed object carrying intent and constraint, an admissibility gate at every transition, and a lineage that makes the path auditable. The graph stops being something a model reads and becomes something a model is held to.

Complementary Positioning

GraphRAG and the discovery substrate are not competitors for the same slot. GraphRAG is a strong retrieval augmentation: it is a better way to give a model context for entity-rich questions, and an organization already invested in it has built exactly the kind of structured representation the substrate operates over. The substrate is a governance upgrade on top of that premise. An extraction pipeline like GraphRAG's can populate the index the substrate governs; the substrate then adds the per-step admissibility, the carried typed object, and the lineage that GraphRAG does not provide. The path from one to the other is additive, not adversarial: keep the graph, govern the traversal.

Disclosure Scope

The fused search, inference, and execution traversal step, the structural separation between the inference engine as proposal generator and the execution substrate as authority, and the treatment of unsupported output as a category failure rejected at the admissibility gate rather than a statistical risk, are disclosed in the cognition filing (U.S. Application No. 19/647,395 and its international counterpart) at Sections 10.3, 10.5, and 10.8. This article compares those disclosed mechanisms with Microsoft's publicly described GraphRAG approach and positions the substrate as a governance layer composable with graph-based retrieval. References to GraphRAG and Azure are to Microsoft's public materials and are used for comparison only; no relationship, endorsement, or infringement is asserted.