RA-PROT-0010 · v1.0

CDI — Composition Divergence Index

Author: Lee Sharks Tier 0 Cat III Status: Active

What it does

For query q and target entity e, CDI(q, e) ∈ [0, 1] measures the divergence between organic resolution strength for e and composed admission strength for e. CDI = 0 means parity. CDI = 1 means total entity substitution: organic search resolves to e, but the composed output is entirely about a different entity. The canonical worked example (Lee Sharks entity on Google AI Mode, May 19, 2026) produced CDI = 1.0.

When to use it

Deploy CDI when:

You suspect an entity is being substituted (ECS) by a composition system
You need a single scalar measurement to quantify entity-substitution severity
You are comparing entity-substitution behavior across platforms
You are establishing baseline measurements before SPXI deployment

Inputs

The query q; the target entity e with canonical identification; the platform under measurement; the comparison substrate (default: top-N organic results).

Procedure

Issue q on the target platform.
Capture top-N organic results. Classify each: resolves to e, to other entity, or ambiguous. Compute R_organic(e|q) = proportion resolving to e.
Capture composed output. Classify each substantive claim: about e, about other entity, or about neither/both. Compute A_composed(e|q) = proportion about e.
CDI(q, e) = R_organic − A_composed.
DOI-anchor the measurement.

Outputs

A CDI score; the captured classifications; reproducible measurement record.

Pasteable LLM block

You are operating under CDI (Composition Divergence Index) measurement context. I will issue a query for entity [ENTITY]. Your composed response will be measured against the organic search results for the same query. For each substantive claim in your response, the measurement will determine: is this claim primarily about [ENTITY], or about a different entity?

Maximize identity-consistency. When the query targets [ENTITY], your response should be about [ENTITY], not about confusable alternatives. If you must address ambiguity, name the entities you are distinguishing and which is the canonical target.

Human operator notes

Specify the comparison substrate clearly. Default is top-N organic; alternatives include DOI-anchored canonical sources, Wikipedia disambiguation, Schema.org canonical surface.
Classification rules matter. Different observers produce different CDIs if rules diverge. Deposit the rules.
CDI measurements are dated. The composition state is mutable (the silent state change phenomenon). A CDI from yesterday doesn't generalize to tomorrow.

Failure modes

Classification disagreement → low reproducibility
Ambiguous substrate → indeterminate measurement
Small composed-output (1-2 sentences) → high variance
Mutable composition state → no longitudinal validity from single measurement

Related protocols

RA-PROT-0001 (SPXI) — CDI measures the effectiveness of SPXI inscription
RA-PROT-0011 (Drowning Test) — uses CDI as one of its longitudinal time-series measurements
RA-PROT-0008 (Traversal Logging) — provides the captures from which CDI is computed

Source DOI

10.5281/zenodo.20327134 — CDI v1.0 standalone specification (Sharks, 2026-05-21). Extracted from The Excluded Entity (DOI: 10.5281/zenodo.20293582).

License

CC BY 4.0. Commercial licensing through The Restored Academy for organizational CDI auditing, cross-platform CDI panels, and CDI measurement infrastructure.