Open provenance infrastructure for the machine-readable web. Every entity found, measured, structured, and made traversable — according to a fixed set of governing principles.
The Global Data Registry is the open provenance substrate of the AI era. Every domain the crawler discovers enters the intake pipeline and emerges as a structured, machine-readable entity record with a UUID, a timestamp, a source URL, and a content hash. Provenance on everything. No field is filled by inference.
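As a rough illustration of what such an entity record could look like, here is a minimal Python sketch. It is not the registry's actual schema or pipeline. The field names, the `build_record` helper, the choice of UUID version 4, and the use of SHA-256 for the content hash are all assumptions made for the example. The source names only the four kinds of field.

```python
import hashlib
import uuid
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class EntityRecord:
    """Hypothetical entity record; field names are illustrative, not the registry's schema."""
    entity_id: str      # UUID, rendered as a string
    retrieved_at: str   # ISO 8601 timestamp in UTC
    source_url: str     # where the content was observed
    content_hash: str   # hex digest of the raw content bytes


def build_record(source_url: str, content: bytes) -> EntityRecord:
    """Build a record from observed data only; refuse to guess a missing source."""
    if not source_url:
        # Nothing is inferred: a record without a source URL is rejected, not patched.
        raise ValueError("source_url is required")
    return EntityRecord(
        entity_id=str(uuid.uuid4()),                           # assumption: random (v4) UUID
        retrieved_at=datetime.now(timezone.utc).isoformat(),
        source_url=source_url,
        content_hash=hashlib.sha256(content).hexdigest(),      # assumption: SHA-256
    )


if __name__ == "__main__":
    record = build_record("https://example.com/", b"<html>example</html>")
    print(record)
```

The dataclass is frozen so that a record cannot be altered after it is created, which is one simple way to mirror the idea that a record's provenance fields are fixed at intake.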
The registry operates continuously. Layer 1 profiles are generated automatically and made publicly accessible. Every record is citable, traversable, and permanent. The substrate exists independent of any single intelligence querying it.
The registry operates according to a constitutional architecture: a fixed set of governing principles derived from operational practice and from research into provenance, semantic structure, and machine-readable graph design. Those principles govern every record the system produces. The constitution is not policy. It is the structural logic the pipeline enforces at every layer.
The constitutional principles and pipeline architecture are documented in the registry's published research. The methodology is open. The findings are citable. Researchers and practitioners working at the intersection of provenance, linked data, and machine-readable graph architecture are invited to engage with the published record.