Hierarchical Bits — Presentation Pitch

In one sentence: a representation model where multiple — possibly contradictory — interpretations share one immutable substrate and stay queryable, without duplicating the data and without forcing them into a single truth.


SLIDE 1 — The problem

Don't duplicate the world every time someone disagrees with it.

Start from a common base — one dataset, one building, one document set, one conversation history. Over that base, multiple interpretations naturally arise: annotators label it, disciplines read it differently, hypotheses compete over it, versions accumulate. They coexist.

Today, holding them forces a bad choice:


SLIDE 2 — The idea (a model, not a file format)

BH is a representation model for shared immutable substrates and concurrent interpretations.

One substrate, stored once and immutable. Each interpretation is a first-class, co-registered entity over it. The reader picks the lens at read time — adjudication is deferred and optional, never baked in.

SUBSTRATE   stored once, immutable, shared by every reading
LAYERS      each interpretation is a first-class, co-registered entity
READINGS    one lens / the matrix / the majority / the disagreement
            — your choice, at read time, not baked in

We give this property a working name — the First-Class Interpretation Representation (FCIR): interpretations kept as persistent, addressable, first-class entities over a shared substrate, rather than temporary versions or conflicts to be resolved away. (Working name — see Slide 4.)


SLIDE 3 — The distinguishing test (falsifiable)

Given two interpretations that disagree about the same element — can both remain, neither marked wrong, until a reader chooses (or declines) to adjudicate?

Many systems end up converging to a dominant representation, or isolating each interpretation into an independent copy/version. BH keeps them co-registered over one substrate and lets adjudication wait. That is the differentiator — stated as a test you can run on any system, not a boast.


SLIDE 4 — What we are NOT (the honest positioning)

We surveyed 20 data domains. The result killed the easy claim that "BH is universally new":

Storing the substrate once + reading it selectively is already mature SOTA — DICOM, COG/STAC, lakeFS, CRAM/tabix, S-LoRA, MAM. BH does not claim to invent that.

Our 20-domain sweep identified the First-Class Interpretation Representation (FCIR) — keeping rival readings as preserved entities instead of resolving them away — as the property that best distinguishes BH from the approaches evaluated. That is a result of the investigation, not a universal claim; FCIR is a working name, and the property may yet prove broader. Saying plainly what BH is not — and how far the claim reaches — is what survives the skeptical engineer.

And FCIR is not unique to BH: RDF named graphs and standoff annotation already implement it in their own domains. So FCIR is a synthesis and a cross-domain name, not a new mechanism — judged as a synthesis, not an invention (full confrontation in BH_PRINCIPLE.md).


SLIDE 5 — Where it fits, and where it doesn't (the useful limit)

FITS     multiple readings of ONE base object:
         · annotation with annotators who disagree
         · agent memory with conflicting versions over one history
         · BIM/CAD — disciplines reading one building (not five copies)
         · legal / eDiscovery — rival readings of one document set
         · science — competing hypotheses over the same raw data

DOESN'T  dense signal (photo / audio / embeddings) → delegate to codecs
         single-truth goals (consensus, gold labels) → already solved

A pitch that states its own limit is the opposite of vapor.


SLIDE 6 — The evidence (the principle is reproducible)

The same model showed up — independently — in four completely different domains. That reproducibility matters more than any single number:

instance domain the same model, instantiated
bhanno rival annotations the purest: K labelings coexist, adjudication optional
bhmem agent memory conflicting versions over one history
bhckpt model checkpoints alternative readings of one shared base
bhtrace traces competing lenses over one span tree

One principle, four instances, each measured and tested — correctness as a gate, honest baselines, public self-corrections, a Zenodo DOI. The numbers exist (4.6×, 35×, 1,779×, 9×); the point is that the principle held every time.


SLIDE 7 — The state and the ask


Don't duplicate the world every time someone disagrees with it.

Hierarchical Bits · © 2025–2026 Márcio M. Carvalho · code under Apache-2.0, docs under CC BY 4.0 · repository