Validation

Validation &
Claim Boundaries

The bench is designed around a simple principle. The system structures judgement. It does not replace it.

§ 01 / Permanent

Permanent claim boundaries

These are not phase boundaries. They will not be lifted in a later version.

⊘No readiness scores
⊘No diagnostic findings
⊘No risk ratings
⊘No compliance determinations
⊘No legal conclusions
⊘No deployment approval
⊘No liability assessments

§ 02 / v0.2

v0.2 implementation boundaries

These describe the current technical posture of the Mode 2 demo. They may change in later versions, by deliberate design choice and with explicit consent flows.

Local-only

Browser-based

No backend

No lead capture

No data transmission

No telemetry

§ 03 / Inference

No autonomous inference on the customer-facing surface

The Deployment Judgement Snapshot does not use an LLM to interpret participant responses. It structures participant-provided context into a reviewable snapshot.

This is deliberate. The bench is designed to preserve human judgement rather than simulate it. Customer-facing outputs remain bounded by question structure, answer metadata, schema, and participant-provided context.

Agents may support research, scenario generation, and build-time tooling. They do not generate customer-facing conclusions in the current Mode 2 surface.

§ 04 / Checks

Validation checks

The current Mode 2 package includes local validation scripts that check the following properties of any snapshot output:

Schema compliance
Output conforms to the snapshot schema. No fields outside the contract.
Prohibited fields
No score, rating, grade, classification, risk level, or compliance status fields.
Claim-language leakage
No diagnostic, evaluative, or approval language in copy or output.
Free-text preservation
Participant responses are reflected verbatim — not summarised, classified, or reinterpreted.
Answer classification metadata
Classifications are user-assigned, not system-derived.
Mode 3 absence
No Mode 3 evidence-scaffold material is present in Mode 2 output.
Local-only operation
No network requests during snapshot generation.

§ 05 / Artefacts

Artefacts under pilot / NDA

The following artefacts are available in pilot or partner contexts because they contain the working machinery: schemas, claim boundaries, validation reports, scenario logic, and scaffold specifications. The public site explains the method; partner review exposes the instrument.

Mode Boundary Contract
Snapshot Output Schema
Claim Audit
Validation Report
Manual Demo Test Plan
Mode 1 Simulator (private)
Mode 3 Evidence Scaffold Specification
Wind tunnel pattern library

Boundary

If you are a buyer wondering why this page exists in this much detail: it exists because the distinction between an instrument that helps an organisation see itself and a tool that pretends to see it for them is the only distinction that matters. Everything else is decoration.

Discuss a pilot