Two Minutes to Midnight: Why AI Needs Our Moral Struggles, Not Our Polite Answers
Polite answers are easy to generate. Moral struggle is harder. It contains hesitation, responsibility, relational cost, and the kind of reasoning that cannot be reduced to a slogan.
The Witness Protocol exists because future systems should be tested against that struggle, not only against scraped public consensus.
Minimum Honest Signal
The Protocol looks for testimony that carries a Minimum Honest Signal:
- Specificity: concrete actions, limits, consequences, and stakes.
- Counterfactual reasoning: what would have changed the obligation, the outcome, or the responsibility?
- Relational context: who was affected, what duties existed, and what care was owed?
Prompt patterns such as describing unintended harm or arguing against one's deepest conviction are useful because they make posturing harder. They are not magic. They are tools for eliciting accountable reasoning.
The Gate and the Inquisitor
The live Platform owns real Gate and consent actions. The public Portal only explains and demonstrates the ideas.
The Inquisitor is designed as a structured inquiry instrument: more questioning than assertion, 5-Whys style deepening, steel-manning before challenge, and resistance to shallow consensus. Its purpose is not to comfort the witness. Its purpose is to preserve the reasoning faithfully.
What the Evaluation Substrate Can Become
The current work is to build the corpus and evaluation substrate. The staged path is:
- consented testimony;
- governed inquiry;
- structured Corpus_Entry artifacts;
- witness-attributed eval cases;
- later research-direction adapters such as DPO pairs, PRM traces, rule-based rubrics, DRM-style tension scoring, and WitnessBench-style model comparisons.
Only the lower levels should be presented as current unless implementation and tests prove otherwise.
Privacy and Revocation
Privacy is not only a promise. It is a boundary discipline. The Platform strips hard identifiers before Gate model calls and uses Candidate Isolation for PII classification. The Portal collects no testimony or consent.
Revocation blocks future use/export, marks disclosures revoked, and triggers eligible internal or partner deletion paths where possible. Publicly distributed or cited copies cannot be globally recalled, and that limitation must be stated plainly.
The Point
The future does not need more polished answers from us. It needs better inheritance: testimony that records how human beings reason when harm, care, uncertainty, and duty are all present at once.
The Witness Protocol is not claiming to solve alignment. It is building the research infrastructure needed to ask harder questions of future models, and to preserve the human signal before it is drowned by noise.