Narrow Highway · Empirical · Live

The mechanism, measured

Every prompt in the eval set run through two modes — unfiltered LLM and the four-gate mechanism. Both responses, every gate, every verifier, every metric. Open.

loading latest run…


Narrow Highway · Benchmark · Phase 2 of the mechanism build
Eval set: data/eval/prompts_v1.jsonl · Run: tools/run_benchmark.py
Results published to site/benchmark/latest/ on every run.