ruvector/examples/benchmarks
Claude ccfc386ac3
feat(ablation): publishable RVF acceptance test with SHA-256 witness chain
Add self-contained acceptance test artifact that external developers can
run offline and reproduce identical graded outcomes:

- SHA-256-linked witness chain: every puzzle decision (skip_mode,
  context_bucket, steps, correct) hashed into a tamper-evident chain.
  Changing any single bit invalidates everything downstream.

- Deterministic replay: frozen seeds → identical puzzles → identical
  solve paths → identical chain_root_hash. Two runs with the same
  config produce the same hash, proven by test.

- JSON manifest: config, per-mode scorecards (A/B/C), all six ablation
  assertions with measured values, full witness chain, chain root hash.

- Verifier: re-runs with same config, recomputes chain, compares root
  hash. Mismatch means non-identical outcomes.

- CLI binary: `acceptance-rvf generate -o manifest.json` to produce,
  `acceptance-rvf verify -i manifest.json` to verify.

66 lib tests + 20 integration tests pass.

https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
2026-02-15 23:51:04 +00:00
..
src feat(ablation): publishable RVF acceptance test with SHA-256 witness chain 2026-02-15 23:51:04 +00:00
tests feat(generator): posterior-targeting puzzle generation, weekday skipping PolicyKernel 2026-02-15 22:31:12 +00:00
Cargo.toml feat(ablation): publishable RVF acceptance test with SHA-256 witness chain 2026-02-15 23:51:04 +00:00