ruvector/examples/benchmarks/src/bin
Claude 1c22ba99ce
feat(agi): KnowledgeCompiler Strategy Zero, StrategyRouter bandit, ablation protocol
Wire the KnowledgeCompiler in as Strategy Zero in the AdaptiveSolver
solve path: compiled constraint-signature configs are consulted before
any other strategy. Add StrategyRouter, an epsilon-greedy contextual
bandit that selects a strategy adaptively per difficulty/constraint
family.
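
For illustration, an epsilon-greedy contextual bandit of the kind
described above could look roughly like the sketch below. This is not
the repository's StrategyRouter: the names EpsilonGreedyRouter, Context
and ArmStats, the (difficulty bucket, constraint family) context key,
and the small xorshift generator (used only to avoid an external crate)
are all assumptions made for the example.

```rust
use std::collections::HashMap;

/// Hypothetical context key: (difficulty bucket, constraint family).
type Context = (u8, String);

/// Running statistics for one (context, strategy) arm.
#[derive(Default, Clone, Copy)]
struct ArmStats {
    pulls: u64,
    mean_reward: f64,
}

/// Illustrative epsilon-greedy router; not the actual StrategyRouter.
struct EpsilonGreedyRouter {
    epsilon: f64,
    n_strategies: usize,
    arms: HashMap<Context, Vec<ArmStats>>,
    rng_state: u64,
}

impl EpsilonGreedyRouter {
    fn new(epsilon: f64, n_strategies: usize, seed: u64) -> Self {
        assert!(n_strategies > 0, "need at least one strategy");
        // xorshift must not start from a zero state.
        Self { epsilon, n_strategies, arms: HashMap::new(), rng_state: seed.max(1) }
    }

    /// One xorshift64 step, mapped to a float in [0, 1).
    fn next_f64(&mut self) -> f64 {
        let mut x = self.rng_state;
        x ^= x << 13;
        x ^= x >> 7;
        x ^= x << 17;
        self.rng_state = x;
        (x >> 11) as f64 / (1u64 << 53) as f64
    }

    /// With probability epsilon pick a random strategy (explore);
    /// otherwise pick the best-known strategy for this context (exploit).
    fn select(&mut self, ctx: &Context) -> usize {
        if self.next_f64() < self.epsilon {
            let r = self.next_f64();
            return ((r * self.n_strategies as f64) as usize).min(self.n_strategies - 1);
        }
        let n = self.n_strategies;
        let stats = self
            .arms
            .entry(ctx.clone())
            .or_insert_with(|| vec![ArmStats::default(); n]);
        // Greedy choice; ties resolve to the lowest strategy index.
        let mut best = 0;
        for i in 1..stats.len() {
            if stats[i].mean_reward > stats[best].mean_reward {
                best = i;
            }
        }
        best
    }

    /// Fold an observed reward into the arm's incremental mean.
    fn update(&mut self, ctx: &Context, strategy: usize, reward: f64) {
        let n = self.n_strategies;
        let stats = self
            .arms
            .entry(ctx.clone())
            .or_insert_with(|| vec![ArmStats::default(); n]);
        let arm = &mut stats[strategy];
        arm.pulls += 1;
        arm.mean_reward += (reward - arm.mean_reward) / arm.pulls as f64;
    }
}
```

Keeping separate statistics per context is what makes the bandit
contextual: a strategy that pays off for one difficulty/constraint
family does not bias the choice for another.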

Implement three-mode ablation protocol (A/B/C):
- Mode A: baseline (no compiler, fixed router)
- Mode B: compiler only (Strategy Zero with early termination)
- Mode C: full (compiler + adaptive router)

Add run_ablation_comparison() and AblationComparison::print() with
quantitative assertions (B beats A on cost by >=15%, C beats B on
robustness by >=10%, compiler false-hit rate <5%).
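
As a rough sketch of how the three thresholds above might be checked,
the example below compares per-mode aggregates. ModeMetrics,
AblationVerdict and evaluate are hypothetical names, not the API of
run_ablation_comparison(). It also assumes that the cost and robustness
margins are relative differences and that the false-hit rate is false
hits divided by compiler hits; the commit message does not state
either.

```rust
/// Hypothetical per-mode aggregate over the holdout set.
#[derive(Debug, Clone, Copy)]
struct ModeMetrics {
    /// Mean cost per task, e.g. solver steps (lower is better).
    mean_cost: f64,
    /// Robustness score in [0, 1] (higher is better).
    robustness: f64,
}

/// Result of checking the three ablation thresholds.
#[derive(Debug)]
struct AblationVerdict {
    cost_reduction_b_vs_a: f64,
    robustness_gain_c_vs_b: f64,
    false_hit_rate: f64,
    cost_ok: bool,
    robustness_ok: bool,
    false_hit_ok: bool,
}

/// Relative change of `candidate` against `baseline`; 0.0 if baseline is not positive.
fn relative_change(baseline: f64, candidate: f64) -> f64 {
    if baseline <= 0.0 {
        0.0
    } else {
        (candidate - baseline) / baseline
    }
}

/// Assumed reading of the targets: B cuts cost by >=15% relative to A,
/// C raises robustness by >=10% relative to B, false hits / hits < 5%.
fn evaluate(
    a: ModeMetrics,
    b: ModeMetrics,
    c: ModeMetrics,
    compiler_hits: u64,
    false_hits: u64,
) -> AblationVerdict {
    let cost_reduction = -relative_change(a.mean_cost, b.mean_cost);
    let robustness_gain = relative_change(b.robustness, c.robustness);
    let false_hit_rate = if compiler_hits == 0 {
        0.0
    } else {
        false_hits as f64 / compiler_hits as f64
    };
    AblationVerdict {
        cost_reduction_b_vs_a: cost_reduction,
        robustness_gain_c_vs_b: robustness_gain,
        false_hit_rate,
        cost_ok: cost_reduction >= 0.15,
        robustness_ok: robustness_gain >= 0.10,
        false_hit_ok: false_hit_rate < 0.05,
    }
}
```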

Other changes:
- Early termination (stop_after_first) in TemporalSolver for compiled
  single-solution puzzles
- Step accumulation across Strategy Zero failures + fallback
- Promotion gating: patterns only promoted when holdout accuracy
  doesn't regress
- Compiler false_hits tracking
- --ablation flag on agi-proof-harness binary
- 81 tests passing (61 unit + 20 integration)
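
The Strategy Zero fallback, step accumulation and false_hits tracking
listed above can be sketched as follows. This is a minimal
illustration, not the AdaptiveSolver code: Attempt, CompiledConfig,
Compiler and solve_with_strategy_zero are hypothetical names, and the
two closures stand in for the compiled and fallback solver runs.

```rust
use std::collections::HashMap;

/// Outcome of one solver attempt (hypothetical shape).
struct Attempt {
    solved: bool,
    steps: u64,
}

/// Hypothetical compiled configuration for one constraint signature.
#[derive(Clone, Copy)]
struct CompiledConfig {
    max_steps: u64,
    /// Stop at the first solution (early termination).
    stop_after_first: bool,
}

/// Hypothetical compiler state: configs plus hit counters.
#[derive(Default)]
struct Compiler {
    configs: HashMap<String, CompiledConfig>,
    hits: u64,
    false_hits: u64,
}

/// Try the compiled config first. If it fails, count a false hit and
/// run the fallback, adding the steps already spent to the total.
fn solve_with_strategy_zero<F, G>(
    compiler: &mut Compiler,
    signature: &str,
    run_compiled: F,
    run_fallback: G,
) -> Attempt
where
    F: FnOnce(CompiledConfig) -> Attempt,
    G: FnOnce() -> Attempt,
{
    let mut total_steps = 0;
    if let Some(cfg) = compiler.configs.get(signature).copied() {
        compiler.hits += 1;
        let first = run_compiled(cfg);
        total_steps += first.steps;
        if first.solved {
            return Attempt { solved: true, steps: total_steps };
        }
        // The compiled config matched the signature but did not solve the task.
        compiler.false_hits += 1;
    }
    let second = run_fallback();
    Attempt {
        solved: second.solved,
        steps: total_steps + second.steps,
    }
}
```

Charging the failed Strategy Zero attempt to the total keeps cost
comparisons between ablation modes honest: a false hit makes the run
more expensive rather than disappearing from the step count.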

Ablation result (100-task holdout, 5 cycles): compiler active at a 59%
hit rate with an 8.2% false-hit rate, which is above the <5% target.
Cost and robustness targets are not yet met; the solver needs more
policy surface (step 5: PolicyKernel learning).

https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
2026-02-15 21:29:48 +00:00
File | Last commit | Date
agi_proof_harness.rs | feat(agi): KnowledgeCompiler Strategy Zero, StrategyRouter bandit, ablation protocol | 2026-02-15 21:29:48 +00:00
intelligence_assessment.rs | style: apply rustfmt across entire codebase | 2026-01-28 17:00:26 +00:00
rvf_intelligence_bench.rs | feat(benchmarks): 6-vertical intelligence benchmark with real divergence | 2026-02-15 20:08:47 +00:00
superintelligence.rs | feat(benchmarks): 5-level superintelligence pathway engine | 2026-02-15 20:16:11 +00:00
swarm_regret.rs | style: apply rustfmt across entire codebase | 2026-01-28 17:00:26 +00:00
temporal_benchmark.rs | style: apply rustfmt across entire codebase | 2026-01-28 17:00:26 +00:00
timepuzzle_runner.rs | style: apply rustfmt across entire codebase | 2026-01-28 17:00:26 +00:00
vector_benchmark.rs | feat(benchmarks): Add comprehensive temporal reasoning and vector benchmarks (#113) | 2026-01-14 21:38:34 -05:00