mirror of
https://github.com/ruvnet/RuVector.git
synced 2026-05-25 06:36:37 +00:00
Three-fix iteration based on ablation diagnostics: 1. Bounded trial: Strategy Zero now caps trial budget at min(avg_steps*2, external_limit/4) with floor of 10 steps. Makes false hits cheap (max 100 steps overhead instead of full compiled budget). 2. Confidence gating: Strategy Zero only attempts when config confidence >= 0.7 (Laplace-smoothed success rate). Compiled observations from training seed initial confidence so configs start trusted. 3. 2-failure quarantine: any compiled signature with 2+ false hits is disabled (expected_correct=false). Prevents persistent bad patterns. Additional changes: - Versioned signature prefix (v1:difficulty:constraints) for cache safety across refactors - CompiledSolveConfig gains avg_steps, observations, confidence(), trial_budget() methods - KnowledgeCompiler gains steps_saved tracking, confidence_threshold, print_diagnostics() for per-signature analysis - record_success now tracks actual steps for delta-cost calculation - Verbose mode prints full compiler diagnostics after each ablation Results: false hit rate dropped from 8.2% to 4.4% (PASS). Cost still net-positive because constraint-determined search ranges are 1-10 dates — structurally no room for compiler optimization. Next: PolicyKernel constraint ordering for real cost surface. 81 tests passing. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G |
||
|---|---|---|
| .. | ||
| bin | ||
| acceptance_test.rs | ||
| agi_contract.rs | ||
| intelligence_metrics.rs | ||
| lib.rs | ||
| logging.rs | ||
| loop_gating.rs | ||
| reasoning_bank.rs | ||
| rvf_artifact.rs | ||
| rvf_intelligence_bench.rs | ||
| superintelligence.rs | ||
| swarm_regret.rs | ||
| temporal.rs | ||
| timepuzzles.rs | ||
| vector_index.rs | ||