- Add release-rvf-cli.yml: builds standalone binaries for Linux x64/ARM64,
macOS x64/ARM64, and Windows x64 on tag push (rvf-v*)
- Creates GitHub Release with all binaries and SHA256 checksums
- Update CLI README with install instructions for pre-built binaries,
examples/rvf/output/ usage guide, and full command reference
Co-Authored-By: claude-flow <ruv@ruv.net>
Three fixes:
1. locking.rs: __errno_location is Linux-only; macOS uses __error().
Split the extern "C" declarations by target_os so rvf-runtime
compiles on both platforms.
2. build-rvf-node.yml: NAPI CLI outputs index.<platform>.node instead
of rvf-node.<platform>.node. Added rename step after build.
3. build-rvf-node.yml: darwin builds need -undefined dynamic_lookup
RUSTFLAGS so NAPI symbols resolve at runtime via Node.js.
Added CARGO_TARGET_*_APPLE_DARWIN_RUSTFLAGS env vars.
Co-Authored-By: claude-flow <ruv@ruv.net>
The copy step was failing with "cp: 'X' and 'X' are the same file" because
committed binaries in npm/ subdirs matched the find pattern. Added -maxdepth 1
to only find freshly built files and realpath comparison before cp.
Co-Authored-By: claude-flow <ruv@ruv.net>
The workspace member crates/rvf/rvf-adapters/claude-flow was listed in
Cargo.toml but gitignored, causing CI builds to fail with:
"failed to load manifest for workspace member rvf-adapters/claude-flow"
Co-Authored-By: claude-flow <ruv@ruv.net>
- Fix WASM glue: detect Node.js properly instead of relying on fetch()
(fetch on file:// URLs fails in Node.js 18-21)
- Support both CJS require() and ESM import via exports map
- Add .mjs ESM wrapper for dual-format support
- Remove "type": "module" for CJS compatibility
- Bump rvf-wasm to 0.1.5
- Add build-rvf-node.yml CI workflow for cross-platform NAPI builds
(linux-x64-gnu, linux-arm64-gnu, darwin-x64, darwin-arm64, win32-x64-msvc)
- Fix wasm-dedup-check CI: use --ignore-scripts --omit=optional to avoid
EBADPLATFORM errors from platform-specific workspace packages
Co-Authored-By: claude-flow <ruv@ruv.net>
- Build NAPI native addon (linux-x64-gnu, 1.3MB) and WASM binary (42KB)
- Fix NodeBackend to use RvfDatabase class instance methods instead of module-level functions
- Fix WasmBackend to use C-ABI store functions with integer handles
- Add platform loader (index.js) and TypeScript declarations (index.d.ts)
- Create JS glue and type declarations for WASM package
- Set up platform-specific npm packages for all 5 targets
- Bump rvf-node/rvf-wasm to 0.1.4, SDK to 0.1.6
- Fix version pins from 0.1.0 to ^0.1.4
Resolves: rvf-node and rvf-wasm published as empty stubs with no binaries
Verified: E2E test passes (create -> ingest -> query -> status -> close)
Co-Authored-By: claude-flow <ruv@ruv.net>
- Add introductory paragraph explaining RVF's structural security model
- Expand Security & Trust tables with TEE attestation, KernelBinding,
adversarial hardening details
- Upgrade Security Hardening from bullet list to defense table
- Add live_boot_proof as example #45, update counts to 46
Co-Authored-By: claude-flow <ruv@ruv.net>
- rvf-launch: add missing retrieval_quality field to SearchResult
- rvf-node: add match arms for new Security/QualityBelowThreshold error variants
- rvf-node: use struct update syntax for new QueryOptions fields
- rvf-runtime: add missing domain_expansion_present field in tests
Co-Authored-By: claude-flow <ruv@ruv.net>
Plans the integration path for .rvf acceptance test verification into
the npm ecosystem:
- npx ruvector rvf verify-witness <file.rvf> (N-API + WASM fallback)
- npx rvlite verify-witness <file.rvf> (WASM via cli-rvf.ts)
- rvlite SDK verifyWitnessChain() for browser-side verification
- MCP tool rvf_verify_witness for Claude Code agents
- 5-phase implementation plan, each independently shippable
Bridges the rvf_witness_verify WASM export (ADR-037) to end users
without requiring the Rust toolchain.
https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
Add self-contained acceptance test artifact that external developers can
run offline and reproduce identical graded outcomes:
- SHA-256-linked witness chain: every puzzle decision (skip_mode,
context_bucket, steps, correct) hashed into a tamper-evident chain.
Changing any single bit invalidates everything downstream.
- Deterministic replay: frozen seeds → identical puzzles → identical
solve paths → identical chain_root_hash. Two runs with the same
config produce the same hash, proven by test.
- JSON manifest: config, per-mode scorecards (A/B/C), all six ablation
assertions with measured values, full witness chain, chain root hash.
- Verifier: re-runs with same config, recomputes chain, compares root
hash. Mismatch means non-identical outcomes.
- CLI binary: `acceptance-rvf generate -o manifest.json` to produce,
`acceptance-rvf verify -i manifest.json` to verify.
66 lib tests + 20 integration tests pass.
https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
Fixed policy sign flip (Mode A):
risk_score = R - 30*D (was R + 30*D)
Distractors now reduce effective range, making Mode A conservative
under distractors. This is the defensible control arm: a rational
fixed agent should be more cautious when distractors are present.
Mode C must learn to outperform this baseline.
EarlyCommitPenalty wired into bandit reward:
SkipModeStats now tracks early_commit_penalty_sum per arm.
reward() includes robustness_penalty = 0.2 * avg_penalty.
This means Mode C can actually learn to avoid early wrong commits
in distractor-heavy contexts. Previously the penalty was only
printed, not optimized.
Context buckets expanded to 18:
3 range (small/medium/large) × 3 distractor (clean/some/heavy)
× 2 noise (clean/noisy) = 18 buckets.
Previous: 4 range × 2 distractor = 8 (too coarse for bandit).
Noise flag now flows through AdaptiveSolver.noisy_hint.
New ablation assertion:
c_penalty_better_than_b: Mode C EarlyCommitPenalty must be ≤90%
of Mode B penalty. Proves robustness improvement is explicit,
not just noise_accuracy-based.
Acceptance test noise plumbing:
solver.noisy_hint set to true for noisy puzzles in both training
and holdout evaluation. Context buckets now correctly distinguish
clean vs noisy conditions.
81 tests passing (61 lib + 20 integration).
https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
PolicyKernel refinements:
- Fixed policy (Mode A): risk_score = R + k*D, k=30, T=140
Fixed constants (not learned) — Mode A is the control arm.
One distractor raises perceived risk by ~30 range-days.
Weekday only when range is large AND distractor-free.
- Normalized EarlyCommitPenalty: (remaining/initial) * scale
Committing at 5% scan = cheap (0.05), at 90% = expensive (0.90).
Only charged on wrong commits.
- Hybrid minimum evidence: stop_after_first disabled in Hybrid mode
so solver checks all matching weekdays before committing.
Witness log:
- SolutionAttempt now carries skip_mode and context_bucket strings
- record_attempt_witnessed() for full policy audit trail
- Every trajectory records which skip mode was chosen and why
Observability:
- Puzzle tags now include distractor_count and has_dow (deterministic)
- count_distractors() made public for generator to tag puzzles
Ablation assertions (two new):
- a_skip_nonzero: Mode A uses skip at least sometimes (proves not hobbled)
- c_multi_mode: Mode C uses different skip modes across contexts (proves learning)
- Skip-mode distribution table printed per context bucket for Mode C
posterior_target monotonicity verified: 2→4→8→12→18→25→35→50→70→100
(never shrinks with difficulty)
81 tests passing (61 lib + 20 integration).
https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
Three-fix iteration based on ablation diagnostics:
1. Bounded trial: Strategy Zero now caps trial budget at min(avg_steps*2,
external_limit/4) with floor of 10 steps. Makes false hits cheap
(max 100 steps overhead instead of full compiled budget).
2. Confidence gating: Strategy Zero only attempts when config confidence
>= 0.7 (Laplace-smoothed success rate). Compiled observations from
training seed initial confidence so configs start trusted.
3. 2-failure quarantine: any compiled signature with 2+ false hits is
disabled (expected_correct=false). Prevents persistent bad patterns.
Additional changes:
- Versioned signature prefix (v1:difficulty:constraints) for cache
safety across refactors
- CompiledSolveConfig gains avg_steps, observations, confidence(),
trial_budget() methods
- KnowledgeCompiler gains steps_saved tracking, confidence_threshold,
print_diagnostics() for per-signature analysis
- record_success now tracks actual steps for delta-cost calculation
- Verbose mode prints full compiler diagnostics after each ablation
Results: false hit rate dropped from 8.2% to 4.4% (PASS). Cost still
net-positive because constraint-determined search ranges are 1-10 dates
— structurally no room for compiler optimization. Next: PolicyKernel
constraint ordering for real cost surface.
81 tests passing.
https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
Wire the KnowledgeCompiler as Strategy Zero in AdaptiveSolver solve
path — compiled constraint-signature configs are consulted before any
strategy. Add StrategyRouter with epsilon-greedy contextual bandit for
adaptive strategy selection per difficulty/constraint family.
Implement three-mode ablation protocol (A/B/C):
- Mode A: baseline (no compiler, fixed router)
- Mode B: compiler only (Strategy Zero with early termination)
- Mode C: full (compiler + adaptive router)
Adds run_ablation_comparison() and AblationComparison::print() with
quantitative assertions (B beats A on cost >=15%, C beats B on
robustness >=10%, compiler false-hit rate <5%).
Other changes:
- Early termination (stop_after_first) in TemporalSolver for compiled
single-solution puzzles
- Step accumulation across Strategy Zero failures + fallback
- Promotion gating: patterns only promoted when holdout accuracy
doesn't regress
- Compiler false_hits tracking
- --ablation flag on agi-proof-harness binary
- 81 tests passing (61 unit + 20 integration)
Ablation result (100-task holdout, 5 cycles): compiler active at 59%
hit rate with 8.2% false hit rate. Cost and robustness targets not yet
met — solver needs more policy surface (step 5: PolicyKernel learning).
https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
Implements a recursive intelligence amplification pipeline where each
level feeds the next, measuring IQ at every stage:
L1 Foundation (IQ ~79) Adaptive solver + ReasoningBank + retry
L2 Meta-Learning (IQ ~82) Learns optimal hyperparams per problem class
L3 Ensemble Arbiter (IQ ~83) Multi-strategy voting with learned selection
L4 Recursive Improve(IQ ~85) Bootstraps from own outputs + knowledge compiler
L5 Adversarial Grow (IQ ~89) Self-generated hard tasks + cascade reasoning
Key mechanisms:
- MetaParams: EMA-learned step budgets + retry benefit estimation
- StrategyEnsemble: N-solver majority vote, confidence-weighted
- KnowledgeCompiler: compiles patterns to direct lookup (54% hit rate)
- AdversarialGenerator: weakness-targeted difficulty escalation
- CascadeReasoner: multi-pass solve-verify-resolve
Results: +7.5 to +10.1 IQ gain across 5 levels, reaching IQ 86-89
depending on noise conditions. 100% accuracy at max difficulty in L4/L5.
https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
Add builder methods with_authority_config() and with_domain_profile()
for the two new TLV tags (0x0110, 0x0111). Update ParsedAgiManifest
parser to extract these sections with round-trip test coverage.
https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
- Resolve open questions: repo automation as first domain, four-level
AuthorityLevel enum, per-task ResourceBudget with hard caps,
CoherenceThresholds with validation
- Add AGI_MAX_CONTAINER_SIZE (16 GiB) with enforcement in validation
- Tighten ContainerSegments::validate: Verify/Live modes now require
world model data (VEC or INDEX segments), not just kernel/WASM
- Add ContainerError variants: InsufficientAuthority, BudgetExhausted
- Add to_flags support for orchestrator_present and world_model_present
- Add wire format section and cross-references to ADRs 029-033 in doc
- Add 2 new TLV tags: AUTHORITY_CONFIG (0x0110), DOMAIN_PROFILE (0x0111)
- Re-export new types from lib.rs
- Update rvf-runtime tests for tightened validation
- All 222 rvf-types + all rvf-runtime tests pass
https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G