ruvector

mirror of https://github.com/ruvnet/RuVector.git synced 2026-05-24 22:15:18 +00:00

Author	SHA1	Message	Date
rUv	2238362980	feat(rvf-cli): add cross-platform release workflow and update README - Add release-rvf-cli.yml: builds standalone binaries for Linux x64/ARM64, macOS x64/ARM64, and Windows x64 on tag push (rvf-v*) - Creates GitHub Release with all binaries and SHA256 checksums - Update CLI README with install instructions for pre-built binaries, examples/rvf/output/ usage guide, and full command reference Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 23:19:39 +00:00
rUv	bd05fd7f2e	chore: publish rvf-node@0.1.6 with win32-x64-msvc binary - Published @ruvector/rvf-node-win32-x64-msvc@0.1.4 to npm - Bumped @ruvector/rvf-node to 0.1.6 (all 5 platform binaries) - Added publishConfig to win32-x64-msvc package.json Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 23:04:34 +00:00
github-actions[bot]	eda601dbca	chore: Update NAPI-RS binaries for all platforms Built from commit `f0de332edd` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-16 22:43:35 +00:00
github-actions[bot]	6434701325	chore: Update RVF NAPI-RS binaries for all platforms Built from commit `f0de332edd` Platforms: linux-x64-gnu, linux-arm64-gnu, darwin-x64, darwin-arm64, win32-x64-msvc Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 22:43:15 +00:00
rUv	f0de332edd	fix(ci): resolve all build-rvf-node failures Three fixes: 1. locking.rs: __errno_location is Linux-only; macOS uses __error(). Split the extern "C" declarations by target_os so rvf-runtime compiles on both platforms. 2. build-rvf-node.yml: NAPI CLI outputs index.<platform>.node instead of rvf-node.<platform>.node. Added rename step after build. 3. build-rvf-node.yml: darwin builds need -undefined dynamic_lookup RUSTFLAGS so NAPI symbols resolve at runtime via Node.js. Added CARGO_TARGET_*_APPLE_DARWIN_RUSTFLAGS env vars. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 22:39:04 +00:00
github-actions[bot]	e782f3cdbe	chore: Update NAPI-RS binaries for all platforms Built from commit `5e6a9280a6` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-16 22:17:05 +00:00
rUv	5e6a9280a6	Merge pull request #177 from ruvnet/fix/rvf-backend-stubs fix(rvf): populate backend binaries and fix SDK API wiring	2026-02-16 14:12:31 -08:00
rUv	54fb86465f	fix(ci): resolve cp same-file error in build-rvf-node workflow The copy step was failing with "cp: 'X' and 'X' are the same file" because committed binaries in npm/ subdirs matched the find pattern. Added -maxdepth 1 to only find freshly built files and realpath comparison before cp. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 21:57:12 +00:00
rUv	30a556129c	fix(ci): add missing rvf-adapters/claude-flow crate to git The workspace member crates/rvf/rvf-adapters/claude-flow was listed in Cargo.toml but gitignored, causing CI builds to fail with: "failed to load manifest for workspace member rvf-adapters/claude-flow" Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 21:51:05 +00:00
rUv	365139bb31	chore: bump rvf-node to 0.1.5, rvf SDK to 0.1.7 - rvf-node@0.1.5 includes all 4 platform binaries inline (linux-x64-gnu, linux-arm64-gnu, darwin-x64, darwin-arm64) - rvf SDK@0.1.7 depends on rvf-node ^0.1.5 and rvf-wasm ^0.1.5 Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 21:43:46 +00:00
rUv	941f5cb846	chore(rvf-node): add darwin and linux-arm64 platform binaries Cross-compiled from Linux via cargo-zigbuild: - darwin-arm64 (3.1MB Mach-O arm64) - darwin-x64 (3.1MB Mach-O x86_64) - linux-arm64-gnu (1.3MB ELF aarch64) Published platform packages: - @ruvector/rvf-node-darwin-arm64@0.1.4 - @ruvector/rvf-node-darwin-x64@0.1.4 - @ruvector/rvf-node-linux-arm64-gnu@0.1.4 Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 21:42:52 +00:00
rUv	4e504f2172	fix(rvf-wasm): fix Node.js CJS/ESM glue and add rvf-node CI - Fix WASM glue: detect Node.js properly instead of relying on fetch() (fetch on file:// URLs fails in Node.js 18-21) - Support both CJS require() and ESM import via exports map - Add .mjs ESM wrapper for dual-format support - Remove "type": "module" for CJS compatibility - Bump rvf-wasm to 0.1.5 - Add build-rvf-node.yml CI workflow for cross-platform NAPI builds (linux-x64-gnu, linux-arm64-gnu, darwin-x64, darwin-arm64, win32-x64-msvc) - Fix wasm-dedup-check CI: use --ignore-scripts --omit=optional to avoid EBADPLATFORM errors from platform-specific workspace packages Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 21:33:14 +00:00
rUv	bb78099f0d	fix(rvf): populate backend binaries and fix SDK API wiring - Build NAPI native addon (linux-x64-gnu, 1.3MB) and WASM binary (42KB) - Fix NodeBackend to use RvfDatabase class instance methods instead of module-level functions - Fix WasmBackend to use C-ABI store functions with integer handles - Add platform loader (index.js) and TypeScript declarations (index.d.ts) - Create JS glue and type declarations for WASM package - Set up platform-specific npm packages for all 5 targets - Bump rvf-node/rvf-wasm to 0.1.4, SDK to 0.1.6 - Fix version pins from 0.1.0 to ^0.1.4 Resolves: rvf-node and rvf-wasm published as empty stubs with no binaries Verified: E2E test passes (create -> ingest -> query -> status -> close) Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 20:24:51 +00:00
github-actions[bot]	0ed302e1b3	chore: Update NAPI-RS binaries for all platforms Built from commit `ff8ebedc5e` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-16 15:41:53 +00:00
rUv	ff8ebedc5e	docs(rvf): remove redundant sections from crate README - Fix "20 segment types" → "24 segment types" in ASCII anatomy - Remove duplicate "Category Shift" table (restated capability table) - Remove duplicate "Where It Runs" table (restated capability table) - Remove "What You Can Ship" table from Sealed Cognitive Engines - Remove "What This Enables" 6-item list (restated format capabilities) - Remove duplicate "Cognitive Containers" and "Security & Trust" sub-tables from Features section - Remove "File Structure with KERNEL_SEG" diagram (duplicated segment tree) - Convert "Security Hardening" verbose table to compact "Security Modules" reference table Net: -119 lines of redundant content, +13 lines of concise replacements. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 15:37:35 +00:00
github-actions[bot]	49315aab91	chore: Update NAPI-RS binaries for all platforms Built from commit `3a89681bf2` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-16 14:59:15 +00:00
rUv	3a89681bf2	feat(rvf): add platform-specific scripts for Linux, Windows, Node, browser, Docker - rvf-quickstart.sh / .ps1 — 7-step RVF workflow (create, ingest, query, branch, verify) - rvf-claude-appliance.sh / .ps1 — build & boot the 5.1 MB Claude Code Appliance - rvf-mcp-server.sh / .ps1 — start stdio or SSE MCP server for AI agents - rvf-node-example.mjs — full Node.js API walkthrough - rvf-browser.html — browser WASM vector search demo - rvf-docker.sh — containerized RVF CLI for CI/CD Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 14:55:15 +00:00
github-actions[bot]	cb33833b2b	chore: Update NAPI-RS binaries for all platforms Built from commit `58c36b8dfe` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-16 14:54:43 +00:00
github-actions[bot]	8d5e4f60db	chore: Update NAPI-RS binaries for all platforms Built from commit `b9fc0bb8f6` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-16 14:52:40 +00:00
rUv	58c36b8dfe	docs(rvf): improve Security & Trust sections, add live_boot_proof example - Add introductory paragraph explaining RVF's structural security model - Expand Security & Trust tables with TEE attestation, KernelBinding, adversarial hardening details - Upgrade Security Hardening from bullet list to defense table - Add live_boot_proof as example #45, update counts to 46 Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 14:49:52 +00:00
rUv	b9fc0bb8f6	feat(rvf): add WASM_SEG (0x10) for self-bootstrapping RVF files feat(rvf): add WASM_SEG (0x10) for self-bootstrapping RVF files	2026-02-16 09:47:49 -05:00
rUv	ab7d1e78fc	docs: update READMEs with self-booting instructions, bump npm versions - Add Claude Code Appliance walkthrough and 5.1 MB self-boot line to crate, examples, npm, and root READMEs - Add missing live_boot_proof example to table (45→46 examples) - Update segment count references from 20→24 - Improve rvf-node npm README with full API reference - Expand AGI Cognitive Container documentation - Bump npm packages: rvf-node 0.1.3, rvf-wasm 0.1.3, rvf-mcp-server 0.1.3, rvf 0.1.5 - Include verified claude_code_appliance output files Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 14:43:04 +00:00
rUv	9fb3d2c63b	chore: bump rvf-types/rvf-crypto/rvf-runtime to 0.2.0 for new features Breaking changes from 0.1.0: - rvf-types: new Security/QualityBelowThreshold error variants, new quality module, AGI container types, WASM bootstrap types, Ed25519 signing, witness/attestation types, QR seed types - rvf-crypto: new witness chain, attestation, lineage modules - rvf-runtime: new AGI authority/coherence, QR seed, witness bundles, safety net, adversarial detection, domain expansion bridge Also updates all internal dependency version references. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 14:04:23 +00:00
rUv	104fe20a93	fix: resolve build breaks from new rvf-types fields in rvf-launch, rvf-node - rvf-launch: add missing retrieval_quality field to SearchResult - rvf-node: add match arms for new Security/QualityBelowThreshold error variants - rvf-node: use struct update syntax for new QueryOptions fields - rvf-runtime: add missing domain_expansion_present field in tests Co-Authored-By: claude-flow <ruv@ruv.net>	2026-02-16 14:03:29 +00:00
Claude	53243194ca	feat(domain-expansion): add meta-learning engine with five AGI learning improvements Closes five architectural gaps in the learning pipeline: 1. RegretTracker — cumulative regret tracking per context bucket. Measures optimality gap (O(√T) = learning, O(T) = not). Enables convergence detection and learning speed measurement. 2. DecayingBeta — exponential forgetting for non-stationary environments. Old evidence decays by configurable factor per observation. Effective window ≈ 1/(1-decay). Prevents calcification on stale data. 3. PlateauDetector — detects learning stalls with escalating responses: Continue → IncreaseExploration → TriggerTransfer → InjectDiversity → Reset. Compares accuracy windows and tracks consecutive plateau events. 4. ParetoFront — multi-objective optimization replacing single-scalar fitness. Tracks non-dominated solutions across [accuracy, -cost, robustness]. Includes hypervolume indicator, spread metrics, and per-objective queries. 5. CuriosityBonus — UCB-style exploration bonus for under-visited contexts. Bonus = c * sqrt(ln(N) / n_i). Directs exploration toward novel bucket/arm combinations rather than relying solely on Thompson variance. All five compose into MetaLearningEngine, wired into DomainExpansionEngine: - record_decision() feeds regret + curiosity + decaying beta on every arm pick - evolve_population() records kernels into Pareto front before evolution - select_arm_curious() adds curiosity bonus to Thompson Sampling - check_plateau() monitors cost curves for learning stalls - meta_health() provides unified diagnostics Performance (optimized hot paths avoid HashMap clone on existing entries): - RegretTracker: 84ns/record (1k decisions in 84µs) - DecayingBeta: 3ns/update - PlateauDetector: 4.1ns/check - ParetoFront: 67ns/insert, 146ns/hypervolume - Full cycle: 199ns/decision (18% faster after optimization) 82 tests pass. 6 new benchmarks added. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-16 04:40:34 +00:00
Claude	0261cf1198	docs(adr): update ADRs with implementation details from rvf-types - ADR-029: Add complete segment type registry (23 variants) with source references - ADR-030: Phase 1 complete — KernelHeader (128B), EbpfHeader (64B), WasmHeader (64B), all enums and flag constants implemented with 32+ tests. Updated GOAP world state. - ADR-032: Add WASM bootstrap types implementation section (WasmHeader, WasmRole, WasmTarget, 8 feature flags, 10 tests) - ADR-036: Status updated to Partially Implemented. Documented AGI container implementation (972 lines, 24 tests) including AgiContainerHeader, ExecutionMode, AuthorityLevel, ResourceBudget, CoherenceThresholds, ContainerSegments, and 22 TLV tags with domain expansion integration (0x0112-0x0115) https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-16 02:58:43 +00:00
Claude	52f5caeb11	feat(domain-expansion): integrate with RVF format — segment serialization, witness chains, AGI packaging Connects the domain expansion engine to the RuVector Format (RVF) wire protocol, closing all integration gaps: - Add SegmentType::TransferPrior (0x30), PolicyKernel (0x31), CostCurve (0x32) to rvf-types for domain expansion segment packaging - Add AGI_HAS_DOMAIN_EXPANSION flag and AGI_TAG_TRANSFER_PRIOR/POLICY_KERNEL/ COST_CURVE/COUNTEREXAMPLES TLV tags to AGI container types - Create rvf_bridge module (feature-gated behind "rvf") with: - RVF segment round-trip serialization for all three core types - SHAKE-256 witness chain integration via rvf-crypto - AGI container TLV packaging and encoding/decoding - SolverPriorExchange bridge for rvf-solver-wasm prior transfer - Multi-segment file assembly for standalone domain expansion archives - Wire-format wrappers (WireTransferPrior, WirePolicyKernel) handle HashMap<ContextBucket, _> → Vec<(K,V)> conversion for JSON safety - Add RVF export methods to WASM crate (WasmRvfBridge) for browser-side segment serialization, witness hashing, and solver prior exchange - 59 tests pass with rvf feature, 49 without — feature gate clean https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-16 02:44:09 +00:00
Claude	eff0ccce81	feat(domain-expansion): cross-domain transfer learning engine with WASM bindings Implements a complete cross-domain transfer learning system proving that kernels trained on Domain 1 can improve Domain 2 faster than training Domain 2 alone — demonstrating true generalization. Core engine (ruvector-domain-expansion): - Three specialized domains: Rust program synthesis, structured planning, tool orchestration — each with task generation, evaluation, and 64-dim shared embedding space - Meta Thompson Sampling with Beta-posterior priors across domains and contextual bandits (difficulty_tier × category buckets) - Population-based PolicyKernel search: evolutionary optimization with elite selection (top 25%), mutation, crossover over 8 tunable knobs - Speculative dual-path execution triggered by posterior variance - Cost curve compression tracking + acceleration scoreboard verifying progressive generalization (target: 95% accuracy, ≤0.01 cost) - Cross-domain transfer protocol with dampened prior initialization (sqrt scaling) and non-regression verification WASM bindings (ruvector-domain-expansion-wasm): - WasmDomainExpansionEngine, WasmThompsonEngine, WasmPopulationSearch, WasmScoreboard — full JS interop via serde-wasm-bindgen - Optimized for edge: opt-level "z", LTO, panic=abort, strip 49 tests passing, 8 Criterion benchmarks (Thompson select: 266ns, embedding: 2.86µs, population evolve: 7.4µs, cost curve AUC: 768ns). https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-16 01:41:47 +00:00
Claude	262d70d2d4	docs(rvf-solver-wasm): add detailed README with architecture, API tables, and usage examples https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-16 00:51:21 +00:00
Claude	e2a3f1a6e4	feat(rvf): rvf-solver-wasm — self-learning AGI engine compiled to WASM Compiles the complete three-loop adaptive solver to wasm32-unknown-unknown (160 KB, no_std + alloc). Preserves all AGI capabilities: - Thompson Sampling two-signal model (safety Beta + cost EMA) - 18 context buckets with per-arm bandit stats - Speculative dual-path execution - KnowledgeCompiler with signature-based pattern cache - Three-loop architecture (fast/medium/slow) - SHAKE-256 witness chain via rvf-crypto 12 WASM exports: create/destroy/train/acceptance/result/policy/witness. Handle-based API supports 8 concurrent solver instances. ADR-039 documents the integration architecture. Benchmark binary validates WASM against native solver. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-16 00:43:12 +00:00
Claude	e2cc7326ab	docs(adr): ADR-038 npx ruvector & rvlite witness verification integration Plans the integration path for .rvf acceptance test verification into the npm ecosystem: - npx ruvector rvf verify-witness <file.rvf> (N-API + WASM fallback) - npx rvlite verify-witness <file.rvf> (WASM via cli-rvf.ts) - rvlite SDK verifyWitnessChain() for browser-side verification - MCP tool rvf_verify_witness for Claude Code agents - 5-phase implementation plan, each independently shippable Bridges the rvf_witness_verify WASM export (ADR-037) to end users without requiring the Rust toolchain. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-16 00:17:00 +00:00
Claude	aca7f6b197	feat(rvf): integrate publishable acceptance test with native SHAKE-256 witness chain Replace standalone SHA-256 chain with rvf-crypto SHAKE-256, add native .rvf binary output (WITNESS_SEG + META_SEG), and wire witness verification into rvf-wasm microkernel. Key changes: - Feature-gate ed25519 in rvf-crypto for WASM compatibility (sha3 no_std) - Rewrite WitnessChainBuilder to use shake256_256 + parallel rvf_crypto::WitnessEntry - Add export_rvf_binary() with WITNESS_SEG (0x0A) + META_SEG (0x07) segments - Add rvf_witness_verify/rvf_witness_count exports to rvf-wasm - Add verify-rvf subcommand to acceptance-rvf CLI - Write ADR-037 documenting architecture and AGI benchmark integration - Update rvf-crypto, rvf-wasm, and rvf READMEs 86 tests pass (66 lib + 20 integration). rvf-crypto 49 tests pass. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-16 00:13:44 +00:00
Claude	0dabec3e38	chore: update Cargo.lock for sha2 dependency https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 23:51:51 +00:00
Claude	ccfc386ac3	feat(ablation): publishable RVF acceptance test with SHA-256 witness chain Add self-contained acceptance test artifact that external developers can run offline and reproduce identical graded outcomes: - SHA-256-linked witness chain: every puzzle decision (skip_mode, context_bucket, steps, correct) hashed into a tamper-evident chain. Changing any single bit invalidates everything downstream. - Deterministic replay: frozen seeds → identical puzzles → identical solve paths → identical chain_root_hash. Two runs with the same config produce the same hash, proven by test. - JSON manifest: config, per-mode scorecards (A/B/C), all six ablation assertions with measured values, full witness chain, chain root hash. - Verifier: re-runs with same config, recomputes chain, compares root hash. Mismatch means non-identical outcomes. - CLI binary: `acceptance-rvf generate -o manifest.json` to produce, `acceptance-rvf verify -i manifest.json` to verify. 66 lib tests + 20 integration tests pass. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 23:51:04 +00:00
Claude	2ed3dce655	feat(ablation): Thompson Sampling two-signal model, speculative dual-path, constraint propagation Replace epsilon-greedy with two-signal Thompson Sampling (safety Beta posterior + cost EMA) for Mode C learned policy. Score = safety_sample - lambda * cost_ema provides principled exploration-exploitation. Add speculative dual-path for Mode C only: when Beta variance > 0.02 and top-2 arms within delta 0.15, run both arms (60/40 budget split) to resolve uncertainty faster while keeping Mode A/B ablation clean. Add constraint propagation pre-pass as PolicyKernel-controlled mode (Off/Light/Full, defaults to Off). Light handles InMonth+DayOfMonth direct solves; Full adds DayOfWeek pruning for ranges ≤60 days. PrepassMetrics tracks pruned_candidates, prepass_steps, scan_steps_saved. Beta sampling via Marsaglia-Tsang Gamma method + Box-Muller normal. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 23:40:05 +00:00
Claude	aab38ed45b	refine(ablation): flip sign, wire penalty, expand buckets Fixed policy sign flip (Mode A): risk_score = R - 30D (was R + 30D) Distractors now reduce effective range, making Mode A conservative under distractors. This is the defensible control arm: a rational fixed agent should be more cautious when distractors are present. Mode C must learn to outperform this baseline. EarlyCommitPenalty wired into bandit reward: SkipModeStats now tracks early_commit_penalty_sum per arm. reward() includes robustness_penalty = 0.2 * avg_penalty. This means Mode C can actually learn to avoid early wrong commits in distractor-heavy contexts. Previously the penalty was only printed, not optimized. Context buckets expanded to 18: 3 range (small/medium/large) × 3 distractor (clean/some/heavy) × 2 noise (clean/noisy) = 18 buckets. Previous: 4 range × 2 distractor = 8 (too coarse for bandit). Noise flag now flows through AdaptiveSolver.noisy_hint. New ablation assertion: c_penalty_better_than_b: Mode C EarlyCommitPenalty must be ≤90% of Mode B penalty. Proves robustness improvement is explicit, not just noise_accuracy-based. Acceptance test noise plumbing: solver.noisy_hint set to true for noisy puzzles in both training and holdout evaluation. Context buckets now correctly distinguish clean vs noisy conditions. 81 tests passing (61 lib + 20 integration). https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 23:19:43 +00:00
Claude	38228a9a6d	refine(ablation): risk_score policy, normalized penalty, witness log PolicyKernel refinements: - Fixed policy (Mode A): risk_score = R + kD, k=30, T=140 Fixed constants (not learned) — Mode A is the control arm. One distractor raises perceived risk by ~30 range-days. Weekday only when range is large AND distractor-free. - Normalized EarlyCommitPenalty: (remaining/initial) scale Committing at 5% scan = cheap (0.05), at 90% = expensive (0.90). Only charged on wrong commits. - Hybrid minimum evidence: stop_after_first disabled in Hybrid mode so solver checks all matching weekdays before committing. Witness log: - SolutionAttempt now carries skip_mode and context_bucket strings - record_attempt_witnessed() for full policy audit trail - Every trajectory records which skip mode was chosen and why Observability: - Puzzle tags now include distractor_count and has_dow (deterministic) - count_distractors() made public for generator to tag puzzles Ablation assertions (two new): - a_skip_nonzero: Mode A uses skip at least sometimes (proves not hobbled) - c_multi_mode: Mode C uses different skip modes across contexts (proves learning) - Skip-mode distribution table printed per context bucket for Mode C posterior_target monotonicity verified: 2→4→8→12→18→25→35→50→70→100 (never shrinks with difficulty) 81 tests passing (61 lib + 20 integration). https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 23:08:02 +00:00
Claude	fa69d5e247	feat(ablation): PolicyKernel, DifficultyVector, fair mode comparison All modes now share the same solver capabilities. What differs is the policy mechanism that decides when to use them: - Mode A: fixed heuristic (posterior_range + distractor_count) - Mode B: compiler-suggested skip_mode from constraint signatures - Mode C: learned PolicyKernel (contextual bandit over skip modes) Key changes: PolicyKernel (temporal.rs): - SkipMode enum: None \| Weekday \| Hybrid - fixed_policy(): if DayOfWeek AND range>30 AND no distractors → Weekday - compiled_policy(): uses CompiledSolveConfig.compiled_skip_mode - learned_policy(): epsilon-greedy over per-context SkipModeStats - EarlyCommitPenalty: tracks solved-but-wrong from aggressive skipping - Hybrid mode: weekday skip + ±7 day refinement pass for safety DifficultyVector (timepuzzles.rs): - Replaces single-axis difficulty with (range_size, posterior_target, distractor_rate, noise_rate, ambiguity_count) - Flipped relationship: higher difficulty = wider range + more ambiguity (not tighter posterior) - Distractor DayOfWeek (difficulty 6+): DayOfWeek present but paired with wider Between that makes unconditional skipping risky Ablation fairness (acceptance_test.rs): - Removed feature gating: skip_weekday no longer forbidden for Mode A - All modes access same solver knobs, differ only by policy - AblationResult tracks PolicyKernel metrics (early_commit_rate, etc) - Comparison print shows policy differences explicitly 81 tests passing (61 lib + 20 integration). https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 22:54:28 +00:00
Claude	60b2aeec20	feat(generator): posterior-targeting puzzle generation, weekday skipping PolicyKernel Generator hardening: - Rewrite puzzle generator with difficulty-based posterior targeting (30-365 day ranges) - Remove InMonth/DayRange over-constraining from low difficulties - DayOfWeek constraint (difficulty 3+) creates 7x cost surface for solver optimization - Distractor injection at difficulty 5+ (redundant constraints that don't narrow search) - target_posterior() scales 300→20 across difficulty 1→10 Solver PolicyKernel: - Add skip_weekday: Option<Weekday> to TemporalSolver - Weekday skipping advances by 7 days instead of 1 when DayOfWeek constraint detected - Wire into AdaptiveSolver for compiler/router modes (B and C) - Mode A (baseline) scans linearly, Mode B/C skip to matching weekdays Correctness: - Relax correctness check: "every expected solution found" (not "only expected found") - Wide posteriors have many valid dates; only target inclusion matters - Integration test step budget increased to 400 for wider ranges Ablation results: - Mode A: 195.96 cost/solve (full linear scan) - Mode B: 68.80 cost/solve (65% reduction via weekday skipping) - Mode C: 68.80 cost/solve (65% reduction, same as B) - B beats A on cost: PASS (65% > 15% threshold) - Compiler false-hit rate: PASS (<5%) - 81 tests passing (61 unit + 20 integration) https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 22:31:12 +00:00
Claude	df054cfcdc	feat(compiler): bounded trial, confidence gating, 2-failure quarantine Three-fix iteration based on ablation diagnostics: 1. Bounded trial: Strategy Zero now caps trial budget at min(avg_steps*2, external_limit/4) with floor of 10 steps. Makes false hits cheap (max 100 steps overhead instead of full compiled budget). 2. Confidence gating: Strategy Zero only attempts when config confidence >= 0.7 (Laplace-smoothed success rate). Compiled observations from training seed initial confidence so configs start trusted. 3. 2-failure quarantine: any compiled signature with 2+ false hits is disabled (expected_correct=false). Prevents persistent bad patterns. Additional changes: - Versioned signature prefix (v1:difficulty:constraints) for cache safety across refactors - CompiledSolveConfig gains avg_steps, observations, confidence(), trial_budget() methods - KnowledgeCompiler gains steps_saved tracking, confidence_threshold, print_diagnostics() for per-signature analysis - record_success now tracks actual steps for delta-cost calculation - Verbose mode prints full compiler diagnostics after each ablation Results: false hit rate dropped from 8.2% to 4.4% (PASS). Cost still net-positive because constraint-determined search ranges are 1-10 dates — structurally no room for compiler optimization. Next: PolicyKernel constraint ordering for real cost surface. 81 tests passing. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 22:01:46 +00:00
Claude	1c22ba99ce	feat(agi): KnowledgeCompiler Strategy Zero, StrategyRouter bandit, ablation protocol Wire the KnowledgeCompiler as Strategy Zero in AdaptiveSolver solve path — compiled constraint-signature configs are consulted before any strategy. Add StrategyRouter with epsilon-greedy contextual bandit for adaptive strategy selection per difficulty/constraint family. Implement three-mode ablation protocol (A/B/C): - Mode A: baseline (no compiler, fixed router) - Mode B: compiler only (Strategy Zero with early termination) - Mode C: full (compiler + adaptive router) Adds run_ablation_comparison() and AblationComparison::print() with quantitative assertions (B beats A on cost >=15%, C beats B on robustness >=10%, compiler false-hit rate <5%). Other changes: - Early termination (stop_after_first) in TemporalSolver for compiled single-solution puzzles - Step accumulation across Strategy Zero failures + fallback - Promotion gating: patterns only promoted when holdout accuracy doesn't regress - Compiler false_hits tracking - --ablation flag on agi-proof-harness binary - 81 tests passing (61 unit + 20 integration) Ablation result (100-task holdout, 5 cycles): compiler active at 59% hit rate with 8.2% false hit rate. Cost and robustness targets not yet met — solver needs more policy surface (step 5: PolicyKernel learning). https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 21:29:48 +00:00
Claude	b06c797188	feat(agi): three-class memory, loop gating, RVF artifacts, rollback witnesses Memory poisoning defense: - Three memory classes: Volatile → Trusted → Quarantined - Counterexample-first promotion: patterns require counterexamples to promote - Demote Trusted → Quarantined on holdout failure - Strategy selection respects quarantine (skips quarantined patterns) - Structured counterexamples with full evidence chain - Rollback witnesses with trajectory/pattern diff recording Three-loop gating architecture: - Fast loop (per step): invariant checking, gate decisions (allow/block/quarantine/rollback) - Medium loop (per attempt): proposes memory writes, cannot commit - Slow loop (per cycle): consolidation, promotion review, rollback on regression - Critical rule: medium proposes, fast commits, slow promotes RVF artifact packaging: - Manifest (engine version, pinned configs, seed set, holdout IDs) - Memory snapshot (bank serialization, compiler cache, promotion log) - Witness chain (per-episode input/config/grade/memory hashes) - Verification: replay mode (stored grades) and verify mode (regenerated) - FNV-1a hashing for deterministic witness chain integrity Acceptance test improvements: - Fixed step budget (was /10, now uses full budget per task) - Integrated memory checkpoints with rollback on regression - Quarantine contradictory training trajectories - Counterexample recording during training - Quantitative thresholds: cost -15%, robustness +10%, rollback 95% - Separated contradictions from policy violations Bug fixes: - Fixed L1/L2 rollback tracking dead code in superintelligence.rs - Fixed unused parens warning in intelligence_metrics.rs 80 tests passing (60 unit + 20 integration) https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 21:09:01 +00:00
Claude	d51972d4a3	feat(agi-contract): multi-dimensional IQ with cost, robustness, and AGI contract Redefine intelligence measurement as a falsifiable contract with three equal pillars: graded outcomes (~34%), cost efficiency (~33%), and robustness under noise (~33%). This addresses the fundamental critique that accuracy-only IQ saturates at the ceiling. New modules: - agi_contract.rs: AGI contract definition (5 core metrics), autonomy ladder (5 levels gated by sustained health), viability checklist - acceptance_test.rs: 10K-task holdout harness with frozen seed, multi-dimensional improvement tracking, deterministic replay - bin/agi_proof_harness.rs: nightly proof runner publishing success rate, cost/solve, noise stability, policy compliance, autonomy level Changes to existing modules: - intelligence_metrics.rs: Add CostMetrics, RobustnessMetrics as first-class dimensions; add noise_tasks, contradictions, rollbacks, policy_violations to RawMetrics; rebalance overall_score weights - superintelligence.rs: Track noise accuracy, contradiction rate, rollback correctness, and policy violations across all 5 levels Contract metrics: solved/cost, noise stability, contradiction rate, rollback correctness, policy violations (zero tolerance). https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 20:43:31 +00:00
Claude	7e070dbf9c	feat(benchmarks): 5-level superintelligence pathway engine Implements a recursive intelligence amplification pipeline where each level feeds the next, measuring IQ at every stage: L1 Foundation (IQ ~79) Adaptive solver + ReasoningBank + retry L2 Meta-Learning (IQ ~82) Learns optimal hyperparams per problem class L3 Ensemble Arbiter (IQ ~83) Multi-strategy voting with learned selection L4 Recursive Improve(IQ ~85) Bootstraps from own outputs + knowledge compiler L5 Adversarial Grow (IQ ~89) Self-generated hard tasks + cascade reasoning Key mechanisms: - MetaParams: EMA-learned step budgets + retry benefit estimation - StrategyEnsemble: N-solver majority vote, confidence-weighted - KnowledgeCompiler: compiles patterns to direct lookup (54% hit rate) - AdversarialGenerator: weakness-targeted difficulty escalation - CascadeReasoner: multi-pass solve-verify-resolve Results: +7.5 to +10.1 IQ gain across 5 levels, reaching IQ 86-89 depending on noise conditions. 100% accuracy at max difficulty in L4/L5. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 20:16:11 +00:00
Claude	e20776bb0d	feat(benchmarks): 6-vertical intelligence benchmark with real divergence Rewrites the intelligence benchmark so RVF-learning ACTUALLY diverges from baseline. Introduces six intelligence verticals where learning changes outcomes: 1. Step-Limited Reasoning — adaptive step budget allocation from learned averages 2. Noisy Constraints — noise injection + RVF retry with clean puzzle 3. Transfer Learning — cross-episode pattern reuse via persistent ReasoningBank 4. Error Recovery — coherence-gated rollback with doubled step budget retry 5. Compositional Scaling — progressive difficulty ramp across episodes 6. Knowledge Retention — recycled puzzles from earlier solved archives Key results (15 episodes x 25 tasks, 30% noise, 350 step budget): - Overall Accuracy: +13.1% (78.7% -> 91.7%) - Final Episode: +16.0% (80.0% -> 96.0%) - IQ Score: +5.7 (79.2 -> 84.9) - Noisy Constraints: +47.5% (49.5% -> 97.1%) - Error Recovery: +61.3% (0.0% -> 61.3%) Also adds AdaptiveSolver.solver_mut() and external_step_limit to temporal.rs for safe step budget control without unsafe transmute. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 20:08:47 +00:00
Claude	ef8646784c	feat(benchmarks): add RVF intelligence benchmark (baseline vs learning) Adds head-to-head cognitive benchmark comparing stateless baseline against full RVF-learning pipeline (witness chains, coherence monitoring, authority guards, budget tracking, ReasoningBank). Measures accuracy, learning curves, reasoning efficiency, and meta-cognitive quality across configurable episodes. Results: RVF-learning shows +1.1 IQ delta with higher reasoning coherence (0.98 vs 0.95) and efficiency (0.91 vs 0.83) at difficulty 1-10. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 19:59:29 +00:00
Claude	7c6a637bba	feat(agi-runtime): authority guard, coherence monitor, benchmarks Add three new modules to rvf-runtime implementing the ADR-036 runtime: - agi_authority.rs: AuthorityGuard (per-mode + per-action-class enforcement), BudgetTracker (resource consumption tracking with hard caps), ActionClass enum (10 action categories) - agi_coherence.rs: CoherenceMonitor (real-time state machine with Healthy/SkillFreeze/RepairMode/Halted transitions), ContainerValidator (full validation pipeline) - tests/agi_e2e.rs: end-to-end integration tests and performance benchmarks (header serialize/deserialize, container build/parse, flags computation) All 219 rvf-runtime lib tests pass. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 19:22:26 +00:00
Claude	191fa06874	feat(agi-container): add authority_config and domain_profile TLV support Add builder methods with_authority_config() and with_domain_profile() for the two new TLV tags (0x0110, 0x0111). Update ParsedAgiManifest parser to extract these sections with round-trip test coverage. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 19:18:09 +00:00
Claude	8715458cb8	refactor(adr-036): optimize AGI container architecture - Resolve open questions: repo automation as first domain, four-level AuthorityLevel enum, per-task ResourceBudget with hard caps, CoherenceThresholds with validation - Add AGI_MAX_CONTAINER_SIZE (16 GiB) with enforcement in validation - Tighten ContainerSegments::validate: Verify/Live modes now require world model data (VEC or INDEX segments), not just kernel/WASM - Add ContainerError variants: InsufficientAuthority, BudgetExhausted - Add to_flags support for orchestrator_present and world_model_present - Add wire format section and cross-references to ADRs 029-033 in doc - Add 2 new TLV tags: AUTHORITY_CONFIG (0x0110), DOMAIN_PROFILE (0x0111) - Re-export new types from lib.rs - Update rvf-runtime tests for tightened validation - All 222 rvf-types + all rvf-runtime tests pass https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 19:10:00 +00:00
Claude	ba012d0c54	feat(adr-036): AGI cognitive container types + builder Wire format for packaging the entire AGI framework into a single RVF: Types (rvf-types/src/agi_container.rs): - AgiContainerHeader: 64-byte repr(C) header (RVAG magic) - ContainerSegments: inventory of KERNEL/WASM/VEC/INDEX/WITNESS segments - ExecutionMode: Replay / Verify / Live - 16 TLV tags: model_id, policy, orchestrator, tools, eval, skills, replay_script, kernel_config, coherence, project_instructions, etc. - 12 capability flags: kernel, wasm, orchestrator, world_model, eval, skills, witness, signed, replay, offline, tools, coherence_gates Builder (rvf-runtime/src/agi_container.rs): - AgiContainerBuilder: fluent API for assembling container manifests - ParsedAgiManifest: zero-copy parser with section extraction - HMAC-SHA256 signing for tamper detection - Segment validation per execution mode Container layout: - META segment: AGI manifest (header + TLV config) - KERNEL_SEG: micro Linux kernel (Firecracker vmlinux) - WASM_SEG: interpreter + microkernel modules - VEC_SEG + INDEX_SEG: RuVector world model - WITNESS_SEG: ADR-035 witness chains - CRYPTO_SEG: signing keys and attestation Tests: 13 types + 4 runtime = 17 new tests. All 468 passing. https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G	2026-02-15 18:47:56 +00:00

1 2 3 4 5 ...

955 commits