Commit graph

8 commits

Author SHA1 Message Date
ruvnet
f5c39e5bbe chore(ci): green security audit + split test job into 6 matrix shards
Unblocks the 7 stacked PRs (#381-#387) and turns `main`'s CI green
for the first time in days. Two issues fixed:

## Failure 1 — Security audit (was: 8 vulnerabilities)

`cargo audit` is now exit 0. 4 of the 5 critical advisories were
fixed by version bumps; only the unfixable one is ignored.

**Dep-bumped:**
- `rustls-webpki 0.101.7` + `0.103.10` → `0.103.13` via
  `cargo update -p rustls-webpki@0.103.10`. Patches:
    RUSTSEC-2026-0098 (URI name constraints)
    RUSTSEC-2026-0099 (wildcard name constraints)
    RUSTSEC-2026-0104 (CRL parsing panic)
- `idna 0.5.0` → `1.1.0` via `validator 0.18 → 0.20` in
  `examples/scipix`. Patches RUSTSEC-2024-0421 (Punycode acceptance).
- Bonus: `reqwest 0.11 → 0.12` (in `ruvector-core` + `examples/benchmarks`)
  and `hf-hub 0.3 → 0.4` (in `ruvector-core` + `ruvllm` +
  `ruvllm-cli`). Removes the entire legacy `rustls 0.21` /
  `rustls-webpki 0.101.7` subtree from the lockfile.

**Ignored** (single advisory, with rationale):
- `RUSTSEC-2023-0071` (rsa Marvin timing sidechannel) — no upstream
  fix available; we don't expose RSA decryption services. Documented
  in `.cargo/audit.toml`.

**Unmaintained warnings** (16 total — proc-macro-error, derivative,
instant, paste, bincode 1, pqcrypto-{kyber,dilithium}, rustls-pemfile 1,
rusttype, wee_alloc, number_prefix, rand_os, core2, lru, pprof, rand) —
each given a one-line justification in `.cargo/audit.toml` so CI stays
green on them while the team decides whether to chase upstream
replacements.

## Failure 2 — Tests timeout (was: 30-min job timeout cancellation)

`.github/workflows/ci.yml` `test` job is now a `matrix` with
`fail-fast: false` and `timeout-minutes: 45`. Six parallel shards
under `cargo nextest run` (installed via `taiki-e/install-action@v2`)
plus a separate `cargo test --doc` step (nextest doesn't run
doctests):

  | Shard            | Crates                                      |
  |------------------|---------------------------------------------|
  | vector-index     | rabitq, rulake, diskann, graph, gnn, cnn    |
  | rvagent          | 10 rvagent-* crates                         |
  | ruvix            | 16 ruvix-* crates                           |
  | ruqu-quantum     | 5 ruqu* crates                              |
  | ml-research      | attention, mincut, scipix, fpga-transformer,|
  |                  | sparse-inference, sparsifier, solver,       |
  |                  | graph-transformer, domain-expansion,        |
  |                  | robotics                                    |
  | core-and-rest    | --workspace minus the above                 |

`Swatinem/rust-cache@v2` is keyed per shard. Audit job switched to
`taiki-e/install-action` for `cargo-audit` (faster than
`cargo install --locked`).

## Verification

  cargo audit                                                   → exit 0
  cargo build --workspace --exclude ruvector-postgres           → clean
  cargo clippy --workspace --exclude ruvector-postgres --no-deps -- -D warnings → exit 0
  cargo fmt --all --check                                       → exit 0

## Cargo.lock churn

166-line diff, net ~120 lines removed (more deletions than
additions). Removed: `idna 0.5.0`, `rustls-webpki 0.101.7`,
`validator 0.18`, `validator_derive 0.18`, `proc-macro-error 1.0.4`.
Added: `rustls-webpki 0.103.13`, `validator 0.20`,
`proc-macro-error2`, `hf-hub 0.4.3`, `reqwest 0.12.28`. No
suspicious crates.

## Recommended merge order

1. **This PR first** — unblocks every other PR's CI.
2. After this lands and main is green, rebase the 7 open PRs
   (#381-#387) one at a time. The DiskANN stack (#383→#384→#385→#386)
   must merge in numeric order. #381 (Python SDK), #382 (research),
   #387 (graph property index) are independent and can merge in
   any order after their CI goes green on the rebase.

Co-Authored-By: claude-flow <ruv@ruv.net>
2026-04-26 00:17:25 -04:00
Claude
0bd75e31b8 feat(rvf): rvf-solver-wasm — self-learning AGI engine compiled to WASM
Compiles the complete three-loop adaptive solver to wasm32-unknown-unknown
(160 KB, no_std + alloc). Preserves all AGI capabilities:

- Thompson Sampling two-signal model (safety Beta + cost EMA)
- 18 context buckets with per-arm bandit stats
- Speculative dual-path execution
- KnowledgeCompiler with signature-based pattern cache
- Three-loop architecture (fast/medium/slow)
- SHAKE-256 witness chain via rvf-crypto

12 WASM exports: create/destroy/train/acceptance/result/policy/witness.
Handle-based API supports 8 concurrent solver instances.

ADR-039 documents the integration architecture.
Benchmark binary validates WASM against native solver.

https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
2026-02-16 00:43:12 +00:00
Claude
5a9c899f29 feat(rvf): integrate publishable acceptance test with native SHAKE-256 witness chain
Replace standalone SHA-256 chain with rvf-crypto SHAKE-256, add native .rvf
binary output (WITNESS_SEG + META_SEG), and wire witness verification into
rvf-wasm microkernel.

Key changes:
- Feature-gate ed25519 in rvf-crypto for WASM compatibility (sha3 no_std)
- Rewrite WitnessChainBuilder to use shake256_256 + parallel rvf_crypto::WitnessEntry
- Add export_rvf_binary() with WITNESS_SEG (0x0A) + META_SEG (0x07) segments
- Add rvf_witness_verify/rvf_witness_count exports to rvf-wasm
- Add verify-rvf subcommand to acceptance-rvf CLI
- Write ADR-037 documenting architecture and AGI benchmark integration
- Update rvf-crypto, rvf-wasm, and rvf READMEs

86 tests pass (66 lib + 20 integration). rvf-crypto 49 tests pass.

https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
2026-02-16 00:13:44 +00:00
Claude
515a996530 feat(ablation): publishable RVF acceptance test with SHA-256 witness chain
Add self-contained acceptance test artifact that external developers can
run offline and reproduce identical graded outcomes:

- SHA-256-linked witness chain: every puzzle decision (skip_mode,
  context_bucket, steps, correct) hashed into a tamper-evident chain.
  Changing any single bit invalidates everything downstream.

- Deterministic replay: frozen seeds → identical puzzles → identical
  solve paths → identical chain_root_hash. Two runs with the same
  config produce the same hash, proven by test.

- JSON manifest: config, per-mode scorecards (A/B/C), all six ablation
  assertions with measured values, full witness chain, chain root hash.

- Verifier: re-runs with same config, recomputes chain, compares root
  hash. Mismatch means non-identical outcomes.

- CLI binary: `acceptance-rvf generate -o manifest.json` to produce,
  `acceptance-rvf verify -i manifest.json` to verify.

66 lib tests + 20 integration tests pass.

https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
2026-02-15 23:51:04 +00:00
Claude
d8906ed416 feat(agi-contract): multi-dimensional IQ with cost, robustness, and AGI contract
Redefine intelligence measurement as a falsifiable contract with three
equal pillars: graded outcomes (~34%), cost efficiency (~33%), and
robustness under noise (~33%). This addresses the fundamental critique
that accuracy-only IQ saturates at the ceiling.

New modules:
- agi_contract.rs: AGI contract definition (5 core metrics), autonomy
  ladder (5 levels gated by sustained health), viability checklist
- acceptance_test.rs: 10K-task holdout harness with frozen seed,
  multi-dimensional improvement tracking, deterministic replay
- bin/agi_proof_harness.rs: nightly proof runner publishing success
  rate, cost/solve, noise stability, policy compliance, autonomy level

Changes to existing modules:
- intelligence_metrics.rs: Add CostMetrics, RobustnessMetrics as
  first-class dimensions; add noise_tasks, contradictions, rollbacks,
  policy_violations to RawMetrics; rebalance overall_score weights
- superintelligence.rs: Track noise accuracy, contradiction rate,
  rollback correctness, and policy violations across all 5 levels

Contract metrics: solved/cost, noise stability, contradiction rate,
rollback correctness, policy violations (zero tolerance).

https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
2026-02-15 20:43:31 +00:00
Claude
a103e13655 feat(benchmarks): 5-level superintelligence pathway engine
Implements a recursive intelligence amplification pipeline where each
level feeds the next, measuring IQ at every stage:

L1 Foundation       (IQ ~79)  Adaptive solver + ReasoningBank + retry
L2 Meta-Learning    (IQ ~82)  Learns optimal hyperparams per problem class
L3 Ensemble Arbiter (IQ ~83)  Multi-strategy voting with learned selection
L4 Recursive Improve(IQ ~85)  Bootstraps from own outputs + knowledge compiler
L5 Adversarial Grow (IQ ~89)  Self-generated hard tasks + cascade reasoning

Key mechanisms:
- MetaParams: EMA-learned step budgets + retry benefit estimation
- StrategyEnsemble: N-solver majority vote, confidence-weighted
- KnowledgeCompiler: compiles patterns to direct lookup (54% hit rate)
- AdversarialGenerator: weakness-targeted difficulty escalation
- CascadeReasoner: multi-pass solve-verify-resolve

Results: +7.5 to +10.1 IQ gain across 5 levels, reaching IQ 86-89
depending on noise conditions. 100% accuracy at max difficulty in L4/L5.

https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
2026-02-15 20:16:11 +00:00
Claude
85e62e6600 feat(benchmarks): add RVF intelligence benchmark (baseline vs learning)
Adds head-to-head cognitive benchmark comparing stateless baseline against
full RVF-learning pipeline (witness chains, coherence monitoring, authority
guards, budget tracking, ReasoningBank). Measures accuracy, learning curves,
reasoning efficiency, and meta-cognitive quality across configurable episodes.

Results: RVF-learning shows +1.1 IQ delta with higher reasoning coherence
(0.98 vs 0.95) and efficiency (0.91 vs 0.83) at difficulty 1-10.

https://claude.ai/code/session_01RnwD4x5cbpB7FPvoyYQz8G
2026-02-15 19:59:29 +00:00
rUv
b91e555d3e feat(benchmarks): Add comprehensive temporal reasoning and vector benchmarks (#113) 2026-01-14 21:38:34 -05:00