ruvector

mirror of https://github.com/ruvnet/RuVector.git synced 2026-05-28 09:53:36 +00:00

Author	SHA1	Message	Date
github-actions[bot]	edddeee1ee	chore: Update NAPI-RS binaries for all platforms Built from commit `00e8fc3f1e` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-08 17:07:51 +00:00
rUv	96c84f422e	feat: publish ruQu quantum simulation engine crates Published crates: - ruqu-core v2.0.2 - State-vector simulator - ruqu-algorithms v2.0.2 - VQE, Grover, QAOA, Surface Code - ruqu-exotic v2.0.2 - Quantum-classical hybrids - ruqu-wasm v2.0.2 - WebAssembly bindings Updated README with quantum engine section linking ADRs: - QE-001 to QE-012: Core architecture to MinCut coherence - Code example for GHZ state creation Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-08 17:06:58 +00:00
github-actions[bot]	7970c07036	chore: Update NAPI-RS binaries for all platforms Built from commit `8cfb2e8393` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-08 17:06:47 +00:00
rUv	efd0f70f56	Merge pull request #154 from ruvnet/claude/quantum-engine-adrs-6OsEO feat: Add quantum simulation engine ADR series (QE-001 to QE-012) and DDD design documents	2026-02-08 12:05:03 -05:00
rUv	4a59e7e4de	Merge origin/main into claude/quantum-engine-adrs-6OsEO - resolve Cargo.toml conflict	2026-02-08 17:04:57 +00:00
rUv	7781e10c0e	docs: expand temporal tensor store section with PR #156 details Added ADR links (018-023) and DDD reference for: - Block-based storage engine - Tiered quantization formats - Temporal scoring tier migration - Delta compression reconstruction - WASM API cross-platform - Benchmarking acceptance criteria Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-08 17:03:22 +00:00
rUv	239f22214c	docs: update README with new crates and BitNet features Added: - ruvector-temporal-tensor: Temporal tensor store with tiered quantization - ruvector-crv: CRV signal line protocol for vector search - BitNet 1.58-bit quantization features to ruvllm description Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-08 17:02:09 +00:00
rUv	2c607c7327	chore: bump versions for BitNet integration publish - Workspace version: 2.0.1 → 2.0.2 - ruvector-sona: 0.1.4 → 0.1.5 (adds Debug impl for SonaEngine) - ruvllm: 2.0.2 (BitNet integration from PR #151) Published crates: - ruvector-sona v0.1.5 - ruvllm v2.0.2 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-08 17:00:26 +00:00
github-actions[bot]	c0b610dcd1	chore: Update NAPI-RS binaries for all platforms Built from commit `593b44a543` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-08 16:59:58 +00:00
github-actions[bot]	445dace141	chore: Update NAPI-RS binaries for all platforms Built from commit `e98ce26da3` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-08 16:58:30 +00:00
github-actions[bot]	926376e227	chore: Update NAPI-RS binaries for all platforms Built from commit `0989957d15` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-08 16:55:50 +00:00
rUv	a213303f3a	Merge pull request #151 from ruvnet/claude/bitnet-ruvllm-research-Cz4Ot docs: Add ADR-017 and DDD for Craftsman Ultra 30b 1bit BitNet integration	2026-02-08 11:55:15 -05:00
rUv	2c95c8085f	docs(ruvector-crv): add README for crates.io Adds comprehensive README documenting the 6-stage CRV signal line methodology mapping to ruvector subsystems. Bumped to v0.1.1. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-08 16:53:49 +00:00
rUv	a8e019f9f3	chore: add version specifications for crates.io publishing Updated Cargo.toml files to specify explicit version requirements for path dependencies, enabling successful publishing to crates.io. Published crates: - ruvector-temporal-tensor v2.0.1 - ruvector-core v2.0.1 - ruvector-gnn v2.0.1 - ruvector-raft v2.0.1 - ruvector-cluster v2.0.1 - ruvector-replication v2.0.1 - ruvector-graph v2.0.1 - ruvector-mincut v2.0.1 - ruvector-crv v0.1.0 - rvlite v0.3.0 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-08 16:51:20 +00:00
github-actions[bot]	3d56868f28	chore: Update NAPI-RS binaries for all platforms Built from commit `1f3e643e2c` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-08 16:44:32 +00:00
rUv	e9c8e11a6c	Merge pull request #156 from ruvnet/claude/temporal-tensor-compression-adr-Lc5Do docs: Add ADR-018 through ADR-023 and DDD for temporal tensor store	2026-02-08 11:40:16 -05:00
Claude	f73f13c08a	feat: Add benchmarks for new features + persistence integration tests Benchmarks (store.rs, 8 new bench tests): - Batch scoring 10k blocks vs individual scoring - 5-bit and 7-bit dequant fast paths (4096 values) - 5-bit quantize fast path (4096 values) - SVD adaptive rank selection (64x64 matrix) - format_report and format_json throughput - MetricsSeries trend computation (100 snapshots) Persistence tests (10 tests, feature-gated): - FileBlockIO: write/read, multi-tier, delete, overwrite, missing key - FileMetaLog: append/get, upsert, iter, missing key, multi-block 354 tests pass (with --features persistence). https://claude.ai/code/session_01Ksy165BL5nGpVoWaAfTE7t	2026-02-08 04:56:40 +00:00
Claude	8fa851c917	feat: Wire coherence gate, epoch tracker, and metrics series into TieredStore - Add CoherenceCheck, EpochTracker, MetricsSeries fields to TieredStore - put() now records write epochs for staleness detection - tick() auto-records metrics snapshots for trend analysis - Add enable_coherence()/disable_coherence() + accessor methods - Add coherence_check() convenience method on TieredStore - 4 new integration tests verify wiring https://claude.ai/code/session_01Ksy165BL5nGpVoWaAfTE7t	2026-02-08 04:50:33 +00:00
Claude	ed8f369585	feat: Implement all 11 temporal tensor improvements - 5-bit quantize fast path (8 values → 5 bytes, no bit accumulator) - 5-bit + 7-bit dequant fast paths (8-at-a-time byte extraction) - Batch scoring: compute_scores_batch, choose_tiers_batch, score_and_partition, top_k_coldest with partial sort - SVD improvements: reconstruction_error, energy_captured, compression_ratio, from_data_adaptive (auto-rank selection) - Metrics dashboard: format_report, format_json, health_check with StoreHealthStatus, MetricsSeries with trend analysis - Core trait integration (TensorStore, TensorStoreExt, TensorStoreSnapshot) - AgentDB adapter (PatternIndex, InMemoryPatternIndex, AdaptiveTiering) - Coherence gate (CoherenceCheck, EpochTracker, verify_put) - Persistence layer (FileBlockIO, FileMetaLog, feature-gated) - Stress/fuzz tests (8 adversarial scenarios) - WASM FFI end-to-end test (feature-gated behind ffi) 306 tests pass (257 unit + 12 integration + 11 benchmarks + 14 property + 8 stress + 4 doctests). https://claude.ai/code/session_01Ksy165BL5nGpVoWaAfTE7t	2026-02-08 04:34:12 +00:00
Claude	fb56bd45e1	perf: Optimize quantizer + store; add comprehensive tests - Eliminate round() call in quantize hot path (1.8x speedup) - Add 3-bit dequant fast path (8-values-from-3-bytes, 2.4x speedup) - Wire WitnessLog into TieredStore (put/get/evict audit trail) - Add TieredStore.metrics() for aggregate store statistics - Add TieredStore.witness_log() accessors - Update store.get() to accept `now` tick for access tracking - 14 property-based tests (roundtrip, bitpack, segment, delta, f16, score monotonicity, extreme values, compression ratio, determinism) - 11 end-to-end integration tests (lifecycle, delta chain, quality sweep, persistence, eviction, checksum, multi-tensor, stress, compressor-to-store, factor reconstruction, witness logging) Benchmarks (4096-element tensors, release mode): 8-bit quantize: 10,745 ns (1.52 GB/s) 8-bit dequant: 992 ns (16.52 GB/s) 3-bit dequant: 2,998 ns (5.46 GB/s) Zipf P95 read: 41 ns Tier flip rate: 0.074/block/min (threshold: 0.1) All 204 tests pass. https://claude.ai/code/session_01Ksy165BL5nGpVoWaAfTE7t	2026-02-08 03:57:57 +00:00
Claude	bc745b6115	feat: Implement temporal tensor store with block-based tiered compression Implements the block-based storage engine specified in ADR-018 through ADR-023 with 5 new modules and 1 benchmark/test suite. New modules: - store.rs (1056 lines): BlockKey, BlockMeta, Tier, TieredStore with HashMap index, per-tier data storage, CRC32 checksums, eviction, and BlockIO/MetaLog/Clock traits - tiering.rs (846 lines): EMA + popcount + recency scoring with LUT-based fast_exp_neg, hysteresis, min_residency, budgeted maintenance, MigrationCandidate selection, warm aggressive mode (7->5 bit) - delta.rs (825 lines): Sparse delta format (u16 index + i16 value), DeltaChain with bounded length and compaction, FactorSet for low-rank reconstruction, encode/decode serialization - metrics.rs (770 lines): WitnessLog (ring buffer), WitnessEvent enum (Access, TierChange, Eviction, Maintenance, Compaction, etc.), StoreMetrics aggregates, StoreSnapshot serialization - store_ffi.rs (680 lines): tts_init/put/get/tick/stats/touch/evict WASM exports with u128 split into hi/lo u64, feature-gated Optimizations: - 8-bit fast path in quantizer: direct byte read/write, no bit accumulator. Dequant: 7313ns -> 1290ns (5.7x faster, 12.7 GB/s) - 8-bit fast path in bitpack: direct copy, no accumulator. Pack: 8484ns -> 742ns (11.4x), Unpack: 8845ns -> 396ns (22.3x) - #[inline] on hot functions Benchmark results (release, 16KB blocks): Quantize 8-bit: 18.9us Dequant 8-bit: 1.3us (12.7 GB/s) Quantize 3-bit: 22.5us Dequant 3-bit: 7.2us (2.3 GB/s) Score compute: 10ns Single frame decode: 178ns Segment 8-bit decode: 1.5us (11.2 GB/s) Zipf P95 read: 48ns Tier flip rate: 0.074/block/min Quality (all PASS): 8-bit: 0.39% max error 7-bit: 0.79% max error 5-bit: 3.33% max error 3-bit: 16.67% max error Tests: 170 unit + 12 integration/benchmark, all passing. https://claude.ai/code/session_01Ksy165BL5nGpVoWaAfTE7t	2026-02-08 03:18:51 +00:00
Claude	5b1bf32ea7	docs: Add ADR-018 through ADR-023 and DDD for temporal tensor store Extends ADR-017 with detailed architecture for block-based temporal tensor compression. Introduces 6 new ADRs covering storage engine, tiered quantization formats, temporal scoring algorithm, delta compression/reconstruction, WASM API, and benchmarking criteria. Adds comprehensive DDD with 5 bounded contexts mapping to the proposed 6-crate workspace layout. ADR-018: Block-based storage engine with BlockKey/BlockMeta model, append-only MetaLog, tiered data files, CRC32 checksums ADR-019: 8/7/5/3-bit quantization formats with two-level scale, Tier0 compression-to-zero, codec_bits bit packing ADR-020: EMA + popcount + recency scoring, hysteresis, budgeted maintenance ticks, fast exp approximation ADR-021: Read/write paths, sparse delta format, delta chain compaction, factor-based reconstruction policies ADR-022: WASM exports (tts_init/put/get/tick/stats), host-imported BlockIO, cross-platform strategy (native/Node/browser/edge) ADR-023: Zipf simulation acceptance test (1M blocks, 10M accesses), microbench targets, failure mode catalog DDD: 5 bounded contexts (Block Management, Quantization, Temporal Scoring, Storage Engine, Delta & Reconstruction) with aggregate roots, domain events, repository interfaces, and context map. Total: 8,084 lines across 7 documents. https://claude.ai/code/session_01Ksy165BL5nGpVoWaAfTE7t	2026-02-08 02:12:38 +00:00
Claude	aacbdec281	feat: Phase 2 cross-module discoveries — 6 new experiments, all validated Discovery 5: Time-dependent disambiguation (decay + interference) Faster-decohering meaning loses embedding structure over time, shifting which meaning wins. "Financial" starts dominant but "river" takes over as financial embedding decoheres faster. Discovery 6: QEC on swarm reasoning chains (reasoning_qec + swarm) Syndrome bits map to agent boundaries. Fired syndromes indicate where adjacent agents disagree, enabling targeted identification of incoherent reasoning steps. Discovery 7: Counterfactual search explanation (collapse + reversible) Removing each gate and measuring divergence reveals which operation was most responsible for a search result. Ry gate (divergence=0.45) vs identity-like gate (divergence=0.0). Discovery 8: Syndrome-diagnosed swarm health (diagnosis + swarm) Syndrome extraction localizes faults to the disruptor's neighborhood. Low-health agent creates structural vulnerability that propagates through connected components. Discovery 9: Decoherence as differential privacy (decay + collapse) Light noise (0.01): preserves top results, divergence=0.12, entropy=1.44 Heavy noise (1.0): randomizes results, divergence=0.61, entropy=2.07 Calibrated decoherence provides tunable privacy for embedding search. Discovery 10: Full 4-module pipeline (decay→interfere→collapse→QEC) Fresh knowledge (fidelity=0.99): correct results, 0 QEC syndromes Stale knowledge (fidelity=0.28): corrupted results, QEC detects degradation Pipeline degrades gracefully with automatic reliability signaling. 105 total tests: 57 lib + 42 Phase 1 integration + 6 Phase 2 discoveries https://claude.ai/code/session_01B1NkbLDWYPaacS9miKsnvW	2026-02-06 16:25:02 +00:00
Claude	c0b4f90b7b	docs: Add ADR-QE-014 documenting exotic quantum-classical discoveries Documents 4 validated Phase 1 discoveries: - Decoherence trajectory fingerprinting (clustering without similarity) - Interference-based polysemy resolution (microsecond disambiguation) - Counterfactual dependency mapping (pipeline importance scoring) - Phase-coherent swarm coordination (quality > headcount) Outlines 8 Phase 2 hypotheses for cross-module experiments including time-dependent disambiguation, QEC on swarm reasoning, counterfactual search explanation, and the full decohere-interfere-collapse-verify pipeline. https://claude.ai/code/session_01B1NkbLDWYPaacS9miKsnvW	2026-02-06 15:51:08 +00:00
Claude	c33ebf78f6	feat: Complete ruqu-exotic with all 8 modules, 99 tests passing, 4 discoveries Add reversible_memory.rs: time-reversible quantum state with gate inversion, rewind, counterfactual analysis, and sensitivity analysis. Add reality_check.rs: browser-native verification circuits for superposition, entanglement, interference, phase kickback, and no-cloning theorem. Add comprehensive integration test suite (42 tests) covering all 8 exotic modules plus 4 cross-module discovery experiments: - Decoherence trajectory fingerprinting (similar embeddings decohere similarly) - Interference-based polysemy resolution (context resolves word meanings) - Counterfactual dependency mapping (identify critical vs redundant operations) - Swarm phase alignment (phase-coherent agents outperform count-based voting) Fix flaky unit tests in quantum_decay and quantum_collapse modules. 99 total tests: 57 lib + 42 integration, all passing. https://claude.ai/code/session_01B1NkbLDWYPaacS9miKsnvW	2026-02-06 14:41:20 +00:00
Claude	b289e69032	feat: Add 4 exotic quantum-classical modules (decay, interference, QEC reasoning, syndrome diagnosis) quantum_decay: Embeddings decohere via quantum noise channels instead of TTL deletion. Phase fidelity degrades smoothly before magnitude, giving structured forgetfulness. interference_search: Concepts represented as amplitude superpositions. Retrieval uses constructive/destructive interference in complex space instead of cosine similarity reranking. reasoning_qec: Reasoning steps encoded as qubits with repetition-code syndrome extraction. Detects structural incoherence in reasoning traces via parity checks between adjacent steps. syndrome_diagnosis: System components mapped to a quantum graph. Fault injection + syndrome extraction localizes fragile components and identifies fault propagation paths. https://claude.ai/code/session_01B1NkbLDWYPaacS9miKsnvW	2026-02-06 14:04:49 +00:00
Claude	99fdbf4a4c	feat(wip): Add ruqu-exotic crate scaffold with quantum collapse and swarm interference Initial scaffold for 8 exotic quantum-classical hybrid algorithms: - Quantum-shaped memory decay (embeddings decohere instead of deletion) - Interference-based concept disambiguation (amplitude-space retrieval) - Quantum-driven search collapse (superposition → measurement retrieval) - Quantum-modulated agent swarms (interference instead of voting) - Error-corrected reasoning traces (QEC on reasoning steps) - Syndrome-based AI self diagnosis (fault localization via syndromes) - Time-reversible memory (counterfactual debugging) - Browser-native quantum reality checks (verification circuits) Includes complete implementations for quantum_collapse and swarm_interference modules. Remaining modules being implemented by concurrent agents. https://claude.ai/code/session_01B1NkbLDWYPaacS9miKsnvW	2026-02-06 02:26:33 +00:00
github-actions[bot]	586529e74f	chore: Update NAPI-RS binaries for all platforms Built from commit `c66dd1de5a` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-06 02:15:25 +00:00
rUv	354391f2fa	Merge pull request #153 from ruvnet/claude/temporal-tensor-compression-1r58N	2026-02-06 03:10:27 +01:00
Claude	2782fc6305	docs: Add ADR-QE-013 Deutsch's theorem proof with historical comparison Complete proof of Deutsch's theorem with phase kickback lemma and step-by-step derivation. Compares five major formulations: - Deutsch (1985): original probabilistic version (p=1/2) - Deutsch-Jozsa (1992): deterministic n-bit, 2 queries - Cleve-Ekert-Macchiavello-Mosca (1998): deterministic, single query - Nielsen-Chuang (2000): canonical textbook presentation - Calude (2006): de-quantization using higher-dimensional classical bits Includes de-quantization critique (Abbott et al.), classical wave analogies, and analysis of when quantum advantage is genuine vs artifactual. Adds 6 verification tests to ruqu-algorithms confirming all four oracles produce deterministic correct results via the ruqu-core simulator, including a phase-kickback amplitude-level check. https://claude.ai/code/session_01B1NkbLDWYPaacS9miKsnvW	2026-02-06 02:00:35 +00:00
Claude	d453d2da86	feat: Implement quantum simulation engine (ruqu-core, ruqu-algorithms, ruqu-wasm) Full Rust implementation of the quantum simulation engine as specified in ADR-QE-001 through ADR-QE-012: ruqu-core: State-vector simulator with 2^n complex amplitudes, single and two-qubit gate kernels (H, X, Y, Z, S, T, Rx, Ry, Rz, CNOT, CZ, SWAP, Rzz), projective measurement with collapse, expectation values for Pauli strings and Hamiltonians, gate fusion optimizer, circuit builder API, and multi-shot simulator with noise model support. ruqu-algorithms: VQE with hardware-efficient ansatz and parameter-shift gradients, Grover's search with optimal iteration count, QAOA MaxCut with Rzz phase separation, and distance-3 rotated surface code with syndrome extraction and lookup decoder. ruqu-wasm: WebAssembly bindings via wasm-bindgen exposing circuit construction, simulation, Grover search, and QAOA to browser clients with 25-qubit memory limit. 257 tests passing across all crates. Criterion benchmarks included for gate throughput, bell state preparation, algorithm scaling, and memory allocation across 4-20 qubit systems. https://claude.ai/code/session_01B1NkbLDWYPaacS9miKsnvW	2026-02-06 01:24:14 +00:00
Claude	1ac58b38cf	docs: Add README for ruvector-temporal-tensor crate Plain-language introduction explaining what temporal tensor compression does and why it matters, feature tables, Quick Start with 4 code examples (basic, streaming, random-access, custom policy), full API reference, segment binary format spec, FFI/WASM guide, and build instructions. https://claude.ai/code/session_01U63xtGd5Q8mUevyY7nUSfJ	2026-02-06 01:01:48 +00:00
Claude	fbcc13c5f8	docs: Polish temporal tensor crate with clippy fixes, docs, and utilities - Fix all clippy warnings: module-level //! docs, .div_ceil(), is_empty() - Optimize segment::decode to call dequantize_f32 directly (skip legacy wrapper) - Add decode_single_frame() for random-access frame decoding - Add compression_ratio() utility for segment inspection - Add comprehensive doc-examples with 3 tested examples in lib.rs - Fix HEADER_SIZE offset bug in decode_single_frame (22 vs 26) - All 41 unit tests + 3 doc-tests pass, 0 clippy warnings https://claude.ai/code/session_01U63xtGd5Q8mUevyY7nUSfJ	2026-02-06 00:50:48 +00:00
Claude	b200b0cb47	perf: Optimize temporal tensor compression hot paths Key optimizations: - Eliminate per-value modulo/division in dequantize by restructuring to iterate by frame→group→element (was: flat index with val_idx % tensor_len) - Cache f32 scales in TemporalTensorCompressor to avoid repeated f16→f32 conversion on every push_frame (drift check + quantization) - Add optimized _f32 API variants (frame_fits_scales_f32, quantize_and_pack_f32, dequantize_f32) that accept pre-converted scales - Pre-reserve Vec capacity in quantize_and_pack (avoids reallocations) - Add #[inline] on qmax_from_bits, f32_to_f16_bits, f16_bits_to_f32 - Use chunks() iterator instead of manual index tracking in compute_scales New tests (41 total, up from 33): - Roundtrip tests for 5-bit and 7-bit quantization - Non-finite value handling (NaN, Inf, -Inf) - Single-element group edge case - Compression ratio validation for all tiers - Cold-tier (3-bit) full roundtrip - Large tensor multi-group (512-dim, 50 frames) - Accessor method coverage https://claude.ai/code/session_01U63xtGd5Q8mUevyY7nUSfJ	2026-02-06 00:40:36 +00:00
Claude	501e02d753	feat: Add quantum simulation engine ADR series (QE-001 to QE-012) and DDD design documents Comprehensive architecture decision records and domain-driven design documentation for integrating a Rust-based quantum simulation engine (ruQu) into the ruVector stack. ADR Series (12 documents): - QE-001: Core Architecture - pure Rust state-vector simulator decision - QE-002: Crate Structure - three-crate architecture (ruqu-core, ruqu-wasm, ruqu-algorithms) - QE-003: WASM Compilation - WebAssembly strategy with 25-qubit limit enforcement - QE-004: Performance Optimization - SIMD, multithreading, gate fusion, benchmarks - QE-005: VQE Algorithm - variational eigensolver with exact expectation values - QE-006: Grover Search - O(1) oracle optimization via direct state vector access - QE-007: QAOA MaxCut - graph-based optimization with Rzz native gates - QE-008: Surface Code Error Correction - mid-circuit measurement, syndrome extraction - QE-009: Tensor Network Evaluation - MPS/contraction for shallow circuits - QE-010: Observability & Monitoring - metrics, tracing, health checks integration - QE-011: Memory Gating & Power Management - zero-idle, on-demand allocation - QE-012: Min-Cut Coherence Integration - syndrome-to-decoder bridge with ruQu DDD Design (3 documents): - Strategic Design: 6 bounded contexts, context map, ubiquitous language - Tactical Design: 6 aggregates, 20+ value objects, 15+ domain events, services - Integration Patterns: anti-corruption layers, shared kernel, event flows https://claude.ai/code/session_01B1NkbLDWYPaacS9miKsnvW	2026-02-06 00:39:39 +00:00
Claude	61d91a0ef9	feat: Add ADR-017 temporal tensor compression with tiered quantization Introduces a complete temporal tensor compression system with: - ADR-017: SOTA research-backed architecture decision record covering groupwise symmetric quantization, temporal segment reuse, access-pattern driven tier selection (8/7/5/3 bit), and WASM-compatible design - ruvector-temporal-tensor crate (zero external dependencies): - tier_policy: Score-based hot/warm/cold bit-width selection - f16: Software IEEE 754 half-precision conversion - bitpack: Arbitrary bit-width stream packing (no alignment waste) - quantizer: Groupwise symmetric quantization with f16 scales - segment: Binary segment format (TQTC) encode/decode - compressor: Temporal segment manager with drift detection - ffi: WASM/C FFI with handle-based resource management - ruvector-temporal-tensor-wasm crate for wasm32 targets - 33 passing unit tests covering all modules Compression targets: 4x (hot/8-bit), 4.57x (warm/7-bit), 6.4x (warm/5-bit), 10.67x (cold/3-bit) vs f32 baseline. https://claude.ai/code/session_01U63xtGd5Q8mUevyY7nUSfJ	2026-02-06 00:28:21 +00:00
Claude	30cd63958e	feat: Integrate ExpertPredictor prefetch, CompressedMlaCache, and E2E tests - Wire ExpertPredictor into MoE forward path: predicts likely-next experts from routing history and issues software prefetch hints (volatile read of first cache line of predicted expert gate_proj weights) before routing runs - Rebuild predictor every 16 tokens from routing history (amortized cost) - Fix routing history tracking to target first MoE layer (config.first_k_dense_replace) instead of hardcoded layer_idx==0 (layer 0 is Dense in GLM-4.7-Flash) - Integrate CompressedMlaCache as configurable mode (set_compressed_kv): stores only c_kv + k_pe (576 dims) instead of full K/V (10240 dims) per position (~17.8x memory reduction), recomputing K_nope and V during attention - Add mla_caches field initialized per-layer in load_gguf(), cleared in reset_cache() - Add 13 new tests (216 total, all passing): - E2E: forward produces logits, forward_token with KV cache, determinism, different tokens give different logits, expert predictor builds from inference, cache reset, compressed KV toggle, scratch pool allocation - Benchmarks: forward_token throughput, TL1 GEMV dispatch, RMSNorm, softmax, expert_forward performance https://claude.ai/code/session_011nTcGcn49b8YKJRVoh4TaK	2026-02-04 07:43:37 +00:00
Claude	133792a51e	perf: Ultra-optimize BitNet inference backend with SIMD dispatch, fused SwiGLU, and zero-alloc paths - Wire AVX2 TL1 GEMV SIMD dispatch into backend hot path via tl1_avx2 module with scalar LUT fallback for non-x86_64 platforms - Add ScratchPool with 17 pre-allocated FP32 buffers for zero-alloc forward pass - Fuse SwiGLU gate+up projections with 4-wide unrolled loop and unsafe indexing - Optimize RMSNorm with 4-way parallel accumulator and fused scale pass - Optimize softmax with reciprocal multiply instead of per-element division - Optimize fp32_matvec_transposed with 4-wide unrolled dot product - Optimize GQA attention with 4-wide unrolled score computation and skip for negligible weights - Add routing history tracking via Mutex<Vec<Vec<usize>>> for expert prediction (interior mutability preserves LlmBackend Send+Sync trait compatibility) - Pre-allocate KV caches (512 positions) in load_gguf() - Add tl1_gemv_into() for zero-allocation GEMV into caller-provided buffers - All 203 bitnet tests pass https://claude.ai/code/session_011nTcGcn49b8YKJRVoh4TaK	2026-02-04 07:12:49 +00:00
github-actions[bot]	2c8e8a4549	chore: Update NAPI-RS binaries for all platforms Built from commit `987f822efc` Platforms updated: - linux-x64-gnu - linux-arm64-gnu - darwin-x64 - darwin-arm64 - win32-x64-msvc 🤖 Generated by GitHub Actions	2026-02-04 07:01:21 +00:00
rUv	98d0494f26	Merge pull request #149 from ruvnet/claude/crv-ruvector-integration-Fbl4V	2026-02-04 12:26:44 +05:30
Claude	971c1f9bf3	chore: Update reasoning bank patterns cache https://claude.ai/code/session_011nTcGcn49b8YKJRVoh4TaK	2026-02-04 05:54:40 +00:00
Claude	f7c36764e5	feat: Add streaming generation, predictive expert prefetcher, and compressed MLA KV cache - Streaming generation API (generate_streaming) with per-token callback, early stopping, and GenerationStats for throughput metrics - ExpertPredictor: transition-matrix based predictor that learns from routing history to predict next experts with Laplace smoothing - CompressedMlaCache: stores compressed latents (c_kv + k_pe) instead of full K/V, achieving ~17.8x memory reduction for GLM-4.7-Flash - 15 new tests (203 total bitnet tests, all passing) https://claude.ai/code/session_011nTcGcn49b8YKJRVoh4TaK	2026-02-04 05:53:56 +00:00
Claude	4ae1ad9182	feat: Add GLM-4.7-Flash GGUF tensor mapping, MLA attention, and model validation - TensorNameMapper resolves both llama.cpp (blk.) and HuggingFace (model.layers.) naming - MLA (Multi-Head Latent Attention) with low-rank Q/KV compression (DeepSeek-V2 style) - Stacked 3D expert tensor support (ffn_gate_exps → per-expert slicing) - Shared expert + dense layer-0 support (MoeWithShared/Dense/Moe layer types) - Updated BitNetModelConfig defaults to match GLM-4.7-Flash architecture - Tensor discovery and model validation harness for GGUF files - 188 passing tests (14 new) https://claude.ai/code/session_011nTcGcn49b8YKJRVoh4TaK	2026-02-03 18:00:17 +00:00
Claude	acfb352f40	feat: Add real attention, KV cache, RoPE, and tokenizer to BitNet backend Resolves the three blocking gaps that prevented end-to-end inference: 1. Real attention layer (was pass-through placeholder): - AttentionWeights struct with Q/K/V/O ternary projections - GQA (Grouped Query Attention) with configurable num_heads / num_kv_heads - Pre-computed RoPE cos/sin tables (apply_rope) - Per-layer KV cache for autoregressive generation - forward_token() for efficient single-token inference with cache - forward_layer_cached() with full attention computation - forward_layer_nocache() legacy path for backwards compatibility 2. Tokenizer integration (was raw bytes → token IDs): - load_tokenizer_from_gguf() extracts vocab + merges from GGUF metadata - Byte-level fallback tokenizer (260 tokens) when GGUF has no vocab - TokenizerBridge implements crate-level Tokenizer trait - tok() accessor for direct tokenizer access 3. generate() uses tokenizer (was returning [token_id] strings): - Encodes prompt via BPE tokenizer before forward pass - Decodes generated tokens back to text - generate_cached() for KV-cached autoregressive generation - get_embeddings() now uses tokenizer for text encoding - reset_cache() to clear KV state between sequences Tests: 174/174 bitnet tests pass (9 new: RoPE, KV cache, tokenizer roundtrip, attention weights, byte-level fallback, cache operations) https://claude.ai/code/session_011nTcGcn49b8YKJRVoh4TaK	2026-02-03 17:39:58 +00:00
Claude	c85ba8a498	feat: Add appliance-optimized RLM embedder (Pi 5 + STM32 offload) Implements AD-25 appliance deployment optimizations for the RLM recursive sentence transformer embedder targeting Raspberry Pi 5 + 7 STM32 coprocessors: - Pi 5 config presets: pi5_optimized() (2-iter, 3-neighbor) and pi5_streaming() (1-iter) - STM32 offload protocol: ComputeHash, FilterNeighbors, GateCheck, WatchdogPing, ScheduleReorder - NullStm32 software fallback for development/cloud environments - Batch embedding with per-chunk latency tracking and STM32 gate-checking - Priority-scheduled batch embedding via STM32-driven reordering - HashEmbedder: lightweight FNV-1a pseudo-embedder for testing/baseline - FlatNeighborStore: in-memory neighbor retriever for small corpora (<100K chunks) - EmbedderBenchmark: throughput, P95/P99 latency, peak memory reporting - NEON-optimizable math: 4-element unrolled cosine_similarity, l2_normalize - vec_accumulate_weighted and mean_embedding helpers - 41 tests (27 new): STM32 protocol, batch, HashEmbedder, FlatNeighborStore, benchmark, integration All 165 bitnet module tests pass. https://claude.ai/code/session_011nTcGcn49b8YKJRVoh4TaK	2026-02-03 15:53:40 +00:00
Claude	767901ea79	feat: Add RLM embedder, tokenizer, eval gates, trace writer, and security hardening New modules (4 files, 2,359 lines): - rlm_embedder.rs (743L): RLM-style recursive sentence transformer with 3 variants (query-conditioned, corpus-conditioned, contradiction-aware twin), merge rule, BaseEmbedder/NeighborRetriever traits, 14 tests - tokenizer.rs (418L): BPE tokenizer with GGUF vocab loading, encode/decode, special token handling, 10 tests - trace.rs (554L): JSONL trace writer for routing, citation, refusal decisions, jaccard similarity, manual JSON serialization, 10 tests - eval.rs (644L): Three behavioral gates (routing correctness >= 0.85, citation precision >= 0.90, refusal F1 >= 0.85), EvalSuite, 12 tests Documentation: - AD-24: RLM-Style Recursive Sentence Transformer Embedder — 3 variants, merge rule, training strategy, evaluation criteria, appliance fit - DDD v2.6: 8 new ubiquitous language terms, 4 new open questions (#31-34) - 3 new positive consequences (#31-33) for RLM embeddings Security hardening (across 6 existing files): - Path traversal validation in GGUF export - Division-by-zero epsilon guards in quantizer - Bounds validation on public function inputs - NaN-safe softmax with -inf handling 138 tests pass, 0 compilation errors. Total bitnet module: 9,632 lines across 16 files. https://claude.ai/code/session_011nTcGcn49b8YKJRVoh4TaK	2026-02-03 15:40:59 +00:00
Claude	11e1e3af95	feat: Add AD-23 Phase-1 distillation, expert cache, and DDD updates AD-23: Phase-1 Distillation via External GPU Teacher Artifacts - One-time GPU job produces behavioral artifacts (routing traces, sparse logits, preference labels) — not trained weights - CPU-only refinement: router repair, LoRA correction, EWC++, policy optimization using teacher artifacts - Acceptance criteria: 200-prompt suite, all 3 behavioral gates, stability under 10% corpus perturbation expert_cache.rs: MoE expert hot-set caching (new file) - ExpertCache with LRU/LFU/Adaptive eviction policies - MoeBatchScheduler: reorder token execution by expert for cache reuse - Prefetcher trait for future platform-specific prefetch intrinsics - 12 tests (92/92 bitnet tests pass) DDD v2.5: 6 new ubiquitous language terms (Teacher Artifact, Behavioral Distillation, Router Repair, Sparse Logits, Corpus Perturbation) and 4 new open questions (#27-30) for Phase-1 operability. https://claude.ai/code/session_011nTcGcn49b8YKJRVoh4TaK	2026-02-03 15:12:33 +00:00
Claude	cf8daa60c3	docs: Add AD-22 evaluation infrastructure and behavioral gates Defines three ship/no-ship gates: - Gate 1: Routing correctness (>= 85% teacher agreement) - Gate 2: Citation correctness (precision >= 90%, recall >= 70%) - Gate 3: Refusal calibration (F1 >= 0.85) Includes JSONL trace schema, auto-labeling strategy using RuVector signals (redundancy, cluster disagreement, mincut fragility), and go/no-go rule requiring all gates to pass on same prompt suite run. https://claude.ai/code/session_011nTcGcn49b8YKJRVoh4TaK	2026-02-03 14:57:32 +00:00
Claude	58806480b3	fix: Polish AVX2 and WASM SIMD128 kernel variants Agent refinements to tl1_avx2.rs and tl1_wasm.rs — cleanup of unused imports and linter warnings. https://claude.ai/code/session_011nTcGcn49b8YKJRVoh4TaK	2026-02-03 14:42:30 +00:00
Claude	395e214106	feat: Implement BitNet inference stack — TL1 kernel, backend, GGUF export, RLM refiner Phase 0 + 0.5 implementation (4,283 lines across 6 new files): - tl1_kernel.rs (879L): TL1 ternary GEMV with NEON SIMD + scalar fallback, INT8 activation quantization (absmax), LUT generation, 17 tests - backend.rs (1,179L): Full BitNetBackend implementing LlmBackend trait, GGUF model loading, MoE router (softmax gate + top-K), expert FFN (SwiGLU via TL1 GEMV), RMSNorm, embedding/LM head, 12 tests - gguf_export.rs (662L): GGUF v3 writer for BITNET_T158, FP16 conversion, model export with BitNet metadata, validation, 8 tests - rlm_refiner.rs (696L): Phase 0.5 orchestrator wiring MicroLoRA + EWC++ + GRPO + ContrastiveTrainer, SIMD-only mode (AD-20), checkpointing, 10 tests - tl1_avx2.rs (414L): AVX2 SIMD kernel variant (x86_64 conditional) - tl1_wasm.rs (453L): WASM SIMD128 kernel variant (wasm32 conditional) All 72 bitnet tests pass. Fixed 2 pre-existing compilation errors in autodetect.rs and kernels/mod.rs. https://claude.ai/code/session_011nTcGcn49b8YKJRVoh4TaK	2026-02-03 14:34:37 +00:00

1 2 3 4 5 ...

823 commits