Reuven
bcecd1d904
fix(ruvllm): apply security and performance optimizations to MoE routing
...
HIGH severity security fixes:
- router: Change new() from panic to Result<Self, &'static str>
- router: Change with_default_affinity() to return Result
- precision_allocator: Change new() to return Result, add new_unchecked()
- sram_mapper: Change assign_tier() from assert! to returning bool
MEDIUM severity security fixes:
- router: Add NaN/Inf validation in apply_cache_bonus_inplace()
- router: Handle NaN in select_top_k(), treat as NEG_INFINITY
- affinity: Add NaN handling in top_k_by_affinity() with deterministic tie-breaking
- affinity: Add NaN handling in least_affinity() for eviction decisions
- sram_mapper: Fix division by zero in priority_score() when last_access=0
P0 performance optimizations:
- router: Add apply_cache_bonus_inplace() to avoid allocation in hot path
- router: Use select_nth_unstable_by for partial sort when k << n (O(n) vs O(n log n))
All 103 tests pass (84 unit + 19 integration).
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-12 15:25:36 -04:00
Reuven
830fa5c4ed
feat(ruvllm): implement ADR-092 MoE Memory-Aware Routing
...
Implements memory-aware expert routing with cache residency bonus:
## New moe/ Module (5 files, ~4,300 lines)
- router.rs: MemoryAwareRouter with cache bonus (0.15 default)
- INV-6 compliant (deterministic tie-breaking)
- PagingRequest generation for non-resident experts
- affinity.rs: EMA-based expert affinity tracking
- INV-2 compliant (monotonic decay without activation)
- top_k_by_affinity() for prefetch predictions
- precision_allocator.rs: Hot/warm/cold precision assignment
- Frequency-based percentile thresholds
- GGUF format mapping (Q4_K_M, Q3_K, Q2_K)
- sram_mapper.rs: Hardware memory hierarchy config
- Presets: RPi5, Mobile, Desktop, WasmBrowser
- Tier assignment (SRAM/DRAM/Storage)
- metrics.rs: MoE routing metrics tracking
- Cache hit rate, paging latency, prefetch accuracy
## Extended bitnet/expert_cache.rs
- suggest_eviction_with_affinity(): Combined LRU/LFU + affinity
- prefetch_by_affinity(): Affinity-based expert prefetching
- hot_experts(): List currently cached experts
## Tests (131 total)
- 86 MoE unit tests
- 19 integration tests (GATE-1 through GATE-4 validation)
- 26 ExpertCache tests
## Benchmarks (9 suites)
- Routing overhead: ~22 ns (target: ≤15 μs) ✅
- Cache hit rate simulation
- Affinity update, precision allocation
Target: ≥70% cache hit rate vs 34% baseline
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-12 15:00:59 -04:00
Reuven
9f80f7298f
feat(ruvector-cnn): implement ADR-091 INT8 CNN quantization
...
Complete implementation of INT8 quantization for ruvector-cnn:
Phase 1 - Core Infrastructure:
- QuantizationParams, QuantizationScheme, QuantizationMode
- QuantizedTensor<i8> with quantize/dequantize methods
- CalibrationMethod (MinMax, Percentile, MSE, Entropy)
- 34 unit tests passing
Phase 2 - INT8 Kernels:
- Scalar reference: conv2d, depthwise_conv2d, matmul, requantize
- AVX2 SIMD: _mm256_maddubs_epi16 for 2-4x speedup
- ARM NEON: vmull_s8, vpadalq_s16 for 2-3x speedup
- WASM SIMD128: i8x16 operations for 1.5-2x speedup
Phase 3 - Graph Rewrite Passes:
- GR-1: BatchNorm fusion into Conv weights
- GR-2: Zero-point correction pre-computation
- GR-3: Q/DQ node insertion at FP32/INT8 boundaries
- GR-4: ReLU/HardSwish fusion with LUT
Phase 4 - Quantized Layers:
- QuantizedConv2d with per-channel quantization
- QuantizedDepthwiseConv2d for MobileNet
- QuantizedLinear for FC layers
- QuantizedMaxPool2d/AvgPool2d
- QuantizedResidualAdd with scale alignment
Phase 6 - Tests & Benchmarks:
- quality_validation.rs: cosine similarity ≥0.995
- acceptance_gates.rs: 7 ADR-091 gates
- kernel_equivalence.rs: SIMD vs scalar validation
- int8_bench.rs: Criterion benchmarks
Performance targets:
- 2.5x latency improvement (MobileNetV3)
- 4x memory reduction
- <1% accuracy degradation
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-12 14:45:52 -04:00
github-actions[bot]
b376344cff
chore: Update NAPI-RS binaries for all platforms
...
Built from commit e9f32ff9a8
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-12 18:11:09 +00:00
Reuven
e9f32ff9a8
Merge branch 'feat/adr-090-ultra-low-bit-qat'
2026-03-12 14:04:51 -04:00
Reuven
f7942f91c1
refactor(ruvllm): remove unused NEON helper function
...
Remove neon_process_4_groups_ultra() which was superseded by the
optimized 8-group batching implementation with prefetching.
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-12 13:59:16 -04:00
Reuven
9a7d458d81
perf(ruvllm): optimize pi-quantization SIMD kernels
...
- Add AVX-512 dequantization kernel (16-wide SIMD, target >12 GB/s)
- Add AVX2 quantization kernel (8-wide SIMD) for forward pass
- Add AVX2 2-bit quantization kernel
- Optimize NEON kernel with prefetching and 8-group batching
- Add inline assembly prefetch (prfm pldl1keep)
- Update benchmarks with new throughput tests
- All 77 tests pass (pi_quant: 35, simd_equivalence: 19, hadamard: 23)
Performance optimizations target ADR-090 requirements:
- Quantize throughput: >1 GB/s (was 467 MiB/s)
- NEON dequant: >10 GB/s (was 2.54 GiB/s)
- AVX-512 dequant: >12 GB/s (new)
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-12 13:57:04 -04:00
Reuven
8403d563df
feat(ruvllm): implement ADR-090 Ultra-Low-Bit QAT & Pi-Quantization
...
Phase 1-4 implementation of ADR-090 with 114 tests passing.
## Core Quantization (src/quantize/)
- pi_quant.rs: PiQuantizer with π/k step sizes, Pi3BitBlock, Pi2BitBlock
- pi_quant_simd.rs: NEON/AVX2/scalar dequantization kernels (2.1x speedup)
- hadamard.rs: Fast Walsh-Hadamard O(n log n), INV-4 orthogonality verified
- incoherence.rs: IncoherenceTransform for QuIP-style decorrelation
- quip.rs: Q2_QuIP variant combining incoherence + 2-bit K-quant
- security.rs: WeightIntegrity, GGUF validation, bounds checking
## QAT Infrastructure (src/qat/)
- config.rs: QatConfig, SteVariant, QuantGranularity with builder pattern
- ste.rs: Straight-through estimator (Standard, Clipped, LSQ, EWGS)
- differentiable_quant.rs: DifferentiableQuantizer trait, PiQuantDifferentiable
- calibration.rs: CalibrationEngine with mixed-domain support
- distillation.rs: Teacher-student composite loss (L_task + L_KD + L_reasoning)
- reasoning_loss.rs: Chain-of-thought fidelity preservation
- training_loop.rs: QatTrainer orchestrator with checkpointing
- lora_qat.rs: Memory-efficient LoRA-QAT (50 MB vs 114 GB for full QAT)
## WASM Integration (ruvllm-wasm/)
- pi_quant_wasm.rs: PiQuantWasm with SIMD128 kernel, JSON serialization
- quant_bench_wasm.rs: QuantBenchWasm for in-browser benchmarking
- Feature flags: pi-quant, qat
## Tests (114 passing)
- pi_quant_tests.rs (35): Round-trip, block packing, bounds checking
- hadamard_tests.rs (23): Orthogonality, invertibility, energy preservation
- ste_tests.rs (24): Gradient correctness, PyTorch reference comparison
- simd_equivalence_tests.rs (19): SIMD ≈ scalar within 1 ULP (INV-8)
- acceptance_gates.rs (13): G1-G5 quality and security gates
## Benchmarks (benches/pi_quant_bench.rs)
- Hadamard 4096: 5.3 μs (target <50 μs) ✓
- NEON dequant: 2.54 GiB/s (2.1x over scalar)
- QAT backward: 7.3 Gelem/s
## Invariants Verified
- INV-1: STE gradient flow
- INV-2: Scale positivity (α > 0)
- INV-3: Step size constraint (π/k)
- INV-4: Hadamard orthogonality
- INV-5: Calibration provenance
- INV-8: SIMD ≈ scalar (≤1 ULP)
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-12 12:36:36 -04:00
github-actions[bot]
1745d9054d
chore: Update NAPI-RS binaries for all platforms
...
Built from commit f091d45575
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-12 14:53:46 +00:00
Reuven
f091d45575
docs(adr): add governance improvements to ADR-090/091, create ADR-092, add implementation checklists
...
ADR-090 (Ultra-Low-Bit QAT):
- Changed status to "Accepted (Staged Implementation)"
- Added decision statement choosing LoRA-QAT as first path
- Added staged implementation phases (4 phases, explicit gates)
- Added validation plan defining "better" (MSE, spectral, cosine, outlier retention)
- Added reasoning preservation metrics (PPL, GSM8K, HumanEval, tool use, long context)
- Added system invariants (INV-1 through INV-8)
- Added acceptance gates (G1-G6) with rollback triggers
- Restructured success criteria into correctness/performance/quality/rollout
ADR-091 (INT8 CNN Quantization):
- Changed status to "Accepted"
- Added decision statement with acceptance benchmark
- Added system invariants (INV-1 through INV-8)
- Added operator coverage table (11 operators)
- Added graph rewrite passes section (4 passes)
- Added deployment policy matrix
- Added acceptance gates (7 gates) with rollback conditions
ADR-092 (MoE Memory-Aware Routing):
- Split from ADR-090 as routing affects scheduling/cache, not representation
- Added decision statement with acceptance benchmark (≥70% cache hit rate)
- Added system invariants (INV-1 through INV-6)
- Added acceptance gates (G1-G5) with rollback conditions
- Added domain analysis with bounded context
Implementation Checklists:
- ADR-090: 6 phases, ~28 files, 16 new + 12 extended
- ADR-091: 6 phases, acceptance gate verification commands
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-12 10:47:00 -04:00
github-actions[bot]
e6a5c710dc
chore: Update NAPI-RS binaries for all platforms
...
Built from commit cbd6b1402f
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-12 14:35:32 +00:00
github-actions[bot]
9ca5c30e83
chore: Update NAPI-RS binaries for all platforms
...
Built from commit a36fdc80cf
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-12 14:29:24 +00:00
Reuven
cbd6b1402f
docs(adr): add ADR-091 INT8 CNN Quantization DDD Architecture
...
Formalizes INT8 quantization for ruvector-cnn with DDD bounded contexts:
- Quantization Core: params, tensors, scale computation
- Calibration: statistics, histograms, MinMax/Percentile methods
- Inference: QuantizedConv2d, fused BatchNorm, INT8 ReLU
- SIMD Kernels: AVX2, NEON, WASM INT8 implementations
- Observability: benchmarks, accuracy validation
Targets 2-4x speedup over FP32 with <1% accuracy loss.
Related to ADR-090 (ultra-low-bit QAT for LLMs).
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-12 10:28:54 -04:00
github-actions[bot]
3dbcfe3864
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 3ed78842dd
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-12 14:27:36 +00:00
Reuven
a36fdc80cf
docs(cnn): add INT8 quantization design document
...
Add comprehensive design doc for INT8 quantization implementation
in ruvector-cnn, including calibration strategies and SIMD optimization.
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-12 10:22:35 -04:00
rUv
3ed78842dd
docs(research): add ultra-low-bit quantization & edge deployment research ( #255 )
...
* docs(research): add ultra-low-bit quantization & edge deployment research
Comprehensive research collection on 2-bit/3-bit quantization for ruvLLM:
- 01: Ultra-low-bit quantization survey (ICLR'26, QuIP, BitNet, I-quants)
- 02: Quantization-aware training (QAT) with reasoning preservation
- 03: QuIP 2-bit framework analysis (incoherence processing, E8 lattice)
- 04: MoE memory-aware routing for edge SRAM budgets
- 05: ruvLLM quantization architecture deep review and gap analysis
- 06: Rust implementation plan for 2-bit QAT pipeline (14-week roadmap)
- 07: Novel 3-int pi-constant quantization using irrational scaling
Key findings: ruvLLM has strong foundations (BitNet, K-quants, GGUF, KV cache)
but needs QAT training loop and differentiable quantization primitives.
Pi-constant scaling provides ~0.5 bit effective precision gain at 3-bit.
https://claude.ai/code/session_01E4pmfETYzknb1xq2dzCCaj
* docs(adr): add ADR-090 ultra-low-bit QAT & pi-quantization DDD architecture
Comprehensive architecture decision record for implementing 2-bit/3-bit
quantization-aware training in ruvLLM using Domain-Driven Design:
- 5 bounded contexts: Quantization Core, Training, MoE Routing, WASM Runtime, Observability
- Pi-constant quantization with irrational scaling (pi/k step sizes)
- QAT training loop with STE variants and LoRA-QAT lightweight path
- QuIP incoherence via fast Walsh-Hadamard (O(n log n))
- Memory-aware MoE routing with expert precision allocation
- WASM SIMD128 kernels reusing existing tl1_wasm.rs LUT pattern
- Security: weight integrity, GGUF validation, WASM sandbox
- Benchmarking: criterion suite with throughput/quality targets
- 14-week timeline, maps to 18 existing files for extension
Placed in docs/adr/ddd/ per DDD architectural pattern organization.
https://claude.ai/code/session_01E4pmfETYzknb1xq2dzCCaj
---------
Co-authored-by: Claude <noreply@anthropic.com>
2026-03-12 10:21:30 -04:00
github-actions[bot]
260b870312
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 014bf98ea2
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 23:36:01 +00:00
Reuven
014bf98ea2
feat(demo): add Self-Learning tab with 6 interactive training demos
...
- Live Classifier: Train custom classes with labeled examples, test classification
- Few-Shot Learning: 3-class system (A/B/C) with drag-drop training
- Incremental Learning: Positive/negative examples with prototype visualization
- Feedback Learning: Track predictions and accuracy over time
- Memory Bank: View stored embeddings, export/import as JSON
- Camera Training: Train using webcam with single/auto-capture modes
All demos use real CNN embeddings (512-dim) with prototypical networks
for classification. Includes cosine similarity scoring and confidence bars.
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-11 19:31:23 -04:00
github-actions[bot]
18a5f5a006
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 3691699569
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 23:03:28 +00:00
Reuven
3691699569
fix(demo): improve motion detection sensitivity and reliability
...
- Wait for video to be fully ready before processing
- Add camera warmup delay
- Increase sensitivity (10x amplification, lower threshold)
- Center crop video for consistent detection
- Add better error handling and logging
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-11 18:58:45 -04:00
github-actions[bot]
79520fff4f
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 4f49ec2d42
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 22:31:17 +00:00
Reuven
4f49ec2d42
feat(demo): add interactive demos and self-learning examples
...
Interactive Demos:
- Similarity Search: Click image to find similar ones
- Motion Detection: Real-time scene change via embeddings
- A/B Comparison: Side-by-side image comparison
- Batch Processor: Process up to 20 images at once
- Embedding Explorer: Visualize 512-dim features with stats
- Anomaly Detection: Find outlier images in a set
Self-Learning Code Examples:
- Self-Learning System: Adaptive classifier with feedback loop
- Incremental Learning: Online learning without forgetting
- Few-Shot Learning: Learn from 1-5 examples per class
- Image Retrieval: Searchable image database
- Content Moderation: Flag similar inappropriate content
- Visual Recommendations: Recommend similar items
- Video Keyframes: Extract unique frames
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-11 18:26:30 -04:00
github-actions[bot]
9aaaf03786
chore: Update NAPI-RS binaries for all platforms
...
Built from commit da6c7e260a
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 22:15:23 +00:00
github-actions[bot]
22262d2ba2
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 72c1f225f7
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 22:13:38 +00:00
Reuven
da6c7e260a
fix(pose): force WebGL backend for TensorFlow.js compatibility
...
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-11 18:10:59 -04:00
Reuven
72c1f225f7
feat(demo): add real-time pose estimation with MoveNet
...
- Add new Pose Estimation tab with 17-keypoint body tracking
- Integrate TensorFlow.js MoveNet (SinglePose Lightning model)
- Real-time skeleton visualization with colored keypoints
- Pose embedding (34-dim) with similarity comparison
- Reference pose capture for pose matching
- Toggle skeleton visibility
- Add pose tracking and gesture control code examples
- Update badges and examples grid
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-11 18:09:07 -04:00
github-actions[bot]
641a7d355f
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 1799da243f
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 22:06:34 +00:00
github-actions[bot]
9ce3d478f0
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 15fcb8d556
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 22:03:38 +00:00
github-actions[bot]
9aeb9c942f
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 10b03c75c6
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 22:02:36 +00:00
Reuven
1799da243f
feat: add real-time CNN processing with live webcam
...
- Real-time embedding extraction at ~30+ FPS
- Live FPS and latency display
- Reference image comparison with similarity score
- Live embedding visualization
- Center-crop to square for consistent input
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-11 18:02:13 -04:00
github-actions[bot]
35ee504c64
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 4bf6bf706b
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 22:02:00 +00:00
Reuven
15fcb8d556
fix: host WASM files locally + add console debugging
...
- Host ruvector_cnn_wasm.js and .wasm in docs/cnn/
- Add detailed console.log for debugging WASM init
- Remove CDN dependency for reliability
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-11 17:59:13 -04:00
github-actions[bot]
8007dc398a
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 955793a301
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 21:58:01 +00:00
Reuven
4bf6bf706b
fix: correct WASM API usage in CNN demo
...
- Use object format for init: { module_or_path: url }
- Use WasmCnnEmbedder (not CnnEmbedder)
- Use snake_case methods: embedding_dim, cosine_similarity
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-11 17:57:25 -04:00
github-actions[bot]
81e4d73602
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 0625e2c46c
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 21:56:46 +00:00
github-actions[bot]
6072a61d79
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 27b1931130
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 21:56:23 +00:00
Reuven
955793a301
fix: add .nojekyll to disable Jekyll processing
...
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-11 17:53:19 -04:00
Reuven
0625e2c46c
refactor: move CNN demo to docs/cnn/ for shorter URL
...
URL: https://ruvnet.github.io/ruvector/cnn/
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-11 17:52:13 -04:00
Reuven
27b1931130
fix: add docs/index.html for GitHub Pages landing
...
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-11 17:51:50 -04:00
rUv
10b03c75c6
feat: add interactive CNN browser demo for GitHub Pages ( #253 )
...
- Single-file HTML demo with modern dark theme UI
- Drag & drop image upload + camera capture
- Real-time embedding extraction and visualization
- Similarity matrix comparing multiple images
- Performance metrics display (~5ms per image)
- Falls back to demo mode if WASM fails to load
- ADR-089 documenting the approach
Deploy to: https://ruvnet.github.io/ruvector/demo/cnn/
Co-authored-by: Reuven <cohen@ruv-mac-mini.local>
2026-03-11 17:50:42 -04:00
github-actions[bot]
1caaa1b2fa
chore: Update NAPI-RS binaries for all platforms
...
Built from commit e743785c7d
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-11 21:46:44 +00:00
rUv
e743785c7d
feat(ruvector-cnn): CNN contrastive learning + SIMD optimization fixes ( #252 )
...
* feat: add CNN contrastive learning crate with SIMD optimization
- Add ruvector-cnn crate with SIMD-optimized convolutions and contrastive losses
- Implement InfoNCE (SimCLR) and TripletLoss for contrastive learning
- Add MobileNet-V3 inspired backbone architecture
- Include AVX2, NEON, WASM SIMD support with scalar fallback
- Add WASM bindings (ruvector-cnn-wasm) for browser/Node.js
- Add npm package with TypeScript definitions
- Include comprehensive research docs and ADR-088
- 36 tests passing
Co-Authored-By: claude-flow <ruv@ruv.net>
* feat: add npm package JavaScript wrapper and TypeScript definitions
Co-Authored-By: claude-flow <ruv@ruv.net>
* fix(ruvector-cnn): implement real SIMD and fix stubbed code
## SIMD Implementations (was using scalar fallbacks)
- AVX2: conv_3x3_avx2, conv_3x3_avx2_fma, depthwise_conv_3x3_avx2
- AVX2: global_avg_pool_avx2, max_pool_2x2_avx2
- WASM: conv_3x3_wasm, depthwise_conv_3x3_wasm
All now use real SIMD intrinsics processing 8 (AVX2) or 4 (WASM)
channels simultaneously with scalar fallback for remainders.
## Backbone Fixes
- Deprecated MobileNetV3Small/Large (use unified MobileNetV3 instead)
- Implemented actual block processing in forward() methods
- Fixed hardcoded channel counts in global_avg_pool calls
## Dead Code Fixes
- Added #[allow(dead_code)] for momentum field (used in training)
- Added #[allow(dead_code)] for rng field (feature-gated)
- Added #[cfg(feature = "augmentation")] for rand::Rng import
- Commented out undefined "parallel" feature reference
Co-Authored-By: claude-flow <ruv@ruv.net>
* feat(ruvector-cnn): add Winograd F(2,3) and π-calibrated INT8 quantization
- Add Winograd F(2,3) transforms for 2.25x faster 3x3 convolutions
- Implement π-calibrated INT8 quantization with anti-resonance offsets
- Apply 4x loop unrolling with 4 accumulators to AVX2 convolutions
- Update README with practical intro, capabilities table, benchmarks
- Update npm README with simpler language and examples
- Add CNN image embeddings to root README capabilities
Co-Authored-By: claude-flow <ruv@ruv.net>
* feat: publish @ruvector/cnn v0.1.0 WASM npm package
- Add unsafe blocks for WASM SIMD intrinsics (v128_load/v128_store)
- Disable wasm-opt to avoid SIMD validation issues
- Build and include WASM bindings in npm package
- Update npm package.json with all WASM files
- Published to npm as @ruvector/cnn@0.1.0
Co-Authored-By: claude-flow <ruv@ruv.net>
---------
Co-authored-by: Reuven <cohen@ruv-mac-mini.local>
2026-03-11 17:41:53 -04:00
github-actions[bot]
aa57b9a7c2
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 7ace64061d
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-10 20:05:41 +00:00
rUv
7ace64061d
fix: auto-generate embedding and witness_hash for page creation
...
CreatePageRequest previously required embedding and witness_hash
fields, causing 422 errors when clients (including MCP SSE) omitted
them. Now matches ShareRequest behavior: #[serde(default)] on
embedding, witness_hash, tags, and evidence_links. Server
auto-generates 128-dim embedding via ruvllm and SHAKE-256 witness
hash when not provided.
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-10 20:01:10 +00:00
github-actions[bot]
c1522bad91
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 82b3431853
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-10 19:25:22 +00:00
rUv
82b3431853
feat: make SSE the default MCP connection method on pi.ruv.io
...
Restructure Guide modal MCP tab to lead with SSE (one command, no
install) and demote npx/Cargo to an optional "Local install" step.
Update main page terminal section to show SSE first.
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-10 19:20:45 +00:00
github-actions[bot]
0122f35a1a
chore: Update NAPI-RS binaries for all platforms
...
Built from commit 296749aafb
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-10 19:12:11 +00:00
rUv
296749aafb
docs: add SSE transport documentation to pi.ruv.io landing page
...
Add remote MCP SSE connection instructions to the Guide modal (MCP tab)
and the main page terminal section. Users can now connect via
`claude mcp add pi-brain --transport sse https://pi.ruv.io/sse `
without any local install.
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-10 19:06:25 +00:00
github-actions[bot]
48c3812e4e
chore: Update NAPI-RS binaries for all platforms
...
Built from commit d11dde6dbd
Platforms updated:
- linux-x64-gnu
- linux-arm64-gnu
- darwin-x64
- darwin-arm64
- win32-x64-msvc
🤖 Generated by GitHub Actions
2026-03-10 15:02:35 +00:00
rUv
d11dde6dbd
fix: prevent .mcp.json overwrites by removing empty mcpServers from settings ( #250 )
...
Stop writing empty mcpServers: {} to settings.json which could trigger
Claude Code to regenerate .mcp.json, stripping user-added fields like
autoStart. Doctor --fix now cleans up stale empty mcpServers entries.
Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-10 14:55:40 +00:00