Commit graph

3 commits

Author SHA1 Message Date
rUv
1b6e725b49 fix: resolve 5 P0 critical issues + 2 pre-existing compile errors
- ONNX embeddings: dynamic dimension detection + conditional token_type_ids (#237)
- rvf-node: add compression field pass-through to Rust N-API struct (#225)
- Cargo workspace: add glob excludes for nested rvf sub-packages (#214)
- ruvllm: fix stats crash (null guard + try/catch) + generate warning (#103)
- ruvllm-wasm: deprecated placeholder on npm (#238)
- Pre-existing: fix ruvector-sparse-inference-wasm API mismatch, exclude from workspace
- Pre-existing: fix ruvector-cloudrun-gpu RuvectorLayer::new() Result handling

Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-06 14:03:42 +00:00
rUv
42d869a196 style: apply rustfmt across entire codebase
Run rustfmt on all Rust files to fix CI formatting checks.
This addresses pre-existing formatting inconsistencies across:
- cognitum-gate-kernel
- cognitum-gate-tilezero
- prime-radiant
- ruvector-* crates
- examples/benchmarks
- and other crates

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-28 17:00:26 +00:00
rUv
04b26c8d69 feat: Add PowerInfer-style sparse inference engine with precision lanes (#106)
## Summary
- Add PowerInfer-style sparse inference engine with precision lanes
- Add memory module with QuantizedWeights and NeuronCache
- Fix compilation and test issues
- Demonstrated 2.9-8.7x speedup at typical sparsity levels
- Published to crates.io as ruvector-sparse-inference v0.1.30

## Key Features
- Low-rank predictor using P·Q matrix factorization for fast neuron selection
- Sparse FFN kernels that only compute active neurons
- SIMD optimization for AVX2, SSE4.1, NEON, and WASM SIMD
- GGUF parser with full quantization support (Q4_0 through Q6_K)
- Precision lanes (3/5/7-bit layered quantization)
- π integration for low-precision systems

🤖 Generated with [Claude Code](https://claude.com/claude-code)
2026-01-04 23:40:31 -05:00