ruvector

vrr/ruvector

Fork 0

mirror of https://github.com/ruvnet/RuVector.git synced 2026-05-25 15:03:46 +00:00

Commit graph

Author	SHA1	Message	Date
rUv	1b6e725b49	fix: resolve 5 P0 critical issues + 2 pre-existing compile errors - ONNX embeddings: dynamic dimension detection + conditional token_type_ids (#237) - rvf-node: add compression field pass-through to Rust N-API struct (#225) - Cargo workspace: add glob excludes for nested rvf sub-packages (#214) - ruvllm: fix stats crash (null guard + try/catch) + generate warning (#103) - ruvllm-wasm: deprecated placeholder on npm (#238) - Pre-existing: fix ruvector-sparse-inference-wasm API mismatch, exclude from workspace - Pre-existing: fix ruvector-cloudrun-gpu RuvectorLayer::new() Result handling Co-Authored-By: claude-flow <ruv@ruv.net>	2026-03-06 14:03:42 +00:00
rUv	42d869a196	style: apply rustfmt across entire codebase Run rustfmt on all Rust files to fix CI formatting checks. This addresses pre-existing formatting inconsistencies across: - cognitum-gate-kernel - cognitum-gate-tilezero - prime-radiant - ruvector-* crates - examples/benchmarks - and other crates Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-28 17:00:26 +00:00
rUv	04b26c8d69	feat: Add PowerInfer-style sparse inference engine with precision lanes (#106 ) ## Summary - Add PowerInfer-style sparse inference engine with precision lanes - Add memory module with QuantizedWeights and NeuronCache - Fix compilation and test issues - Demonstrated 2.9-8.7x speedup at typical sparsity levels - Published to crates.io as ruvector-sparse-inference v0.1.30 ## Key Features - Low-rank predictor using P·Q matrix factorization for fast neuron selection - Sparse FFN kernels that only compute active neurons - SIMD optimization for AVX2, SSE4.1, NEON, and WASM SIMD - GGUF parser with full quantization support (Q4_0 through Q6_K) - Precision lanes (3/5/7-bit layered quantization) - π integration for low-precision systems 🤖 Generated with [Claude Code](https://claude.com/claude-code)	2026-01-04 23:40:31 -05:00

Author

SHA1

Message

Date

rUv

1b6e725b49

fix: resolve 5 P0 critical issues + 2 pre-existing compile errors

- ONNX embeddings: dynamic dimension detection + conditional token_type_ids (#237)
- rvf-node: add compression field pass-through to Rust N-API struct (#225)
- Cargo workspace: add glob excludes for nested rvf sub-packages (#214)
- ruvllm: fix stats crash (null guard + try/catch) + generate warning (#103)
- ruvllm-wasm: deprecated placeholder on npm (#238)
- Pre-existing: fix ruvector-sparse-inference-wasm API mismatch, exclude from workspace
- Pre-existing: fix ruvector-cloudrun-gpu RuvectorLayer::new() Result handling

Co-Authored-By: claude-flow <ruv@ruv.net>

2026-03-06 14:03:42 +00:00

rUv

42d869a196

style: apply rustfmt across entire codebase

Run rustfmt on all Rust files to fix CI formatting checks.
This addresses pre-existing formatting inconsistencies across:
- cognitum-gate-kernel
- cognitum-gate-tilezero
- prime-radiant
- ruvector-* crates
- examples/benchmarks
- and other crates

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

2026-01-28 17:00:26 +00:00

rUv

04b26c8d69

feat: Add PowerInfer-style sparse inference engine with precision lanes (#106 )

## Summary
- Add PowerInfer-style sparse inference engine with precision lanes
- Add memory module with QuantizedWeights and NeuronCache
- Fix compilation and test issues
- Demonstrated 2.9-8.7x speedup at typical sparsity levels
- Published to crates.io as ruvector-sparse-inference v0.1.30

## Key Features
- Low-rank predictor using P·Q matrix factorization for fast neuron selection
- Sparse FFN kernels that only compute active neurons
- SIMD optimization for AVX2, SSE4.1, NEON, and WASM SIMD
- GGUF parser with full quantization support (Q4_0 through Q6_K)
- Precision lanes (3/5/7-bit layered quantization)
- π integration for low-precision systems

🤖 Generated with [Claude Code](https://claude.com/claude-code)

2026-01-04 23:40:31 -05:00

3 commits