ruvector/crates/ruvector-diskann
Reuven fd7a9fce7d perf(diskann): FlatVectors + VisitedSet + ILP + optional SIMD/GPU
Optimizations applied:
- FlatVectors: contiguous f32 slab (eliminates Vec<Vec> indirection)
- VisitedSet: O(1) clear via generation counter (replaces HashSet)
- 4-accumulator ILP for L2 distance (auto-vectorized)
- Flat PQ distance table (cache-line friendly)
- Parallel medoid finding via rayon
- Zero-copy save (write flat slab directly)
- Optional simsimd feature for hardware NEON/AVX2/AVX-512
- Optional gpu feature with Metal/CUDA/Vulkan dispatch stubs

Results (5K vectors, 128d):
- Search: 90µs → 55µs (1.6x faster)
- Build: 6.9s → 6.2s (10% faster)
- Recall@10: 0.998 (maintained)
- 17 tests passing

Co-Authored-By: claude-flow <ruv@ruv.net>
2026-04-06 17:52:58 -04:00
..
src perf(diskann): FlatVectors + VisitedSet + ILP + optional SIMD/GPU 2026-04-06 17:52:58 -04:00
Cargo.toml perf(diskann): FlatVectors + VisitedSet + ILP + optional SIMD/GPU 2026-04-06 17:52:58 -04:00