ruvector/examples/data/framework
rUv cbacb0b9d6 feat(data-framework): v0.3.0 with HNSW, similarity cache, and batch embeddings (#107)
## New Features
- HNSW Integration: O(log n) similarity search replaces O(n²) brute force (10-50x speedup)
- Similarity Cache: 2-3x speedup for repeated similarity queries
- Batch ONNX Embeddings: Chunked processing with progress callbacks
- Shared Utils Module: cosine_similarity, euclidean_distance, normalize_vector
- Auto-connect by Embeddings: CoherenceEngine creates edges from vector similarity

## Performance Improvements
- 8.8x faster batch vector insertion (parallel processing)
- 10-50x faster similarity search (HNSW vs brute force)
- 2.9x faster similarity computation (SIMD acceleration)
- 2-3x faster repeated queries (similarity cache)

## Files Changed
- coherence.rs: HNSW integration, new CoherenceConfig fields
- optimized.rs: Similarity cache implementation
- utils.rs: New shared utility functions
- api_clients.rs: Batch embedding methods (embed_batch_chunked, embed_batch_with_progress)
- README.md: Documented all new features and configuration options

Published as ruvector-data-framework v0.3.0 on crates.io

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-05 16:16:38 -05:00
..
benches feat: Add comprehensive dataset discovery framework for RuVector (#104) 2026-01-04 14:36:41 -05:00
discovery_exports feat: Add comprehensive dataset discovery framework for RuVector (#104) 2026-01-04 14:36:41 -05:00
docs feat: Add comprehensive dataset discovery framework for RuVector (#104) 2026-01-04 14:36:41 -05:00
examples feat(data-framework): v0.3.0 with HNSW, similarity cache, and batch embeddings (#107) 2026-01-05 16:16:38 -05:00
output feat: Add comprehensive dataset discovery framework for RuVector (#104) 2026-01-04 14:36:41 -05:00
src feat(data-framework): v0.3.0 with HNSW, similarity cache, and batch embeddings (#107) 2026-01-05 16:16:38 -05:00
tests feat: Add comprehensive dataset discovery framework for RuVector (#104) 2026-01-04 14:36:41 -05:00
Cargo.toml feat(data-framework): v0.3.0 with HNSW, similarity cache, and batch embeddings (#107) 2026-01-05 16:16:38 -05:00
EXPORT_GUIDE.md feat: Add comprehensive dataset discovery framework for RuVector (#104) 2026-01-04 14:36:41 -05:00
GEOSPATIAL_IMPLEMENTATION.md feat: Add comprehensive dataset discovery framework for RuVector (#104) 2026-01-04 14:36:41 -05:00
PATENT_CLIENT_README.md feat: Add comprehensive dataset discovery framework for RuVector (#104) 2026-01-04 14:36:41 -05:00
PERSISTENCE.md feat: Add comprehensive dataset discovery framework for RuVector (#104) 2026-01-04 14:36:41 -05:00
STREAMING_SUMMARY.md feat: Add comprehensive dataset discovery framework for RuVector (#104) 2026-01-04 14:36:41 -05:00