ruvector/.github
Reuven 59655d064a feat(ruvllm): Add Core ML/ANE backend with Apple Neural Engine support
- Add Core ML backend with objc2-core-ml bindings for .mlmodel/.mlmodelc/.mlpackage
- Implement ANE optimization kernels with dimension-based crossover thresholds
  - ANE_OPTIMAL_DIM=512, GPU_CROSSOVER=1536, GPU_DOMINANCE=2048
  - Automatic hardware selection based on tensor dimensions
- Add hybrid pipeline for intelligent CPU/GPU/ANE workload distribution
- Implement LlmBackend trait with generate(), generate_stream(), get_embeddings()
- Add streaming token generation with both iterator and channel-based approaches
- Enhance autodetect with Core ML model path discovery and capability detection
- Add comprehensive ANE benchmarks and integration tests
- Fix test failures in autodetect_integration (memory calculation) and
  serving_integration (KV cache FIFO slot allocation, churn test cleanup)
- Add GitHub Actions workflow for ruvllm benchmarks
- Create comprehensive v2 release documentation (GITHUB_ISSUE_V2.md)

Performance targets:
- ANE: 38 TOPS on M4 Pro for matrix operations
- Hybrid pipeline: Automatic workload balancing across compute units
- Memory: Efficient tensor allocation with platform-specific alignment

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-19 17:32:08 -05:00
..
benchmarks feat: Add Neo4j-compatible hypergraph database package (ruvector-graph) 2025-11-25 23:11:54 +00:00
workflows feat(ruvllm): Add Core ML/ANE backend with Apple Neural Engine support 2026-01-19 17:32:08 -05:00