mirror of
https://github.com/ruvnet/RuVector.git
synced 2026-05-22 19:56:25 +00:00
P0: Router buffer reuse optimization - Add pre-allocated result_buffer to MemoryAwareRouter - Eliminate collect() allocation in select_top_k_buffered() - Use std::mem::take for zero-copy buffer handoff - Expected savings: 1-2µs per routing call P1: Optional routing metrics feature flag - Add 'routing-metrics' feature (enabled by default) - Conditionally compile Instant::now() and metrics tracking - Allows production builds to avoid syscall overhead (~0.04-0.08µs) Performance Analysis Documentation: - MoE routing optimization analysis report - Comprehensive architecture review (5 documents) - Identifies 8 additional optimization opportunities ADR-092 targets: <10µs routing latency, 70%+ cache hit rate All 26 MoE router tests pass. Co-Authored-By: claude-flow <ruv@ruv.net> |
||
|---|---|---|
| .. | ||
| RUVLLM_ARCHITECTURE_REVIEW.md | ||
| RUVLLM_OPTIMIZATION_CHECKLIST.md | ||
| RUVLLM_REVIEW_INDEX.md | ||
| RUVLLM_REVIEW_SUMMARY.md | ||
| RUVLLM_UNSAFE_CODE_AUDIT.md | ||