ruvector/docs/reviews
Reuven 383ff5e99f perf(ruvllm): optimize MoE routing with buffer reuse and optional metrics
P0: Router buffer reuse optimization
- Add pre-allocated result_buffer to MemoryAwareRouter
- Eliminate collect() allocation in select_top_k_buffered()
- Use std::mem::take for zero-copy buffer handoff
- Expected savings: 1-2µs per routing call

P1: Optional routing metrics feature flag
- Add 'routing-metrics' feature (enabled by default)
- Conditionally compile Instant::now() and metrics tracking
- Allows production builds to avoid syscall overhead (~0.04-0.08µs)

Performance Analysis Documentation:
- MoE routing optimization analysis report
- Comprehensive architecture review (5 documents)
- Identifies 8 additional optimization opportunities

ADR-092 targets: <10µs routing latency, 70%+ cache hit rate
All 26 MoE router tests pass.

Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-12 23:27:00 -04:00
..
RUVLLM_ARCHITECTURE_REVIEW.md perf(ruvllm): optimize MoE routing with buffer reuse and optional metrics 2026-03-12 23:27:00 -04:00
RUVLLM_OPTIMIZATION_CHECKLIST.md perf(ruvllm): optimize MoE routing with buffer reuse and optional metrics 2026-03-12 23:27:00 -04:00
RUVLLM_REVIEW_INDEX.md perf(ruvllm): optimize MoE routing with buffer reuse and optional metrics 2026-03-12 23:27:00 -04:00
RUVLLM_REVIEW_SUMMARY.md perf(ruvllm): optimize MoE routing with buffer reuse and optional metrics 2026-03-12 23:27:00 -04:00
RUVLLM_UNSAFE_CODE_AUDIT.md perf(ruvllm): optimize MoE routing with buffer reuse and optional metrics 2026-03-12 23:27:00 -04:00