mirror of
https://github.com/ruvnet/RuVector.git
synced 2026-05-22 19:56:25 +00:00
Comprehensive research document covering TurboQuant (ICLR 2026) and its mapping to ruvLLM. Covers algorithm details, performance results, integration architecture, PiQ3 comparison, risks/mitigations, and implementation summary. https://claude.ai/code/session_011ogX2uc7Zf8d8aQ3UAbNcd |
||
|---|---|---|
| .. | ||
| 00-README.md | ||
| 01-ultra-low-bit-quantization-survey.md | ||
| 02-quantization-aware-training-qat.md | ||
| 03-quip-2bit-framework.md | ||
| 04-moe-memory-aware-routing.md | ||
| 05-ruvllm-quantization-architecture.md | ||
| 06-implementation-plan-rust-ruvllm.md | ||
| 07-3int-pi-constant-quantization.md | ||
| 08-turboquant-kv-cache-compression.md | ||