ruvector/docs/research/quantization-edge
Claude 51bb16ca09 docs(research): add TurboQuant KV cache compression research document
Comprehensive research document covering TurboQuant (ICLR 2026) and its
mapping to ruvLLM. Covers algorithm details, performance results,
integration architecture, PiQ3 comparison, risks/mitigations, and
implementation summary.

https://claude.ai/code/session_011ogX2uc7Zf8d8aQ3UAbNcd
2026-03-25 12:14:17 +00:00
00-README.md docs(research): add ultra-low-bit quantization & edge deployment research (#255) 2026-03-12 10:21:30 -04:00
01-ultra-low-bit-quantization-survey.md docs(research): add ultra-low-bit quantization & edge deployment research (#255) 2026-03-12 10:21:30 -04:00
02-quantization-aware-training-qat.md docs(research): add ultra-low-bit quantization & edge deployment research (#255) 2026-03-12 10:21:30 -04:00
03-quip-2bit-framework.md docs(research): add ultra-low-bit quantization & edge deployment research (#255) 2026-03-12 10:21:30 -04:00
04-moe-memory-aware-routing.md docs(research): add ultra-low-bit quantization & edge deployment research (#255) 2026-03-12 10:21:30 -04:00
05-ruvllm-quantization-architecture.md docs(research): add ultra-low-bit quantization & edge deployment research (#255) 2026-03-12 10:21:30 -04:00
06-implementation-plan-rust-ruvllm.md docs(research): add ultra-low-bit quantization & edge deployment research (#255) 2026-03-12 10:21:30 -04:00
07-3int-pi-constant-quantization.md docs(research): add ultra-low-bit quantization & edge deployment research (#255) 2026-03-12 10:21:30 -04:00
08-turboquant-kv-cache-compression.md docs(research): add TurboQuant KV cache compression research document 2026-03-25 12:14:17 +00:00