Claude
|
2a21efbb5c
|
feat: Add ADR-017 temporal tensor compression with tiered quantization
Introduces a complete temporal tensor compression system with:
- ADR-017: SOTA research-backed architecture decision record covering
groupwise symmetric quantization, temporal segment reuse, access-pattern
driven tier selection (8/7/5/3 bit), and WASM-compatible design
- ruvector-temporal-tensor crate (zero external dependencies):
- tier_policy: Score-based hot/warm/cold bit-width selection
- f16: Software IEEE 754 half-precision conversion
- bitpack: Arbitrary bit-width stream packing (no alignment waste)
- quantizer: Groupwise symmetric quantization with f16 scales
- segment: Binary segment format (TQTC) encode/decode
- compressor: Temporal segment manager with drift detection
- ffi: WASM/C FFI with handle-based resource management
- ruvector-temporal-tensor-wasm crate for wasm32 targets
- 33 passing unit tests covering all modules
Compression targets: 4x (hot/8-bit), 4.57x (warm/7-bit),
6.4x (warm/5-bit), 10.67x (cold/3-bit) vs f32 baseline.
https://claude.ai/code/session_01U63xtGd5Q8mUevyY7nUSfJ
|
2026-02-06 00:28:21 +00:00 |
|