This website requires JavaScript.
Explore
Help
Sign in
vrr
/
kvcache-ai-ktransformers
Watch
2
Star
0
Fork
You've already forked kvcache-ai-ktransformers
0
mirror of
https://github.com/kvcache-ai/ktransformers.git
synced
2026-05-05 07:11:39 +00:00
Code
Issues
Projects
Releases
Packages
Wiki
Activity
Actions
1
3986e2d2cf
kvcache-ai-ktransformers
/
ktransformers
/
ktransformers_ext
/
cuda
History
Download ZIP
Download TAR.GZ
Azure-Tang
ed8437413b
merge main; Add torch q8 linear
2025-03-14 05:52:07 -04:00
..
custom_gguf
merge main; Add torch q8 linear
2025-03-14 05:52:07 -04:00
gptq_marlin
merge main; Add torch q8 linear
2025-03-14 05:52:07 -04:00
binding.cpp
Ensure backward compatibility with Torch 2.2
2025-02-24 21:55:30 +08:00
setup.py
[ADD] support multi-gpu qlen>1 q5_k
2024-08-12 11:41:26 +00:00
test_dequant.py
optimize gguf dequant, save mem, support Q2_K
2025-02-22 06:13:01 +00:00