This website requires JavaScript.
Explore
Help
Sign in
vrr
/
kvcache-ai-ktransformers
Watch
2
Star
0
Fork
You've already forked kvcache-ai-ktransformers
0
mirror of
https://github.com/kvcache-ai/ktransformers.git
synced
2025-09-08 13:39:48 +00:00
Code
Issues
Projects
Releases
Packages
Wiki
Activity
Actions
0b57627bb9
kvcache-ai-ktransformers
/
ktransformers
/
ktransformers_ext
/
cuda
History
BITcyman
7c4cb520bd
[feature] support q2_k & q3_k dequantize on gpu
2024-08-12 12:53:12 +00:00
..
custom_gguf
[feature] support q2_k & q3_k dequantize on gpu
2024-08-12 12:53:12 +00:00
gptq_marlin
[ADD] support multi-gpu qlen>1 q5_k
2024-08-12 11:41:26 +00:00
binding.cpp
[feature] support q2_k & q3_k dequantize on gpu
2024-08-12 12:53:12 +00:00
setup.py
[ADD] support multi-gpu qlen>1 q5_k
2024-08-12 11:41:26 +00:00