kvcache-ai-ktransformers/ktransformers
2025-07-01 16:43:19 +08:00
configs update kvc disk path config. 2025-06-30 15:09:35 +00:00
ktransformers_ext add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
models fix md typo, fix code style, and update setup value error message 2025-05-15 10:14:39 +00:00
operators Load DS-R1-0528 for module is BaseInjectedModule instance 2025-06-10 11:31:58 +08:00
optimize add XPU support for qwen3moe local chat 2025-05-22 21:01:41 +08:00
server update kvc disk path config. 2025-06-30 15:09:35 +00:00
tests add prefix cache support for kvc2. 2025-06-26 04:57:25 +00:00
util Fix kv_b_proj shape for unsloth quantized models 2025-06-05 17:33:11 +08:00
website refactor local_chat and fix message slice bug in server 2024-11-04 14:02:19 +08:00
__init__.py Update __init__.py 2025-07-01 16:43:19 +08:00
local_chat.py revert using FP16 2025-07-01 14:24:27 +08:00
local_chat_test.py fix some bugs 2025-04-17 00:48:09 +08:00