kvcache-ai-ktransformers/ktransformers
2025-04-09 10:54:00 +08:00
..
configs format kvc2, delete quant_configs, move model_configs to ~/.ktransformers 2025-04-08 10:06:07 +00:00
ktransformers_ext add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
models Fix bug with non-base-multiple chunk_size, update test examples, and resolve issue with writing model_config. Hugging Face URL input is still unsupported. 2025-04-04 15:41:07 +08:00
operators Update attention.py 2025-04-09 10:54:00 +08:00
optimize rm KMoEGateDeepSeekV3, fall back to KMoEGate 2025-04-01 07:13:05 +00:00
server format kvc2, delete quant_configs, move model_configs to ~/.ktransformers 2025-04-08 10:06:07 +00:00
tests Fix bug with non-base-multiple chunk_size, update test examples, and resolve issue with writing model_config. Hugging Face URL input is still unsupported. 2025-04-04 15:41:07 +08:00
util update install doc and fix local_chat bug 2025-04-03 12:42:41 +08:00
website : refactor local_chat and fix message slice bug in server 2024-11-04 14:02:19 +08:00
__init__.py 🔖 release v0.2.4post1 2025-04-04 16:01:25 +08:00
local_chat.py add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
local_chat_test.py local chat for cicd test 2025-03-15 02:31:19 +08:00