kvcache-ai-ktransformers/ktransformers
2025-07-22 10:58:25 +00:00
..
configs support npu 2025-07-22 10:58:16 +00:00
ktransformers_ext add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
models support npu 2025-07-22 10:58:25 +00:00
operators support npu 2025-07-22 10:58:25 +00:00
optimize support npu 2025-07-22 10:58:25 +00:00
server support npu 2025-07-22 10:58:25 +00:00
tests add prefix cache support for kvc2. 2025-06-26 04:57:25 +00:00
util support npu 2025-07-22 10:58:25 +00:00
website : refactor local_chat and fix message slice bug in server 2024-11-04 14:02:19 +08:00
__init__.py Update __init__.py 2025-07-01 16:43:19 +08:00
local_chat.py revert using FP16 2025-07-01 14:24:27 +08:00
local_chat_npu.py support npu 2025-07-22 10:58:25 +00:00
local_chat_test.py fix some bugs 2025-04-17 00:48:09 +08:00