kvcache-ai-ktransformers/ktransformers/server
2025-08-05 15:24:17 +08:00
..
api fix load default max_new_tokens 2025-04-25 04:20:12 +00:00
backend GLM4 and SmallThinker 2025-07-25 16:56:36 +00:00
balance_serve Merge 8c8cb207aa into ee2ede0412 2025-08-05 15:24:17 +08:00
config update kvc disk path config. 2025-06-30 15:09:35 +00:00
crud Initial commit 2024-07-27 16:06:58 +08:00
models Initial commit 2024-07-27 16:06:58 +08:00
schemas fix load default max_new_tokens 2025-04-25 04:20:12 +00:00
utils add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
__init__.py Initial commit 2024-07-27 16:06:58 +08:00
args.py GLM4 and SmallThinker 2025-07-25 16:56:36 +00:00
exceptions.py Initial commit 2024-07-27 16:06:58 +08:00
main.py Move KV cache creation to balance_serve 2025-04-18 10:10:07 +00:00
requirements.txt support qwen3, dont speak human language 2025-04-28 08:44:47 +00:00