kvcache-ai-ktransformers/ktransformers/server/balance_serve
2025-04-18 12:11:18 +08:00
inference remove hard code max_length 2025-04-18 12:11:18 +08:00
sched_rpc.py add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
settings.py format kvc2, delete quant_configs, move model_configs to ~/.ktransformers 2025-04-08 10:06:07 +00:00