kvcache-ai-ktransformers/ktransformers/server/balance_serve/inference
2025-08-05 15:24:17 +08:00
..
distributed add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
sampling Apply magikRUKKOLA's patch from issue #1417 2025-07-06 19:45:06 +00:00
__init__.py add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
config.py add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
forward_batch.py support safetensor load, delete architectures argument 2025-05-09 10:38:29 +00:00
model_runner.py support smt and glm4 2025-07-25 15:03:27 +00:00
query_manager.py remove hard code max_length 2025-04-18 12:11:18 +08:00