api
|
fix load default max_new_tokens
|
2025-04-25 04:20:12 +00:00 |
backend
|
GLM4 and SmallThinker
|
2025-07-25 16:56:36 +00:00 |
balance_serve
|
Merge 8c8cb207aa into ee2ede0412
|
2025-08-05 15:24:17 +08:00 |
config
|
update kvc disk path config.
|
2025-06-30 15:09:35 +00:00 |
crud
|
Initial commit
|
2024-07-27 16:06:58 +08:00 |
models
|
Initial commit
|
2024-07-27 16:06:58 +08:00 |
schemas
|
fix load default max_new_tokens
|
2025-04-25 04:20:12 +00:00 |
utils
|
add balance-serve, support concurrence
|
2025-03-31 22:55:32 +08:00 |
__init__.py
|
Initial commit
|
2024-07-27 16:06:58 +08:00 |
args.py
|
GLM4 and SmallThinker
|
2025-07-25 16:56:36 +00:00 |
exceptions.py
|
Initial commit
|
2024-07-27 16:06:58 +08:00 |
main.py
|
Move KV cache creation to balance_serve
|
2025-04-18 10:10:07 +00:00 |
requirements.txt
|
support qwen3, dont speak human language
|
2025-04-28 08:44:47 +00:00 |