api | Revert repetition_penalty as it is not in API spec | 2025-02-24 21:30:03 +08:00
backend | Left out | 2025-02-24 21:51:14 +08:00
config | ⚡ update force_think | 2025-02-12 11:42:55 +08:00
crud | Initial commit | 2024-07-27 16:06:58 +08:00
models | Initial commit | 2024-07-27 16:06:58 +08:00
schemas | Default values | 2025-02-24 21:38:01 +08:00
utils | Initial commit | 2024-07-27 16:06:58 +08:00
__init__.py | Initial commit | 2024-07-27 16:06:58 +08:00
args.py | feat: add prefix cache for server | 2025-02-17 00:10:55 +08:00
exceptions.py | Initial commit | 2024-07-27 16:06:58 +08:00
main.py | fix: fix server for triton kernel | 2025-02-17 18:08:45 +08:00
requirements.txt | Initial commit | 2024-07-27 16:06:58 +08:00