kvcache-ai-ktransformers/ktransformers/server
2025-02-24 21:07:35 +08:00
Name              Last commit message                Date
api               Also allow repetition_penalty      2025-02-24 21:07:35 +08:00
backend           Also allow repetition_penalty      2025-02-24 21:07:35 +08:00
config            update force_think                 2025-02-12 11:42:55 +08:00
crud              Initial commit                     2024-07-27 16:06:58 +08:00
models            Initial commit                     2024-07-27 16:06:58 +08:00
schemas           Also allow repetition_penalty      2025-02-24 21:07:35 +08:00
utils             Initial commit                     2024-07-27 16:06:58 +08:00
__init__.py       Initial commit                     2024-07-27 16:06:58 +08:00
args.py           feat: add prefix cache for server  2025-02-17 00:10:55 +08:00
exceptions.py     Initial commit                     2024-07-27 16:06:58 +08:00
main.py           fix: fix server for triton kernel  2025-02-17 18:08:45 +08:00
requirements.txt  Initial commit                     2024-07-27 16:06:58 +08:00