kvcache-ai-ktransformers/ktransformers/server
2025-02-26 17:05:00 +08:00
..
api implementation of chat routing for Ollama 2025-02-26 17:05:00 +08:00
backend Merge pull request #657 from kvcache-ai/feat-absorb-for-long-prefill 2025-02-25 16:53:21 +08:00
config update force_think 2025-02-12 11:42:55 +08:00
crud Initial commit 2024-07-27 16:06:58 +08:00
models Initial commit 2024-07-27 16:06:58 +08:00
schemas Initial commit 2024-07-27 16:06:58 +08:00
utils Initial commit 2024-07-27 16:06:58 +08:00
__init__.py Initial commit 2024-07-27 16:06:58 +08:00
args.py feat: add prefix cache for server 2025-02-17 00:10:55 +08:00
exceptions.py Initial commit 2024-07-27 16:06:58 +08:00
main.py fix: fix server for triton kernel 2025-02-17 18:08:45 +08:00
requirements.txt Initial commit 2024-07-27 16:06:58 +08:00