kvcache-ai-ktransformers/ktransformers/server/backend/interfaces
2025-02-17 14:25:27 +08:00
File              Last commit message                             Last updated
__init__.py       Initial commit                                  2024-07-27 16:06:58 +08:00
exllamav2.py      Initial commit                                  2024-07-27 16:06:58 +08:00
ktransformers.py  feat: add prefix cache for server               2025-02-17 00:10:55 +08:00
transformers.py   fix: server: drop <think> tag in chat template  2025-02-17 14:25:27 +08:00