kvcache-ai-ktransformers/ktransformers
Azure ae5d9e11a9
Merge pull request #227 from hrz6976/main
Add a lock to server inference()
2025-02-14 10:35:11 +08:00
..
configs update rope calculation; update modeling.py; update gate for moe 2025-02-01 07:32:21 +00:00
ktransformers_ext v0.2 ongoing 2025-02-09 22:41:14 +08:00
models update FAQ 2025-02-12 08:50:58 +00:00
operators 📝 fix some debug output and update doc 2025-02-13 17:25:12 +08:00
optimize Add optimization config for Deepseek V3/R1 with 4 GPUs 2025-02-13 16:32:28 +08:00
server Add a lock to server inference() 2025-02-13 10:05:22 +00:00
tests [fix] format classes and files name 2024-08-15 10:44:59 +08:00
util support R1 force thinking 2025-02-11 15:43:41 +08:00
website : refactor local_chat and fix message slice bug in server 2024-11-04 14:02:19 +08:00
__init__.py [feature] update version and github action jobs for package 2025-02-10 01:00:57 +00:00
local_chat.py support force thinking 2025-02-12 12:43:53 +08:00