kvcache-ai-ktransformers/ktransformers
Jiaqi Liao 05f6cede37
Merge pull request #943 from SkqLiao/main
fix benchmark params for human eval benchmark
2025-03-20 18:49:34 +08:00
..
configs update rope calculation; update modeling.py; update gate for moe 2025-02-01 07:32:21 +00:00
ktransformers_ext fix rocm compilation 2025-03-15 12:34:03 -04:00
models optimize gguf dequant, save mem, support Q2_K 2025-02-22 06:13:01 +00:00
operators Update gate.py 2025-03-20 14:54:01 +08:00
optimize Update DeepSeek-V3-Chat-multi-gpu-marlin.yaml 2025-03-17 17:05:01 +08:00
server Merge pull request #842 from BITcyman/fix-openai_chat_completion 2025-03-07 22:56:19 +08:00
tests Merge pull request #943 from SkqLiao/main 2025-03-20 18:49:34 +08:00
util merge main; Add torch q8 linear 2025-03-14 05:52:07 -04:00
website : refactor local_chat and fix message slice bug in server 2024-11-04 14:02:19 +08:00
__init__.py 🔖 release v0.2.3post2 2025-03-15 18:04:10 +08:00
local_chat.py merge main; Add torch q8 linear 2025-03-14 05:52:07 -04:00
local_chat_test.py local chat for cicd test 2025-03-15 02:31:19 +08:00