kvcache-ai-ktransformers/ktransformers
2025-02-27 12:12:32 +08:00
configs update rope calculation; update modeling.py; update gate for moe 2025-02-01 07:32:21 +00:00
ktransformers_ext fix numa cpu distribution 2025-02-26 14:49:57 +08:00
models optimize gguf dequant, save mem, support Q2_K 2025-02-22 06:13:01 +00:00
operators fix experts torch 2025-02-26 15:04:40 +08:00
optimize Update DeepSeek-V3-Chat-multi-gpu-marlin.yaml 2025-02-26 21:53:50 +08:00
server modify 2025-02-26 19:21:30 +08:00
tests update git ignore add docker dev container 2025-02-25 17:22:11 +08:00
util Merge branch 'main' into main 2025-02-27 12:12:32 +08:00
website : refactor local_chat and fix message slice bug in server 2024-11-04 14:02:19 +08:00
__init__.py release v0.2.2rc1 2025-02-25 22:06:36 +08:00
local_chat.py Merge pull request #657 from kvcache-ai/feat-absorb-for-long-prefill 2025-02-25 16:53:21 +08:00