kvcache-ai-ktransformers/ktransformers
2025-02-23 15:37:09 +08:00
..
configs update rope calculation; update modeling.py; update gate for moe 2025-02-01 07:32:21 +00:00
ktransformers_ext fix merge bug, this branch also padding Marlin 2025-02-22 09:00:09 +00:00
models optimize gguf dequant, save mem, support Q2_K 2025-02-22 06:13:01 +00:00
operators Merge branch 'main' into feat-more-context 2025-02-22 06:17:39 +00:00
optimize optimize gguf dequant, save mem, support Q2_K 2025-02-22 06:13:01 +00:00
server Merge branch 'main' into feat-more-context 2025-02-22 06:17:39 +00:00
tests fix .so bug 2025-02-20 21:24:46 +08:00
util fix bf16 load, TODO: refactor cpu dequant 2025-02-23 15:37:09 +08:00
website : refactor local_chat and fix message slice bug in server 2024-11-04 14:02:19 +08:00
__init__.py 🔖 release v0.2.1.post1 2025-02-18 20:45:48 +08:00
local_chat.py optimize GPU 2025-02-21 05:06:57 +00:00