kvcache-ai-ktransformers/ktransformers
2025-02-14 19:40:36 +08:00
configs update rope calculation; update modeling.py; update gate for moe 2025-02-01 07:32:21 +00:00
ktransformers_ext v0.2 ongoing 2025-02-09 22:41:14 +08:00
models Merge pull request #294 from kvcache-ai/feat-fast-MLA 2025-02-14 19:40:36 +08:00
operators Merge pull request #294 from kvcache-ai/feat-fast-MLA 2025-02-14 19:40:36 +08:00
optimize add V3/R1 8 gpu yaml example 2025-02-14 02:56:13 +00:00
server Add a lock to server inference() 2025-02-13 10:05:22 +00:00
tests [fix] format classes and files name 2024-08-15 10:44:59 +08:00
util init support for MLA using Attention kernel 2025-02-13 15:01:14 +00:00
website refactor local_chat and fix message slice bug in server 2024-11-04 14:02:19 +08:00
__init__.py [feature] update version and github action jobs for package 2025-02-10 01:00:57 +00:00
local_chat.py support force thinking 2025-02-12 12:43:53 +08:00