kvcache-ai-ktransformers/ktransformers
2025-02-15 21:57:08 +08:00
..
configs update rope calculation; update modeling.py; update gate for moe 2025-02-01 07:32:21 +00:00
ktransformers_ext v0.2 ongoing 2025-02-09 22:41:14 +08:00
models Merge pull request #294 from kvcache-ai/feat-fast-MLA 2025-02-14 19:40:36 +08:00
operators Update attention.py 2025-02-15 15:43:35 +08:00
optimize add V3/R1 8 gpu yaml example 2025-02-14 02:56:13 +00:00
server Solve torch.backends.cuda.sdp_kernel() is deprecated. 2025-02-15 12:41:51 +08:00
tests [fix] format classes and files name 2024-08-15 10:44:59 +08:00
util warm_up before capture 2025-02-14 15:52:21 +00:00
website : refactor local_chat and fix message slice bug in server 2024-11-04 14:02:19 +08:00
__init__.py [feature] update docker image and entrypoint 2025-02-15 07:55:33 +00:00
local_chat.py support force thinking 2025-02-12 12:43:53 +08:00