ktransformers_ext
|
⚡ v0.2 ongoing
|
2025-02-09 22:41:14 +08:00 |
models
|
Merge pull request #294 from kvcache-ai/feat-fast-MLA
|
2025-02-14 19:40:36 +08:00 |
operators
|
Update attention.py
|
2025-02-15 15:43:35 +08:00 |
optimize
|
add V3/R1 8 gpu yaml example
|
2025-02-14 02:56:13 +00:00 |
tests
|
[fix] format classes and files name
|
2024-08-15 10:44:59 +08:00 |
util
|
warm_up before capture
|
2025-02-14 15:52:21 +00:00 |
__init__.py
|
[feature] update docker image and entrypoint
|
2025-02-15 07:55:33 +00:00 |
local_chat.py
|
⚡ support force thinking
|
2025-02-12 12:43:53 +08:00 |