|
__init__.py
|
Initial commit
|
2024-07-27 16:06:58 +08:00 |
|
attention.py
|
fix flashinfer precision
|
2025-03-07 14:07:00 +00:00 |
|
cpuinfer.py
|
[feature] release 0.1.3
|
2024-08-28 16:11:43 +00:00 |
|
dynamic_attention.py
|
[feature] release 0.1.3
|
2024-08-28 16:11:43 +00:00 |
|
experts.py
|
⚡ fix experts torch
|
2025-02-26 15:04:40 +08:00 |
|
flashinfer_wrapper.py
|
fix flashinfer precision
|
2025-03-07 14:07:00 +00:00 |
|
linear.py
|
fix: wrong shape in KLinearMarlin.
|
2025-03-03 17:34:45 +08:00 |
|
models.py
|
support absorb for prefill long context
|
2025-02-25 08:52:02 +00:00 |
|
triton_attention.py
|
Update triton_attention.py
|
2025-02-15 15:41:01 +08:00 |