__init__.py | Initial commit | 2024-07-27 16:06:58 +08:00
attention.py | fix flashinfer precision | 2025-03-07 14:07:00 +00:00
cpuinfer.py | [feature] release 0.1.3 | 2024-08-28 16:11:43 +00:00
dynamic_attention.py | [feature] release 0.1.3 | 2024-08-28 16:11:43 +00:00
experts.py | ⚡ fix experts torch | 2025-02-26 15:04:40 +08:00
flashinfer_wrapper.py | fix flashinfer precision | 2025-03-07 14:07:00 +00:00
linear.py | fix: wrong shape in KLinearMarlin. | 2025-03-03 17:34:45 +08:00
models.py | support absorb for prefill long context | 2025-02-25 08:52:02 +00:00
triton_attention.py | Update triton_attention.py | 2025-02-15 15:41:01 +08:00