__init__.py
|
Initial commit
|
2024-07-27 16:06:58 +08:00 |
attention.py
|
linux support triton MLA kernel
|
2025-02-14 11:38:55 +00:00 |
base_operator.py
|
Initial commit
|
2024-07-27 16:06:58 +08:00 |
cpuinfer.py
|
[feature] release 0.1.3
|
2024-08-28 16:11:43 +00:00 |
dynamic_attention.py
|
[feature] release 0.1.3
|
2024-08-28 16:11:43 +00:00 |
experts.py
|
toy support for experts on GPU, no CUDA Graph
|
2025-02-15 15:16:00 +00:00 |
gate.py
|
done support deepseekv3
|
2025-02-04 15:53:38 +00:00 |
linear.py
|
toy support for experts on GPU, no CUDA Graph
|
2025-02-15 15:16:00 +00:00 |
models.py
|
done support deepseekv3
|
2025-02-04 15:53:38 +00:00 |
RoPE.py
|
⚡ ready to publish
|
2025-02-10 12:29:23 +08:00 |