|
bench
|
[feature] release 0.1.3
|
2024-08-28 16:11:43 +00:00 |
|
cmake
|
[feature] support python 310 and multi instruction
|
2024-07-31 13:58:17 +00:00 |
|
cpu_backend
|
merge main; Add torch q8 linear
|
2025-03-14 05:52:07 -04:00 |
|
cuda
|
merge main; Add torch q8 linear
|
2025-03-14 05:52:07 -04:00 |
|
examples
|
[feature] release 0.1.3
|
2024-08-28 16:11:43 +00:00 |
|
triton
|
fix fp8 multi gpu; update FQA
|
2025-02-25 10:52:29 +00:00 |
|
vendors
|
merge main; Add torch q8 linear
|
2025-03-14 05:52:07 -04:00 |
|
CMakeLists.txt
|
merge main; Add torch q8 linear
|
2025-03-14 05:52:07 -04:00 |
|
ext_bindings.cpp
|
merge main; Add torch q8 linear
|
2025-03-14 05:52:07 -04:00 |