update fp8 kernel tutorial

2026-04-28 11:49:51 +00:00 · 2025-02-24 15:37:01 +00:00 · 2025-02-24 15:37:01 +00:00 · 4dc5518e4d
commit 4dc5518e4d
parent ca7366d2db
7 changed files with 46 additions and 5 deletions
--- a/doc/en/injection_tutorial.md
+++ b/doc/en/injection_tutorial.md
@ -59,6 +59,7 @@ Supported operators and their corresponding classes are as follows:
 | Linear    | KTransformersLinear    | KLinearMarlin           | Marlin as backend    |
 |           |                        | KLinearTorch            | pytorch as backend   |
 |           |                        | KLinearCPUInfer         | llamafile as backend |
+|           |                        | KLinearFP8         | Triton fp8_gemm kernel. Requires GPU be able to caluculate fp8 data |
 | experts   | KTransformersExperts   | KExpertsTorch           | pytorch as backend   |
 |           |                        | KExpertsMarlin          | Marlin as backend    |
 |           |                        | KExpertsCPU             | llamafile as backend |