mirror of
https://github.com/kvcache-ai/ktransformers.git
synced 2026-04-28 11:49:51 +00:00
update fp8 kernel tutorial
This commit is contained in:
parent
ca7366d2db
commit
4dc5518e4d
7 changed files with 46 additions and 5 deletions
|
|
@ -59,6 +59,7 @@ Supported operators and their corresponding classes are as follows:
|
|||
| Linear | KTransformersLinear | KLinearMarlin | Marlin as backend |
|
||||
| | | KLinearTorch | pytorch as backend |
|
||||
| | | KLinearCPUInfer | llamafile as backend |
|
||||
| | | KLinearFP8 | Triton fp8_gemm kernel. Requires GPU be able to caluculate fp8 data |
|
||||
| experts | KTransformersExperts | KExpertsTorch | pytorch as backend |
|
||||
| | | KExpertsMarlin | Marlin as backend |
|
||||
| | | KExpertsCPU | llamafile as backend |
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue