mirror of
https://github.com/kvcache-ai/ktransformers.git
synced 2026-05-01 21:21:12 +00:00
|
Some checks are pending
Book-CI / test (push) Waiting to run
Book-CI / test-1 (push) Waiting to run
Book-CI / test-2 (push) Waiting to run
Deploy / deploy (macos-latest) (push) Waiting to run
Deploy / deploy (ubuntu-latest) (push) Waiting to run
Deploy / deploy (windows-latest) (push) Waiting to run
* feat: support avx2 bf16 fp8 inference * feat: support avx2 gptq int4 inference * fix: numeric issues in fp8 dequant * Tutorial avx2 (#1900) * fix: prevent injecting -DLLAMA_AVX512=ON on AVX2-only machines * docs: add AVX2 tutorial for running KTransformers on AVX2-only CPUs * Tutorial avx2 (#1901) * fix: prevent injecting -DLLAMA_AVX512=ON on AVX2-only machines * docs: add AVX2 tutorial for running KTransformers on AVX2-only CPUs * docs: update README.md --------- Co-authored-by: Benjamin F <159887351+yyj6666667@users.noreply.github.com> |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| test_amd_placeholder.py | ||
| test_basic_cpu.py | ||
| test_cuda_placeholder.py | ||
| test_moe_amx_accuracy_int4.py | ||
| test_moe_amx_accuracy_int4_1.py | ||
| test_moe_amx_accuracy_int4_1k.py | ||
| test_moe_amx_accuracy_int8.py | ||
| test_moe_amx_bench_int4.py | ||
| test_moe_amx_bench_int4_1.py | ||
| test_moe_amx_bench_int4_1k.py | ||
| test_moe_amx_bench_int8.py | ||
| test_moe_avx2_accuracy_bf16.py | ||
| test_moe_avx2_accuracy_fp8.py | ||