kvcache-ai-ktransformers

mirror of https://github.com/kvcache-ai/ktransformers.git synced 2026-05-01 21:21:12 +00:00

History

mrhaoxx 7a9daf0cd4 Some checks are pending Book-CI / test (push) Waiting to run Details Book-CI / test-1 (push) Waiting to run Details Book-CI / test-2 (push) Waiting to run Details Deploy / deploy (macos-latest) (push) Waiting to run Details Deploy / deploy (ubuntu-latest) (push) Waiting to run Details Deploy / deploy (windows-latest) (push) Waiting to run Details [feat](kt-kernel): support avx2 only inference for bf16 fp8 and gptq int4 (#1892 ) * feat: support avx2 bf16 fp8 inference * feat: support avx2 gptq int4 inference * fix: numeric issues in fp8 dequant * Tutorial avx2 (#1900) * fix: prevent injecting -DLLAMA_AVX512=ON on AVX2-only machines * docs: add AVX2 tutorial for running KTransformers on AVX2-only CPUs * Tutorial avx2 (#1901) * fix: prevent injecting -DLLAMA_AVX512=ON on AVX2-only machines * docs: add AVX2 tutorial for running KTransformers on AVX2-only CPUs * docs: update README.md --------- Co-authored-by: Benjamin F <159887351+yyj6666667@users.noreply.github.com>		2026-03-27 14:45:02 +08:00
..
__init__.py	add ci (#1642 )	2025-11-25 20:52:08 +08:00
test_amd_placeholder.py	add ci (#1642 )	2025-11-25 20:52:08 +08:00
test_basic_cpu.py	Kt minimax (#1742 )	2025-12-24 15:39:44 +08:00
test_cuda_placeholder.py	add ci (#1642 )	2025-11-25 20:52:08 +08:00
test_moe_amx_accuracy_int4.py	Kt minimax (#1742 )	2025-12-24 15:39:44 +08:00
test_moe_amx_accuracy_int4_1.py	Kt minimax (#1742 )	2025-12-24 15:39:44 +08:00
test_moe_amx_accuracy_int4_1k.py	Kt minimax (#1742 )	2025-12-24 15:39:44 +08:00
test_moe_amx_accuracy_int8.py	Kt minimax (#1742 )	2025-12-24 15:39:44 +08:00
test_moe_amx_bench_int4.py	Kt minimax (#1742 )	2025-12-24 15:39:44 +08:00
test_moe_amx_bench_int4_1.py	Kt minimax (#1742 )	2025-12-24 15:39:44 +08:00
test_moe_amx_bench_int4_1k.py	Kt minimax (#1742 )	2025-12-24 15:39:44 +08:00
test_moe_amx_bench_int8.py	[feat](kt-kernel): CPU-GPU experts sched (#1796 )	2026-01-16 17:01:15 +08:00
test_moe_avx2_accuracy_bf16.py	[feat](kt-kernel): support avx2 only inference for bf16 fp8 and gptq int4 (#1892 )	2026-03-27 14:45:02 +08:00
test_moe_avx2_accuracy_fp8.py	[feat](kt-kernel): support avx2 only inference for bf16 fp8 and gptq int4 (#1892 )	2026-03-27 14:45:02 +08:00