kvcache-ai-ktransformers

mirror of https://github.com/kvcache-ai/ktransformers.git synced 2026-05-05 15:40:13 +00:00

History

ZiWei Yuan c2b8c60c4e [ci]: add int4_1 & int4_1k (#1653 ) * [feat]: init amd adaption * [feat]: add blis support * [fix]: fix setup and moe kernel warpper * [fix](setup.py): support rebuild with cache and import kt_kernel works fine * [feat]: add moe_kernel converter for amd and implement the load method(haven't tested yet) * [feat](moe_kernel/moe.hpp): delete unused memory when using save * [fix](moe_kernel): update PLAIN for pack * [fix](moe_kernel): rm printf debug * [fix](moe_kernel): skip gpu experts * [fix](moe_kernel/moe.hpp): update include memory path * [feat](moe_kernel/moe.hpp): support expert deferral * [feat]: finish amd * [ci]: add int4_1 & int4_1k --------- Co-authored-by: mrhaoxx <mr.haoxx@gmail.com>		2025-12-02 15:58:14 +08:00
..
__init__.py	add ci (#1642 )	2025-11-25 20:52:08 +08:00
test_amd_placeholder.py	add ci (#1642 )	2025-11-25 20:52:08 +08:00
test_basic_cpu.py	add ci (#1642 )	2025-11-25 20:52:08 +08:00
test_cuda_placeholder.py	add ci (#1642 )	2025-11-25 20:52:08 +08:00
test_moe_amx_accuracy_int4.py	update ci test (#1647 )	2025-11-27 16:39:48 +08:00
test_moe_amx_accuracy_int4_1.py	update ci test (#1647 )	2025-11-27 16:39:48 +08:00
test_moe_amx_accuracy_int4_1k.py	update ci test (#1647 )	2025-11-27 16:39:48 +08:00
test_moe_amx_accuracy_int8.py	update ci test (#1647 )	2025-11-27 16:39:48 +08:00
test_moe_amx_bench_int4.py	update ci test (#1647 )	2025-11-27 16:39:48 +08:00
test_moe_amx_bench_int4_1.py	[ci]: add int4_1 & int4_1k (#1653 )	2025-12-02 15:58:14 +08:00
test_moe_amx_bench_int4_1k.py	[ci]: add int4_1 & int4_1k (#1653 )	2025-12-02 15:58:14 +08:00
test_moe_amx_bench_int8.py	update ci test (#1647 )	2025-11-27 16:39:48 +08:00