kvcache-ai-ktransformers

mirror of https://github.com/kvcache-ai/ktransformers.git synced 2026-04-28 03:39:48 +00:00

History

ZiWei Yuan 1374b98ee5 Some checks are pending Book-CI / test (push) Waiting to run Details Book-CI / test-1 (push) Waiting to run Details Book-CI / test-2 (push) Waiting to run Details Deploy / deploy (macos-latest) (push) Waiting to run Details Deploy / deploy (ubuntu-latest) (push) Waiting to run Details Deploy / deploy (windows-latest) (push) Waiting to run Details [feat](moe_kernel): add amd blis support (int8) (#1600 ) * [feat]: init amd adaption * [feat]: add blis support * [fix]: fix setup and moe kernel warpper * [fix](setup.py): support rebuild with cache and import kt_kernel works fine * [feat]: add moe_kernel converter for amd and implement the load method(haven't tested yet) * [feat](moe_kernel/moe.hpp): delete unused memory when using save * [fix](moe_kernel): update PLAIN for pack * [fix](moe_kernel): rm printf debug * [fix](moe_kernel): skip gpu experts * [fix](moe_kernel/moe.hpp): update include memory path * [feat](moe_kernel/moe.hpp): support expert deferral * [feat]: finish amd --------- Co-authored-by: mrhaoxx <mr.haoxx@gmail.com>		2025-11-27 12:08:53 +08:00
..
__init__.py	Fix kt-kernel compile issue (#1595 )	2025-11-11 19:30:27 +08:00
amx.py	Fix kt-kernel for new wrapper (#1588 )	2025-11-10 21:47:34 +08:00
llamafile.py	Fix kt-kernel for new wrapper (#1588 )	2025-11-10 21:47:34 +08:00
loader.py	Fix kt-kernel for new wrapper (#1588 )	2025-11-10 21:47:34 +08:00
moe_kernel.py	[feat](moe_kernel): add amd blis support (int8) (#1600 )	2025-11-27 12:08:53 +08:00