vrr/kvcache-ai-ktransformers

mirror of https://github.com/kvcache-ai/ktransformers.git synced 2026-04-28 20:00:06 +00:00

Author	SHA1	Message	Date
Jianwei Dong	027832c590	[feat](kt-kernel): CPU-GPU experts sched (#1796 ) Some checks failed Book-CI / test (push) Has been cancelled Details Book-CI / test-1 (push) Has been cancelled Details Book-CI / test-2 (push) Has been cancelled Details Deploy / deploy (macos-latest) (push) Has been cancelled Details Deploy / deploy (ubuntu-latest) (push) Has been cancelled Details Deploy / deploy (windows-latest) (push) Has been cancelled Details	2026-01-16 17:01:15 +08:00
ErvinXie	a8667ddb58	[fix](test): fix import kt-kernel (#1728 )	2025-12-17 19:46:32 +08:00
Jiaqi Liao	e7d1c1de09	fix(llamafile): resolve deferred experts data race and update README (#1646 ) Some checks are pending Book-CI / test-1 (push) Waiting to run Details Book-CI / test-2 (push) Waiting to run Details Book-CI / test (push) Waiting to run Details Deploy / deploy (macos-latest) (push) Waiting to run Details Deploy / deploy (ubuntu-latest) (push) Waiting to run Details Deploy / deploy (windows-latest) (push) Waiting to run Details	2025-11-26 23:19:37 +08:00
Jiaqi Liao	94c25626dc	Fix kt-kernel for new wrapper (#1588 ) Some checks are pending Book-CI / test (push) Waiting to run Details Book-CI / test-1 (push) Waiting to run Details Book-CI / test-2 (push) Waiting to run Details Deploy / deploy (macos-latest) (push) Waiting to run Details Deploy / deploy (ubuntu-latest) (push) Waiting to run Details Deploy / deploy (windows-latest) (push) Waiting to run Details * update README for kt-kernel * style: format C++ and Python code in kt-kernel - Format C++ files: task_queue, ext_bindings, and MoE operators - Format Python utility modules: amx, llamafile, and loader - Improve code readability and consistency	2025-11-10 21:47:34 +08:00
Jiaqi Liao	9bc00e587b	Refactor KTMoEWrapper backend (#1587 ) Some checks are pending Book-CI / test (push) Waiting to run Details Book-CI / test-1 (push) Waiting to run Details Book-CI / test-2 (push) Waiting to run Details Deploy / deploy (macos-latest) (push) Waiting to run Details Deploy / deploy (ubuntu-latest) (push) Waiting to run Details Deploy / deploy (windows-latest) (push) Waiting to run Details * universal backend for cpu inference * expert defer	2025-11-10 20:26:15 +08:00

5 commits