kvcache-ai-ktransformers

mirror of https://github.com/kvcache-ai/ktransformers.git synced 2026-04-28 11:49:51 +00:00

History

Shaoxu Cheng f25e58ad69 fix: qwen3-npu bugs; update: add readme-for-qwen3-npu (#1717 ) * fix: qwen3-npu bugs; update: add readme-for-qwen3-npu * fix: Correct the README description		2025-12-16 14:27:04 +08:00
..
configs	Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581 )	2025-11-10 17:42:26 +08:00
ktransformers_ext	Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581 )	2025-11-10 17:42:26 +08:00
models	fix: qwen3-npu bugs; update: add readme-for-qwen3-npu (#1717 )	2025-12-16 14:27:04 +08:00
operators	update: add cache class and ascend ln mlp op for qwen3 adapt npu (#1708 )	2025-12-11 17:08:35 +08:00
optimize	update: Qwen3 MoE model adaptation for NPU (framework) (#1706 )	2025-12-11 17:07:57 +08:00
server	update: Qwen3 MoE model adaptation for NPU (framework) (#1706 )	2025-12-11 17:07:57 +08:00
tests	update: add attention and ln ut for npu (#1698 )	2025-12-10 16:12:26 +08:00
util	fix: qwen3-npu bugs; update: add readme-for-qwen3-npu (#1717 )	2025-12-16 14:27:04 +08:00
website	Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581 )	2025-11-10 17:42:26 +08:00
__init__.py	Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581 )	2025-11-10 17:42:26 +08:00
ktransformers	Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581 )	2025-11-10 17:42:26 +08:00
local_chat.py	Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581 )	2025-11-10 17:42:26 +08:00
local_chat_test.py	Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581 )	2025-11-10 17:42:26 +08:00