kvcache-ai-ktransformers/archive/ktransformers
Shaoxu Cheng f25e58ad69
fix: qwen3-npu bugs; update: add readme-for-qwen3-npu (#1717)
* fix: qwen3-npu bugs; update: add readme-for-qwen3-npu

* fix: Correct the README description
2025-12-16 14:27:04 +08:00
..
configs Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581) 2025-11-10 17:42:26 +08:00
ktransformers_ext Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581) 2025-11-10 17:42:26 +08:00
models fix: qwen3-npu bugs; update: add readme-for-qwen3-npu (#1717) 2025-12-16 14:27:04 +08:00
operators update: add cache class and ascend ln mlp op for qwen3 adapt npu (#1708) 2025-12-11 17:08:35 +08:00
optimize update: Qwen3 MoE model adaptation for NPU (framework) (#1706) 2025-12-11 17:07:57 +08:00
server update: Qwen3 MoE model adaptation for NPU (framework) (#1706) 2025-12-11 17:07:57 +08:00
tests update: add attention and ln ut for npu (#1698) 2025-12-10 16:12:26 +08:00
util fix: qwen3-npu bugs; update: add readme-for-qwen3-npu (#1717) 2025-12-16 14:27:04 +08:00
website Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581) 2025-11-10 17:42:26 +08:00
__init__.py Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581) 2025-11-10 17:42:26 +08:00
ktransformers Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581) 2025-11-10 17:42:26 +08:00
local_chat.py Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581) 2025-11-10 17:42:26 +08:00
local_chat_test.py Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581) 2025-11-10 17:42:26 +08:00