| .. |
|
npu
|
update: Qwen3 MoE model adaptation for NPU (framework) (#1706)
|
2025-12-11 17:07:57 +08:00 |
|
rocm
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
xpu
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V2-Chat-multi-gpu-4.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V2-Chat-multi-gpu.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V2-Chat.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V2-Lite-Chat-gpu-cpu.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V2-Lite-Chat-multi-gpu.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V2-Lite-Chat.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V3-Chat-amx.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V3-Chat-fp8-linear-ggml-experts-serve-amx.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V3-Chat-fp8-linear-ggml-experts-serve.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V3-Chat-fp8-linear-ggml-experts.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V3-Chat-multi-gpu-4.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V3-Chat-multi-gpu-8.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V3-Chat-multi-gpu-fp8-linear-ggml-experts.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V3-Chat-multi-gpu-marlin.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V3-Chat-multi-gpu.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V3-Chat-npu.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V3-Chat-serve.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
DeepSeek-V3-Chat.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Glm4Moe-serve.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Internlm2_5-7b-Chat-1m.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Mixtral.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Moonlight-16B-A3B-serve.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Moonlight-16B-A3B.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Qwen2-57B-A14B-Instruct-multi-gpu.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Qwen2-57B-A14B-Instruct.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Qwen2-serve-amx.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Qwen2-serve.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Qwen3Moe-serve-amx.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Qwen3Moe-serve.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Qwen3Next-serve.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |
|
Smallthinker-serve.yaml
|
Refactor: restructure repository to focus on kt-kernel and KT-SFT modulesq recon (#1581)
|
2025-11-10 17:42:26 +08:00 |