Commit graph

177 commits

Author SHA1 Message Date
JimmyPeilinLi
7b6ccc3f57 add the docs and update README for KSFT 2025-11-04 05:51:48 +00:00
RICHARDNAN
6085dea039
Update DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md 2025-10-30 10:05:54 +08:00
RICHARDNAN
536bea29aa
Update DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md 2025-10-30 10:03:50 +08:00
RICHARDNAN
d96614627d
Update DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md 2025-10-30 09:53:31 +08:00
RICHARDNAN
2a29a57b7a
Rename tutorial file for DeepseekR1 V3 2025-10-30 09:50:14 +08:00
RICHARDNAN
2716345637
Update tutorial to reflect Deepseek-R1 deployment 2025-10-30 09:48:37 +08:00
RICHARDNAN
6b68fc68d2
Update optimize_config_path for NPU tutorial 2025-10-29 10:47:44 +08:00
RICHARDNAN
bb14f7594e
Revise KTrans benchmark results in tutorial
Updated benchmark results for KTrans performance.
2025-10-29 09:44:57 +08:00
RICHARDNAN
69af4ddae8
Update DeepseekR1_tutorial_zh_for_Ascend_NPU.md 2025-10-28 22:11:04 +08:00
RICHARDNAN
59a722bf6f
Update doc/zh/DeepseekR1_tutorial_zh_for_Ascend_NPU.md
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-10-28 22:08:27 +08:00
RICHARDNAN
578ed0bfd0
Update doc/zh/DeepseekR1_tutorial_zh_for_Ascend_NPU.md
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-10-28 22:07:48 +08:00
RICHARDNAN
f9028f0315
Update doc/zh/DeepseekR1_tutorial_zh_for_Ascend_NPU.md
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-10-28 22:07:37 +08:00
RICHARDNAN
6f028ea444
Merge branch 'main' into csx-main-fix 2025-10-28 22:05:43 +08:00
RICHARDNAN
727aefe620 Update DeepseekR1_tutorial_zh_for_Ascend_NPU.md 2025-10-28 21:55:59 +08:00
cen121212
7636e861fd
Merge pull request #30 from RICHARDNAN/csx-main-fix
删除废弃代码
2025-10-25 10:06:10 +08:00
RICHARDNAN
48fdacedd0 删除废弃代码 2025-10-25 09:52:43 +08:00
RICHARDNAN
376e9d674f
Add RANK and LOCAL_WORLD_SIZE environment variables
Added environment variables for rank and local world size.
2025-10-24 15:17:33 +08:00
RICHARDNAN
0787ba97ee
Update supported NPU 2025-10-24 15:08:30 +08:00
RICHARDNAN
573c603656 Update DeepseekR1_tutorial_zh_for_Ascend_NPU.md 2025-10-24 11:59:53 +08:00
RICHARDNAN
ca4b3a9011 新增npu readme 2025-10-24 11:56:22 +08:00
djw
0437660e62 fix bug 2025-09-16 13:21:58 +00:00
djw
a44b710649 support qwen3 next 2025-09-11 11:55:09 +00:00
Atream
64b3b30ba3
Update GGUF format link in Kimi-K2 documentation 2025-09-05 20:19:37 +08:00
Azure-Tang
b6d36bffbb update kimi-k2-0905 2025-09-05 03:52:43 +00:00
djw
5771990a07 update smallthinker and glm4 readme 2025-07-31 03:14:49 +00:00
djw
c7307aa0ae support smt and glm4 2025-07-25 16:24:38 +00:00
djw
17246bf84f support smt and glm4 2025-07-25 15:03:27 +00:00
chenxl
b5024f62a4 Update Kimi-K2 Readme 2025-07-12 12:51:00 +08:00
Atream
34d2829f24
Update Kimi-K2.md 2025-07-12 12:44:41 +08:00
Atream
90245d8a6b
Update Kimi-K2.md 2025-07-12 11:57:51 +08:00
Atream
378e4fc035
Update Kimi-K2.md 2025-07-12 11:47:42 +08:00
Atream
b4ed8b6ded
Update Kimi-K2.md 2025-07-11 23:26:18 +08:00
UnicornChan
7800a413a2
Update Kimi-K2.md 2025-07-11 19:31:58 +08:00
Atream
b4ac21454b
Create Kimi-K2.md 2025-07-11 09:31:47 +08:00
ouqingliang
90cff820cf update kvc disk path config. 2025-06-30 15:09:35 +00:00
ErvinXie
5a73aaf652
Update prefix_cache.md 2025-06-30 15:04:37 +08:00
ouqingliang
cc822df65d add prefix cache documentation 2025-06-28 07:13:33 +00:00
Shaojun Liu
404ad39a04 docs: add Dockerfile.xpu and GPU driver setup instructions
- Add Dockerfile.xpu for oneAPI-based container
- Create Docker_xpu.md with usage instructions
- Update xpu.md to include Docker guide
2025-05-28 13:55:35 +08:00
rnwang04
adc0906967 add XPU support for qwen3moe local chat 2025-05-22 21:01:41 +08:00
wang jiahao
32f3d7befb
Merge pull request #1307 from kvcache-ai/hyc
add xpu parameters to install.sh
2025-05-17 15:25:33 +08:00
rnwang04
a56aa45186 fix ipex-llm version to 2.3.0rc1 2025-05-16 12:22:08 +08:00
Shaoyuan CHEN
5d194c5db0
Fix typos 2025-05-15 22:15:55 +08:00
Atream
7faa776659
Merge pull request #1277 from Coekjan/patch-1
Fix typo about `GLIBCXX_3.4.32`
2025-05-15 01:58:00 -06:00
Alisehen
055680e26c add flashinfer to cuda device 2025-05-15 07:03:45 +00:00
Alisehen
f3be33a313 add xpu parameters to install.sh 2025-05-15 06:39:02 +00:00
Aubrey Li
72f6d93ffd xpu.md: add device discovery tips 2025-05-15 14:12:26 +08:00
rnwang04
2f6e14a54b fix md typo, fix code style, and update setup value error message 2025-05-15 10:14:39 +00:00
qiyuxinlin
d35d61f6a1 update readme 2025-05-14 13:15:18 +00:00
qiyuxinlin
c3d0ac80c6 update readme 2025-05-14 13:13:10 +00:00
rnwang04
142fb7ce6c Enable support for Intel XPU devices, add support for DeepSeek V2/V3 first 2025-05-14 19:37:27 +00:00