Commit graph

34 commits

Author SHA1 Message Date
Shaoxu Cheng
f25e58ad69
fix: qwen3-npu bugs; update: add readme-for-qwen3-npu (#1717)
* fix: qwen3-npu bugs; update: add readme-for-qwen3-npu

* fix: Correct the README description
2025-12-16 14:27:04 +08:00
RICHARDNAN
18fb8fc897
Npu revise benchmark results and prerequisites (#1716)
* Update DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md

* Update DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md

* Revise Ascend NPU tutorial for Docker deployment

Updated the tutorial for deploying the Ascend NPU, changing sections from 'Conda部署' to '镜像部署' and providing specific commands for Docker container setup and Python environment installation.

* Update DeepseekR1 tutorial for Ascend NPU

* Update DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md

* Update W8A8 weight link in tutorial

* Update doc/zh/DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

* Refactor Docker command and update package manager

Updated Docker run command to simplify device specifications and corrected package manager command from 'apt' to 'yum'.

* Update DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md

* Revise benchmark results and prerequisites

Updated performance results and hardware specifications.

* Update doc/zh/DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

---------

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-12-16 14:26:44 +08:00
RICHARDNAN
6431888928
add deploy in docker image (#1691) 2025-12-11 14:11:27 +08:00
RICHARDNAN
2cffdf7033
[docs]: Update DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md (#1638)
Some checks failed
Book-CI / test (push) Has been cancelled
Book-CI / test-1 (push) Has been cancelled
Book-CI / test-2 (push) Has been cancelled
Deploy / deploy (macos-latest) (push) Has been cancelled
Deploy / deploy (ubuntu-latest) (push) Has been cancelled
Deploy / deploy (windows-latest) (push) Has been cancelled
2025-11-24 11:51:07 +08:00
JimmyPeilinLi
1c08a4f0fb fix: remove py310 as guide 2025-11-08 08:54:32 +00:00
Peilin Li
fe556bba34
Update installation instructions 2025-11-04 23:03:36 +08:00
KMSorSMS
0c15da437f [feat](cmake & doc): fix bug with cmake arch detect & update doc for sft 2025-11-04 08:46:26 +00:00
JimmyPeilinLi
7b6ccc3f57 add the docs and update README for KSFT 2025-11-04 05:51:48 +00:00
RICHARDNAN
6085dea039
Update DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md 2025-10-30 10:05:54 +08:00
RICHARDNAN
536bea29aa
Update DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md 2025-10-30 10:03:50 +08:00
RICHARDNAN
d96614627d
Update DeepseekR1_V3_tutorial_zh_for_Ascend_NPU.md 2025-10-30 09:53:31 +08:00
RICHARDNAN
2a29a57b7a
Rename tutorial file for DeepseekR1 V3 2025-10-30 09:50:14 +08:00
RICHARDNAN
2716345637
Update tutorial to reflect Deepseek-R1 deployment 2025-10-30 09:48:37 +08:00
RICHARDNAN
6b68fc68d2
Update optimize_config_path for NPU tutorial 2025-10-29 10:47:44 +08:00
RICHARDNAN
bb14f7594e
Revise KTrans benchmark results in tutorial
Updated benchmark results for KTrans performance.
2025-10-29 09:44:57 +08:00
RICHARDNAN
69af4ddae8
Update DeepseekR1_tutorial_zh_for_Ascend_NPU.md 2025-10-28 22:11:04 +08:00
RICHARDNAN
59a722bf6f
Update doc/zh/DeepseekR1_tutorial_zh_for_Ascend_NPU.md
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-10-28 22:08:27 +08:00
RICHARDNAN
578ed0bfd0
Update doc/zh/DeepseekR1_tutorial_zh_for_Ascend_NPU.md
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-10-28 22:07:48 +08:00
RICHARDNAN
f9028f0315
Update doc/zh/DeepseekR1_tutorial_zh_for_Ascend_NPU.md
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-10-28 22:07:37 +08:00
RICHARDNAN
6f028ea444
Merge branch 'main' into csx-main-fix 2025-10-28 22:05:43 +08:00
RICHARDNAN
727aefe620 Update DeepseekR1_tutorial_zh_for_Ascend_NPU.md 2025-10-28 21:55:59 +08:00
cen121212
7636e861fd
Merge pull request #30 from RICHARDNAN/csx-main-fix
删除废弃代码
2025-10-25 10:06:10 +08:00
RICHARDNAN
48fdacedd0 删除废弃代码 2025-10-25 09:52:43 +08:00
RICHARDNAN
376e9d674f
Add RANK and LOCAL_WORLD_SIZE environment variables
Added environment variables for rank and local world size.
2025-10-24 15:17:33 +08:00
RICHARDNAN
0787ba97ee
Update supported NPU 2025-10-24 15:08:30 +08:00
RICHARDNAN
573c603656 Update DeepseekR1_tutorial_zh_for_Ascend_NPU.md 2025-10-24 11:59:53 +08:00
RICHARDNAN
ca4b3a9011 新增npu readme 2025-10-24 11:56:22 +08:00
Alisehen
055680e26c add flashinfer to cuda device 2025-05-15 07:03:45 +00:00
qiyuxinlin
c3d0ac80c6 update readme 2025-05-14 13:13:10 +00:00
dongjw
8acb270c90 delete sudo install 2025-04-03 10:46:52 +08:00
Atream
25cee5810e add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
John W. Leimgruber III
bb39eeb005 Add notes to DeepSeek-R1 tutorial documentation 2025-02-16 13:40:27 -05:00
dhliu
d04b570fb5 edit README_ZH.md && add DeepseekR1_V3_tutorial_zh.md 2025-02-13 21:14:44 +08:00
chenxl
18c42e67df Initial commit 2024-07-27 16:06:58 +08:00