Commit graph

103 commits

Author SHA1 Message Date
djw
dfaf2b20fb update llama4 tutorial 2025-04-18 03:42:48 +00:00
Atream
98dbdcd66c
Update llama4.md 2025-04-09 19:23:35 +08:00
Atream
346d202297
Update llama4.md 2025-04-09 19:09:44 +08:00
Atream
d1fcb208cc
Update llama4.md 2025-04-09 18:01:13 +08:00
Atream
ed2b971e02
Update llama4.md 2025-04-09 17:57:37 +08:00
djw
ecc3028c13 update llama4 tutorial 2025-04-09 09:34:04 +00:00
wang jiahao
23ceb1c049
Update balance-serve.md 2025-04-08 19:19:00 +08:00
wang jiahao
2fcdbee769
Update balance-serve.md 2025-04-07 19:19:49 +08:00
wang jiahao
6cbe044aae
Update balance-serve.md 2025-04-05 11:49:05 +08:00
255
578d3d9d09
📝 Docs: Clarify CMake version requirement for CUDA dialects
Adds a note explaining that default CMake versions on systems like
Ubuntu 22.04 LTS might not support newer CUDA dialects (e.g., CUDA 20),
leading to specific build errors.

Recommends installing a newer CMake via the Kitware APT repository
as a resolution. This helps users troubleshoot errors like:
"Target ... requires the language dialect 'CUDA20', but CMake does not know the compile flags..."
2025-04-04 20:11:59 +08:00
wangkuigang-yewu-cmss
c590583262 doc upgrade: model_path requirements and reasoning
* add documentations about `--model_path` requirements
* add `--force_think` in doc (most users would run R1 and would want it to provide reasoning process)
2025-04-03 15:16:56 +08:00
dongjw
1b7672937b update install doc and fix local_chat bug 2025-04-03 12:42:41 +08:00
dongjw
ab0b0f4ea1 fix local_chat and update balance-serve and SUMMARY doc 2025-04-03 12:19:43 +08:00
dongjw
8acb270c90 delete sudo install 2025-04-03 10:46:52 +08:00
wang jiahao
ee179c2ad0
Update balance-serve.md 2025-04-02 14:22:15 +08:00
dongjw
4ed9744ebb update readme 2025-04-02 14:02:57 +08:00
dongjw
b62cefaec9 update readme 2025-04-02 13:11:01 +08:00
Azure-Tang
aeabd783b0 update git action env, add BALANCE_SERVE=1 2025-04-01 11:21:55 +00:00
Azure-Tang
3a5330b215 Merge branch 'main' into work-concurrent 2025-04-01 06:48:19 +00:00
Atream
25cee5810e add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
liam
4748a912e2 📝 fix typo ktransformer->ktransformers 2025-03-17 17:54:00 +08:00
Azure-Tang
e5b001d76f Update readme; Format code; Add example yaml. 2025-03-14 14:25:52 -04:00
jqz
233ac55e2a
Update install.md
this  is a issue in your install tutorial
2025-03-13 17:59:24 +08:00
liam
9c343b4f71 🔖 release v0.2.3 2025-03-05 20:24:11 +08:00
liam
848fe8ab97 release v0.2.3 2025-03-05 20:21:04 +08:00
Atream
f17fb9d8fe
Update install.md 2025-03-05 19:02:32 +08:00
Atream
44dafa034b
Merge pull request #781 from 3wweiweiwu/main
update documentation to fix error in numa enablement
2025-03-05 19:00:59 +08:00
Azure
034a116365 update readme 2025-03-05 10:04:43 +00:00
hybcloud
6b551a6ee3
fix minor typo 2025-03-05 11:09:29 +08:00
3wweiweiwu
b936bcc72e
use 'make dev_install' to make minimum change 2025-03-04 09:35:12 -05:00
Wix Woo
bbcc5481bb update documentation to fix error in numa enablement 2025-03-03 15:49:00 +00:00
Atream
735873a32a
Update DeepseekR1_V3_tutorial.md 2025-03-01 19:56:46 +08:00
Azure
ca93cf7548
Merge pull request #702 from Azure-Tang/update-readme
[UPDATE] Update documents.
2025-02-26 23:45:24 +08:00
Azure
c05ebb74b1 Update fp8 doc; Update install.md broken link 2025-02-26 15:43:08 +00:00
Atream
369f4d917d
Update DeepseekR1_V3_tutorial.md 2025-02-26 22:04:29 +08:00
Azure
99f6e42113
Merge pull request #668 from KMSorSMS/main
📝 update benchmark.md
2025-02-26 00:21:09 +08:00
liam
3ad12751cf 📝 update more detail and fix typo 2025-02-26 00:17:02 +08:00
liam
05339ad0ef 📝 update benchmark.md 2025-02-25 23:57:58 +08:00
Azure
bb6920ed72 update doc 2025-02-25 15:46:15 +00:00
Atream
13974eb264
Update DeepseekR1_V3_tutorial.md 2025-02-25 21:36:52 +08:00
Atream
03f8bc9f79
Update DeepseekR1_V3_tutorial.md add long context 2025-02-25 21:35:31 +08:00
Azure
7e5962af3d fix fp8 multi gpu; update FQA 2025-02-25 10:52:29 +00:00
liam
1b5ac67fca 📝 add benchmark.md 2025-02-25 17:45:17 +08:00
Azure
021822dd01 update FAQ 2025-02-25 09:02:32 +00:00
Azure
36fbeee341 Update doc 2025-02-25 08:21:18 +00:00
Azure
4dc5518e4d update fp8 kernel tutorial 2025-02-24 15:37:01 +00:00
JiamingMai
45faddf668 fix the link addresses 2025-02-21 17:53:20 +08:00
Zhoneym
c92a300bd8
Necessary tips for Node.js related issues 2025-02-19 16:37:18 +08:00
Atream
c9eb1f76f6
Merge pull request #384 from ubergarm/add-tutorial-notes
Add notes to DeepSeek-R1 tutorial documentation
2025-02-19 13:46:49 +08:00
Zhoneym
c72291832d
Update install.md 2025-02-19 10:39:43 +08:00