Commit graph

322 commits

Author SHA1 Message Date
Azure
227e81b0d3 update zh readme 2025-02-15 01:54:01 +00:00
Azure
ef89b1520b * Reorganize documentation/README
* Consolidate the installation section, as it's currently too cluttered
    * Move the Multi-GPU section to the top-level structure
    * Add a **detailed** tutorial on registering extra GPU memory with Marlin
2025-02-14 19:58:26 +00:00
Azure
b0b90270d8
Merge pull request #306 from kvcache-ai/revert-305-main
Revert "[update] Reorganize documentation/README"
2025-02-15 03:44:42 +08:00
Azure
4f4ed36442
Revert "[update] Reorganize documentation/README" 2025-02-15 03:43:48 +08:00
Azure
19d4a50b1c
[update] Reorganize documentation/README
2025-02-15 03:41:43 +08:00
Azure
483182fc3a fix typo and detail 2025-02-14 19:40:15 +00:00
Azure
823b25eec9 Reorganize documentation/README 2025-02-14 19:08:17 +00:00
Atream
cc8d627e32
Merge pull request #301 from kvcache-ai/fix-cuda-graph-bug
warm_up before capture
2025-02-14 23:54:43 +08:00
Atream
1946493f2d warm_up before capture 2025-02-14 15:52:21 +00:00
Atream
cadd55078f
Merge pull request #295 from kvcache-ai/update-wechatgroup
Update wechatgroup
2025-02-14 19:52:05 +08:00
Atream
e153d78227
Add files via upload 2025-02-14 19:51:11 +08:00
Atream
96e6dff7ac
Delete WeChatGrouop.png 2025-02-14 19:49:14 +08:00
Atream
885a91e7db
Merge pull request #294 from kvcache-ai/feat-fast-MLA
Feat fast mla
2025-02-14 19:40:36 +08:00
Atream
1084d4e4b4 linux support triton MLA kernel 2025-02-14 11:38:55 +00:00
Azure
6738908699
Merge pull request #280 from Azure-Tang/main
[fix] Fix incorrect image content in the document
2025-02-14 17:12:14 +08:00
Azure
1b1f417267 Fix incorrect image content in the document 2025-02-14 09:04:22 +00:00
Azure
f4bb374eaf
Merge pull request #254 from Azure-Tang/main
[Update] Add V3/R1 8 gpu yaml example
2025-02-14 11:54:14 +08:00
Azure
95c81eaf01
Merge branch 'kvcache-ai:main' into main 2025-02-14 11:53:52 +08:00
Atream
0a9c59922a
Merge pull request #255 from kvcache-ai/update-wechatgroup
Add files via upload
2025-02-14 11:08:59 +08:00
Atream
ce7210321a
Add files via upload 2025-02-14 11:06:56 +08:00
Azure
b7653b9c4f add V3/R1 8 gpu yaml example 2025-02-14 02:56:13 +00:00
Azure
e612b14739
Merge pull request #247 from liugddx/patch-1
[Doc]Fix dead link problem
2025-02-14 10:37:32 +08:00
Azure
ae5d9e11a9
Merge pull request #227 from hrz6976/main
Add a lock to server inference()
2025-02-14 10:35:11 +08:00
Guangdong Liu
e65be580ab
Fix dead link problem 2025-02-14 09:57:57 +08:00
Atream
bb35dc5b0d init support for MLA using Attention kernel 2025-02-13 15:01:14 +00:00
ZiWei Yuan
a456e25a54
Merge pull request #200 from devin2255/main
add README_ZH.md
2025-02-13 22:22:25 +08:00
Hand Sonic
e490265242
feat: add GitHub Actions workflow for building Docker image 2025-02-13 22:09:49 +08:00
dhliu
d04b570fb5 edit README_ZH.md && add DeepseekR1_V3_tutorial_zh.md 2025-02-13 21:14:44 +08:00
Atream
aa21edd2fe
Merge pull request #230 from kvcache-ai/updata-wechatgroup-1
Updata wechatgroup 1
2025-02-13 19:33:51 +08:00
Atream
5fb9d65512
Add files via upload 2025-02-13 19:33:01 +08:00
Atream
ade346e09a
Delete WeChatGrouop.png 2025-02-13 19:31:46 +08:00
Atream
127965494c
Merge pull request #229 from kvcache-ai/updata-wechatgroup
Add files via upload
2025-02-13 19:31:13 +08:00
Atream
30e8e6a32a
Add files via upload 2025-02-13 19:30:39 +08:00
hrz6976
2c3dcd9774 Add a lock to server inference() 2025-02-13 10:05:22 +00:00
ZiWei Yuan
76b081879a
Merge pull request #224 from kvcache-ai/server_support
Server support
2025-02-13 17:28:08 +08:00
liam
8d5ebe49ab 📝 fix some debug output and update doc 2025-02-13 17:25:12 +08:00
liam
ad2c52d72a 📝 update doc 2025-02-13 17:16:27 +08:00
Azure
8324e7fd9b
Merge pull request #220 from TensorBlock/main
Add optimization config for Deepseek V3/R1 with 4 GPUs
2025-02-13 16:41:39 +08:00
liam
c74453d8ca 📝 add doc support and fix bug in qwen2 2025-02-13 16:37:43 +08:00
MorphisZhang
aea4243712 Add optimization config for Deepseek V3/R1 with 4 GPUs 2025-02-13 16:32:28 +08:00
dhliu
318c88cbeb add README_ZH.md 2025-02-13 12:43:06 +08:00
Atream
8bad019ef2
Merge pull request #180 from lusipad/patch-1
doc: fix clerical error
2025-02-13 10:25:30 +08:00
Atream
0905d2e270
Merge pull request #189 from Kattos/main
fix typo in README.md
2025-02-13 10:24:01 +08:00
ZiWei Yuan
9b5fd55a3c
Merge pull request #190 from kvcache-ai/KMSorSMS-patch-2
Update README.md
2025-02-13 10:18:08 +08:00
ZiWei Yuan
36ab3d7e6c
Update README.md
update png
2025-02-13 10:17:56 +08:00
cuichengyi
01655f7500 fix typo in README.md 2025-02-13 10:12:04 +08:00
Atream
a0c16db352
Merge pull request #183 from kvcache-ai/update-WeChatgroup
Update we chatgroup
2025-02-13 09:16:30 +08:00
Atream
78cc219274
Delete WeChatGrouop.jpg 2025-02-13 09:15:57 +08:00
Atream
ea76f7910a
Add files via upload 2025-02-13 09:15:30 +08:00
lusipad
8384badc69
doc: fix clerical error 2025-02-13 07:27:27 +08:00