Azure
227e81b0d3
update zh readme
2025-02-15 01:54:01 +00:00
Azure
ef89b1520b
* Reorganize documentation/README
...
* Consolidate the installation section, as it's currently too cluttered
* Move the Multi-GPU section to the top-level structure
* Add a **detailed** tutorial on registering extra GPU memory with Marlin
2025-02-14 19:58:26 +00:00
Azure
b0b90270d8
Merge pull request #306 from kvcache-ai/revert-305-main
...
Revert "[update] Reorganize documentation/README"
2025-02-15 03:44:42 +08:00
Azure
4f4ed36442
Revert "[update] Reorganize documentation/README"
2025-02-15 03:43:48 +08:00
Azure
19d4a50b1c
[update] Reorganize documentation/README
...
[update] Reorganize documentation/README
2025-02-15 03:41:43 +08:00
Azure
483182fc3a
fix typo and detail
2025-02-14 19:40:15 +00:00
Azure
823b25eec9
Reorganize documentation/README
2025-02-14 19:08:17 +00:00
Atream
cc8d627e32
Merge pull request #301 from kvcache-ai/fix-cuda-graph-bug
...
warm_up before capture
2025-02-14 23:54:43 +08:00
Atream
1946493f2d
warm_up before capture
2025-02-14 15:52:21 +00:00
Atream
cadd55078f
Merge pull request #295 from kvcache-ai/update-wechatgroup
...
Update wechatgroup
2025-02-14 19:52:05 +08:00
Atream
e153d78227
Add files via upload
2025-02-14 19:51:11 +08:00
Atream
96e6dff7ac
Delete WeChatGrouop.png
2025-02-14 19:49:14 +08:00
Atream
885a91e7db
Merge pull request #294 from kvcache-ai/feat-fast-MLA
...
Feat fast mla
2025-02-14 19:40:36 +08:00
Atream
1084d4e4b4
linux support triton MLA kernel
2025-02-14 11:38:55 +00:00
Azure
6738908699
Merge pull request #280 from Azure-Tang/main
...
[fix] Fix incorrect image content in the document
2025-02-14 17:12:14 +08:00
Azure
1b1f417267
Fix incorrect image content in the document
2025-02-14 09:04:22 +00:00
Azure
f4bb374eaf
Merge pull request #254 from Azure-Tang/main
...
[Update] Add V3/R1 8 gpu yaml example
2025-02-14 11:54:14 +08:00
Azure
95c81eaf01
Merge branch 'kvcache-ai:main' into main
2025-02-14 11:53:52 +08:00
Atream
0a9c59922a
Merge pull request #255 from kvcache-ai/update-wechatgroup
...
Add files via upload
2025-02-14 11:08:59 +08:00
Atream
ce7210321a
Add files via upload
2025-02-14 11:06:56 +08:00
Azure
b7653b9c4f
add V3/R1 8 gpu yaml example
2025-02-14 02:56:13 +00:00
Azure
e612b14739
Merge pull request #247 from liugddx/patch-1
...
[Doc]Fix dead link problem
2025-02-14 10:37:32 +08:00
Azure
ae5d9e11a9
Merge pull request #227 from hrz6976/main
...
Add a lock to server inference()
2025-02-14 10:35:11 +08:00
Guangdong Liu
e65be580ab
Fix dead link problem
2025-02-14 09:57:57 +08:00
Atream
bb35dc5b0d
init support for MLA using Attention kernel
2025-02-13 15:01:14 +00:00
ZiWei Yuan
a456e25a54
Merge pull request #200 from devin2255/main
...
add README_ZH.md
2025-02-13 22:22:25 +08:00
Hand Sonic
e490265242
feat: add GitHub Actions workflow for building Docker image
2025-02-13 22:09:49 +08:00
dhliu
d04b570fb5
edit README_ZH.md && add DeepseekR1_V3_tutorial_zh.md
2025-02-13 21:14:44 +08:00
Atream
aa21edd2fe
Merge pull request #230 from kvcache-ai/updata-wechatgroup-1
...
Updata wechatgroup 1
2025-02-13 19:33:51 +08:00
Atream
5fb9d65512
Add files via upload
2025-02-13 19:33:01 +08:00
Atream
ade346e09a
Delete WeChatGrouop.png
2025-02-13 19:31:46 +08:00
Atream
127965494c
Merge pull request #229 from kvcache-ai/updata-wechatgroup
...
Add files via upload
2025-02-13 19:31:13 +08:00
Atream
30e8e6a32a
Add files via upload
2025-02-13 19:30:39 +08:00
hrz6976
2c3dcd9774
Add a lock to server inference()
2025-02-13 10:05:22 +00:00
ZiWei Yuan
76b081879a
Merge pull request #224 from kvcache-ai/server_support
...
Server support
2025-02-13 17:28:08 +08:00
liam
8d5ebe49ab
📝 ⚡ fix some debug output and update doc
2025-02-13 17:25:12 +08:00
liam
ad2c52d72a
📝 update doc
2025-02-13 17:16:27 +08:00
Azure
8324e7fd9b
Merge pull request #220 from TensorBlock/main
...
Add optimization config for Deepseek V3/R1 with 4 GPUs
2025-02-13 16:41:39 +08:00
liam
c74453d8ca
📝 add doc support and fix bug in qwen2
2025-02-13 16:37:43 +08:00
MorphisZhang
aea4243712
Add optimization config for Deepseek V3/R1 with 4 GPUs
2025-02-13 16:32:28 +08:00
dhliu
318c88cbeb
add README_ZH.md
2025-02-13 12:43:06 +08:00
Atream
8bad019ef2
Merge pull request #180 from lusipad/patch-1
...
doc: fix clerical error
2025-02-13 10:25:30 +08:00
Atream
0905d2e270
Merge pull request #189 from Kattos/main
...
fix typo in README.md
2025-02-13 10:24:01 +08:00
ZiWei Yuan
9b5fd55a3c
Merge pull request #190 from kvcache-ai/KMSorSMS-patch-2
...
Update README.md
2025-02-13 10:18:08 +08:00
ZiWei Yuan
36ab3d7e6c
Update README.md
...
update png
2025-02-13 10:17:56 +08:00
cuichengyi
01655f7500
fix typo in README.md
2025-02-13 10:12:04 +08:00
Atream
a0c16db352
Merge pull request #183 from kvcache-ai/update-WeChatgroup
...
Update we chatgroup
2025-02-13 09:16:30 +08:00
Atream
78cc219274
Delete WeChatGrouop.jpg
2025-02-13 09:15:57 +08:00
Atream
ea76f7910a
Add files via upload
2025-02-13 09:15:30 +08:00
lusipad
8384badc69
doc: fix clerical error
2025-02-13 07:27:27 +08:00