chenxl
b5024f62a4
Update Kimi-K2 Readme
2025-07-12 12:51:00 +08:00
Atream
34d2829f24
Update Kimi-K2.md
2025-07-12 12:44:41 +08:00
Atream
90245d8a6b
Update Kimi-K2.md
2025-07-12 11:57:51 +08:00
Atream
378e4fc035
Update Kimi-K2.md
2025-07-12 11:47:42 +08:00
Atream
b4ed8b6ded
Update Kimi-K2.md
2025-07-11 23:26:18 +08:00
UnicornChan
7800a413a2
Update Kimi-K2.md
2025-07-11 19:31:58 +08:00
Atream
b4ac21454b
Create Kimi-K2.md
2025-07-11 09:31:47 +08:00
ouqingliang
90cff820cf
update kvc disk path config.
2025-06-30 15:09:35 +00:00
ErvinXie
5a73aaf652
Update prefix_cache.md
2025-06-30 15:04:37 +08:00
ouqingliang
cc822df65d
add prefix cache documentation
2025-06-28 07:13:33 +00:00
Shaojun Liu
404ad39a04
docs: add Dockerfile.xpu and GPU driver setup instructions
...
- Add Dockerfile.xpu for oneAPI-based container
- Create Docker_xpu.md with usage instructions
- Update xpu.md to include Docker guide
2025-05-28 13:55:35 +08:00
rnwang04
adc0906967
add XPU support for qwen3moe local chat
2025-05-22 21:01:41 +08:00
wang jiahao
32f3d7befb
Merge pull request #1307 from kvcache-ai/hyc
...
add xpu parameters to install.sh
2025-05-17 15:25:33 +08:00
rnwang04
a56aa45186
fix ipex-llm version to 2.3.0rc1
2025-05-16 12:22:08 +08:00
Shaoyuan CHEN
5d194c5db0
Fix typos
2025-05-15 22:15:55 +08:00
Atream
7faa776659
Merge pull request #1277 from Coekjan/patch-1
...
Fix typo about `GLIBCXX_3.4.32`
2025-05-15 01:58:00 -06:00
Alisehen
055680e26c
add flashinfer to cuda device
2025-05-15 07:03:45 +00:00
Alisehen
f3be33a313
add xpu parameters to install.sh
2025-05-15 06:39:02 +00:00
Aubrey Li
72f6d93ffd
xpu.md: add device discovery tips
2025-05-15 14:12:26 +08:00
rnwang04
2f6e14a54b
fix md typo, fix code style, and update setup value error message
2025-05-15 10:14:39 +00:00
qiyuxinlin
d35d61f6a1
update readme
2025-05-14 13:15:18 +00:00
qiyuxinlin
c3d0ac80c6
update readme
2025-05-14 13:13:10 +00:00
rnwang04
142fb7ce6c
Enable support for Intel XPU devices, add support for DeepSeek V2/V3 first
2025-05-14 19:37:27 +00:00
Yip Coekjan
1edc6d9de0
Fix typo about GLIBCXX_3.4.32
2025-05-09 20:34:45 +08:00
Atream
28948aacc9
Update AMX.md
2025-04-29 11:12:51 +08:00
Atream
14efb15593
Update AMX.md
2025-04-29 11:03:59 +08:00
Atream
073ce601e0
Update AMX.md
2025-04-29 10:29:51 +08:00
qiyuxinlin
e70db18b63
update AMX readme
2025-04-28 23:08:38 +00:00
qiyuxinlin
2e905c8bd4
update AMX readme
2025-04-28 23:03:32 +00:00
qiyuxinlin
a3ba63665a
update readme
2025-04-28 22:38:41 +00:00
qiyuxinlin
89823ccb1f
update readme
2025-04-28 22:34:47 +00:00
qiyuxinlin
e7763a4b59
update readme
2025-04-28 22:32:35 +00:00
qiyuxinlin
d3ebdafd4b
update readme
2025-04-28 22:31:09 +00:00
qiyuxinlin
59b0631e33
update readme
2025-04-28 22:26:38 +00:00
qiyuxinlin
cb5617b479
update readme
2025-04-28 22:14:23 +00:00
qiyuxinlin
74bb7fdcf6
Merge remote-tracking branch 'dev/support-amx-2'
2025-04-28 18:46:51 +00:00
qiyuxinlin
be4b27e841
update doc
2025-04-28 18:24:15 +00:00
Emmanuel Ferdman
cb80cb31a6
Update reference to optimize rules directory
...
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
2025-04-26 01:43:18 -07:00
djw
dfaf2b20fb
update llama4 tutorial
2025-04-18 03:42:48 +00:00
Atream
98dbdcd66c
Update llama4.md
2025-04-09 19:23:35 +08:00
Atream
346d202297
Update llama4.md
2025-04-09 19:09:44 +08:00
Atream
d1fcb208cc
Update llama4.md
2025-04-09 18:01:13 +08:00
Atream
ed2b971e02
Update llama4.md
2025-04-09 17:57:37 +08:00
djw
ecc3028c13
update llama4 tutorial
2025-04-09 09:34:04 +00:00
wang jiahao
23ceb1c049
Update balance-serve.md
2025-04-08 19:19:00 +08:00
wang jiahao
2fcdbee769
Update balance-serve.md
2025-04-07 19:19:49 +08:00
wang jiahao
6cbe044aae
Update balance-serve.md
2025-04-05 11:49:05 +08:00
255
578d3d9d09
📝 Docs: Clarify CMake version requirement for CUDA dialects
...
Adds a note explaining that default CMake versions on systems like
Ubuntu 22.04 LTS might not support newer CUDA dialects (e.g., CUDA 20),
leading to specific build errors.
Recommends installing a newer CMake via the Kitware APT repository
as a resolution. This helps users troubleshoot errors like:
"Target ... requires the language dialect 'CUDA20', but CMake does not know the compile flags..."
2025-04-04 20:11:59 +08:00
wangkuigang-yewu-cmss
c590583262
doc upgrade: model_path requirements and reasoning
...
* add documentations about `--model_path` requirements
* add `--force_think` in doc (most users would run R1 and would want it to provide reasoning process)
2025-04-03 15:16:56 +08:00
dongjw
1b7672937b
update install doc and fix local_chat bug
2025-04-03 12:42:41 +08:00