Atream
|
7faa776659
|
Merge pull request #1277 from Coekjan/patch-1
Fix typo about `GLIBCXX_3.4.32`
|
2025-05-15 01:58:00 -06:00 |
|
aubreyli
|
af9472b518
|
Merge pull request #1306 from aubreyli/xpu-doc
xpu.md: add device discovery tips
|
2025-05-15 14:18:47 +08:00 |
|
Aubrey Li
|
72f6d93ffd
|
xpu.md: add device discovery tips
|
2025-05-15 14:12:26 +08:00 |
|
wang jiahao
|
8caecf37d8
|
Merge pull request #1305 from kvcache-ai/update-readme
fix deduplicate_and_sort cudagraphs
|
2025-05-15 12:10:20 +08:00 |
|
qiyuxinlin
|
b40f13abeb
|
fix deduplicate_and_sort cudagraphs
|
2025-05-15 04:09:34 +00:00 |
|
aubreyli
|
09f0ddc00b
|
Merge pull request #1303 from rnwang04/fix_typo_and_style
fix typo and code style, and update setup.py ValueError message
|
2025-05-15 10:55:58 +08:00 |
|
rnwang04
|
2f6e14a54b
|
fix md typo, fix code style, and update setup value error message
|
2025-05-15 10:14:39 +00:00 |
|
Atream
|
07c5f23da5
|
Merge pull request #1304 from kvcache-ai/Atream-patch-1
Update README.md
|
2025-05-14 20:29:54 -06:00 |
|
Atream
|
d051a14941
|
Update README.md
|
2025-05-15 10:29:43 +08:00 |
|
wang jiahao
|
2d3aaef8b6
|
Merge pull request #1301 from kvcache-ai/update-readme
Book-CI / test (push) Has been cancelled
Deploy / deploy (macos-latest) (push) Has been cancelled
Deploy / deploy (ubuntu-latest) (push) Has been cancelled
Deploy / deploy (windows-latest) (push) Has been cancelled
update readme
|
2025-05-14 21:15:55 +08:00 |
|
qiyuxinlin
|
d35d61f6a1
|
update readme
|
2025-05-14 13:15:18 +00:00 |
|
qiyuxinlin
|
c3d0ac80c6
|
update readme
|
2025-05-14 13:13:10 +00:00 |
|
wang jiahao
|
ee524b0f41
|
Merge pull request #1300 from kvcache-ai/qiyuxinlin-patch-1
Update install.sh
|
2025-05-14 21:09:20 +08:00 |
|
wang jiahao
|
9fe3f35c37
|
Update install.sh
|
2025-05-14 21:08:58 +08:00 |
|
aubreyli
|
f7ee993fdc
|
Merge pull request #1295 from rnwang04/xpu_support
Enable ktransformers on Intel GPU with local chat backend
|
2025-05-14 20:58:35 +08:00 |
|
rnwang04
|
142fb7ce6c
|
Enable support for Intel XPU devices, add support for DeepSeek V2/V3 first
|
2025-05-14 19:37:27 +00:00 |
|
wang jiahao
|
333351c7c8
|
Merge pull request #1298 from kvcache-ai/fix-workspace-buffer
Book-CI / test (push) Waiting to run
Deploy / deploy (macos-latest) (push) Waiting to run
Deploy / deploy (ubuntu-latest) (push) Waiting to run
Deploy / deploy (windows-latest) (push) Waiting to run
update norm cpu kernel
|
2025-05-14 17:50:40 +08:00 |
|
qiyuxinlin
|
ecc01cda17
|
update norm cpu kernel
|
2025-05-14 09:49:35 +00:00 |
|
wang jiahao
|
8974cc9d75
|
Merge pull request #1297 from kvcache-ai/fix-workspace-buffer
update torch MLA kernel
|
2025-05-14 17:46:55 +08:00 |
|
qiyuxinlin
|
64742bec83
|
update torch MLA kernel
|
2025-05-14 09:45:12 +00:00 |
|
wang jiahao
|
4e015ccc65
|
Merge pull request #1296 from kvcache-ai/fix-workspace-buffer
fix flashinfer float_workspace_buffer small
|
2025-05-14 17:35:27 +08:00 |
|
qiyuxinlin
|
e8e83308a9
|
fix flashinfer float_workspace_buffer small
|
2025-05-14 09:33:52 +00:00 |
|
wang jiahao
|
02948bc1b8
|
Merge pull request #1289 from kvcache-ai/update-default-config
Book-CI / test (push) Waiting to run
Deploy / deploy (macos-latest) (push) Waiting to run
Deploy / deploy (ubuntu-latest) (push) Waiting to run
Deploy / deploy (windows-latest) (push) Waiting to run
update default config
|
2025-05-13 20:23:25 +08:00 |
|
qiyuxinlin
|
697444905a
|
update default config
|
2025-05-13 12:20:21 +00:00 |
|
wang jiahao
|
8456222852
|
Merge pull request #1276 from kvcache-ai/support_load_safetensor
Book-CI / test (push) Has been cancelled
Deploy / deploy (macos-latest) (push) Has been cancelled
Deploy / deploy (ubuntu-latest) (push) Has been cancelled
Deploy / deploy (windows-latest) (push) Has been cancelled
support safetensor load, delete architectures argument
|
2025-05-12 11:10:26 +08:00 |
|
Yip Coekjan
|
1edc6d9de0
|
Fix typo about GLIBCXX_3.4.32
|
2025-05-09 20:34:45 +08:00 |
|
qiyuxinlin
|
c6aa379de2
|
support safetensor load, delete architectures argument
|
2025-05-09 10:38:29 +00:00 |
|
Atream
|
30eab48a75
|
Merge pull request #799 from aubreyli/cpu_offloading
Book-CI / test (push) Has been cancelled
Deploy / deploy (macos-latest) (push) Has been cancelled
Deploy / deploy (ubuntu-latest) (push) Has been cancelled
Deploy / deploy (windows-latest) (push) Has been cancelled
Restore CPU offloading capability
|
2025-05-09 00:38:54 -06:00 |
|
Atream
|
8025def197
|
Merge pull request #1246 from aubreyli/GenerationMixin
modeling_deepseek_v3: fix GenerationMixin warning
|
2025-05-09 00:35:15 -06:00 |
|
Atream
|
900a7f7c3e
|
Merge pull request #1271 from kvcache-ai/fix-AMX
Book-CI / test (push) Has been cancelled
Deploy / deploy (macos-latest) (push) Has been cancelled
Deploy / deploy (ubuntu-latest) (push) Has been cancelled
Deploy / deploy (windows-latest) (push) Has been cancelled
fix AMX
|
2025-05-07 05:12:38 -06:00 |
|
Atream
|
b22cded890
|
fix AMX
|
2025-05-07 19:12:19 +08:00 |
|
Yaochen Han
|
3f14e311cb
|
Merge pull request #1247 from aubreyli/_get_logits_warper
ktransformers/utils: fix _get_logits_warper error
|
2025-05-07 15:22:35 +08:00 |
|
Aubrey Li
|
b3a1fcf471
|
ktransformers/utils: fix _get_logits_warper error
|
2025-05-01 08:13:09 +08:00 |
|
Aubrey Li
|
def1ec7683
|
modeling_deepseek_v3: fix GenerationMixin warning
Fix GenerationMixin warning introduced by upgrading transformers to 4.51.3.
|
2025-05-01 07:48:15 +08:00 |
|
Atream
|
7530491f5b
|
Merge pull request #1244 from kvcache-ai/update-custom-flashinfer
Book-CI / test (push) Has been cancelled
Deploy / deploy (macos-latest) (push) Has been cancelled
Deploy / deploy (ubuntu-latest) (push) Has been cancelled
Deploy / deploy (windows-latest) (push) Has been cancelled
update-custom-flashinfer
|
2025-04-30 04:46:19 -06:00 |
|
Atream
|
753075728c
|
update-custom-flashinfer
|
2025-04-30 10:45:25 +00:00 |
|
Atream
|
a4bd6818ed
|
Merge pull request #1241 from kvcache-ai/fix-cache-lens
fix-cache-lens
|
2025-04-29 21:38:12 -06:00 |
|
Atream
|
7adb7281f4
|
fix-cache-lens
|
2025-04-30 03:37:43 +00:00 |
|
wang jiahao
|
8ba7e5d4b8
|
Merge pull request #1227 from kvcache-ai/change-yaml
change inject yaml
|
2025-04-29 16:10:37 +08:00 |
|
qiyuxinlin
|
48dfbc8f9f
|
change inject yaml
|
2025-04-29 08:09:39 +00:00 |
|
ZiWei Yuan
|
2a224b256e
|
Merge pull request #1225 from kvcache-ai/fix_typo_main
✨ update ignore
|
2025-04-29 13:26:02 +08:00 |
|
liam Yuan
|
0e8a36770a
|
✨ update ignore
|
2025-04-29 13:24:14 +08:00 |
|
ZiWei Yuan
|
c519747f3c
|
Merge pull request #1224 from kvcache-ai/fix_typo_main
✨ fix typo
|
2025-04-29 13:22:27 +08:00 |
|
liam Yuan
|
2762012039
|
✨ fix typo
|
2025-04-29 13:20:03 +08:00 |
|
Atream
|
ab26e7d7db
|
Merge pull request #1223 from kvcache-ai/fix-client
fix-client
|
2025-04-28 22:35:04 -06:00 |
|
Atream
|
0f7a3e5fea
|
fix-client
|
2025-04-29 12:34:20 +08:00 |
|
Atream
|
cc94a02ab5
|
Merge pull request #1222 from kvcache-ai/fix-compile
Fix compile
|
2025-04-28 22:13:09 -06:00 |
|
Atream
|
08035a7cda
|
Update requirements-local_chat.txt
|
2025-04-29 12:12:35 +08:00 |
|
Atream
|
fd9876049d
|
Update pyproject.toml
|
2025-04-29 12:11:11 +08:00 |
|
Atream
|
9d6e09efa6
|
Merge pull request #1221 from kvcache-ai/Atream-patch-5
Update AMX.md
|
2025-04-28 21:14:10 -06:00 |
|