Atream
|
7ebf82a492
|
Update Qwen3 date
|
2025-04-29 09:43:13 +08:00 |
|
wang jiahao
|
f27e4850f1
|
Merge pull request #1212 from kvcache-ai/support-amx-qwen
update AMX readme
|
2025-04-29 07:09:53 +08:00 |
|
qiyuxinlin
|
e70db18b63
|
update AMX readme
|
2025-04-28 23:08:38 +00:00 |
|
qiyuxinlin
|
2e905c8bd4
|
update AMX readme
|
2025-04-28 23:03:32 +00:00 |
|
wang jiahao
|
d7811a4f32
|
Merge pull request #1211 from kvcache-ai/support-amx-qwen
Support amx qwen
|
2025-04-29 06:44:48 +08:00 |
|
qiyuxinlin
|
a3ba63665a
|
update readme
|
2025-04-28 22:38:41 +00:00 |
|
qiyuxinlin
|
89823ccb1f
|
update readme
|
2025-04-28 22:34:47 +00:00 |
|
qiyuxinlin
|
e7763a4b59
|
update readme
|
2025-04-28 22:32:35 +00:00 |
|
qiyuxinlin
|
d3ebdafd4b
|
update readme
|
2025-04-28 22:31:09 +00:00 |
|
qiyuxinlin
|
59b0631e33
|
update readme
|
2025-04-28 22:26:38 +00:00 |
|
wang jiahao
|
ffb1f7bf09
|
Merge pull request #1210 from kvcache-ai/support-amx-qwen
Support amx and qwen3
|
2025-04-29 06:18:45 +08:00 |
|
qiyuxinlin
|
8f76c37d86
|
fix readme
|
2025-04-28 22:17:22 +00:00 |
|
qiyuxinlin
|
cb5617b479
|
update readme
|
2025-04-28 22:14:23 +00:00 |
|
qiyuxinlin
|
063c5489b3
|
fix can not compile amx
|
2025-04-28 21:52:14 +00:00 |
|
qiyuxinlin
|
27990dc6fb
|
fix load bug
|
2025-04-28 21:08:13 +00:00 |
|
qiyuxinlin
|
74bb7fdcf6
|
Merge remote-tracking branch 'dev/support-amx-2'
|
2025-04-28 18:46:51 +00:00 |
|
qiyuxinlin
|
be4b27e841
|
update doc
|
2025-04-28 18:24:15 +00:00 |
|
djw
|
33cbd47086
|
support qwen3
|
2025-04-28 18:15:35 +00:00 |
|
djw
|
68c2b2e6e6
|
support qwen3
|
2025-04-28 18:02:07 +00:00 |
|
djw
|
0da3792b27
|
support qwen3
|
2025-04-28 14:05:24 +00:00 |
|
djw
|
3f9bbf1181
|
support qwen3, dont speak human language
|
2025-04-28 08:44:47 +00:00 |
|
Chengyu Qiu
|
ba92cf1a3b
|
Merge pull request #1204 from emmanuel-ferdman/main
Change install.md and Update reference to optimize rules directory
|
2025-04-28 15:10:14 +08:00 |
|
Chen Hongtao
|
27e3b2b98d
|
Merge pull request #1202 from PC-DOS/main
Replaced Chinese comments in iqk_mul_mat.inc with English to avoid breaking MSVC compiling
|
2025-04-27 14:36:37 +08:00 |
|
Emmanuel Ferdman
|
cb80cb31a6
|
Update reference to optimize rules directory
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
|
2025-04-26 01:43:18 -07:00 |
|
PC-DOS
|
5379e68f19
|
Replaced Chinese comments with English to avoid breaking MSVC compiling
|
2025-04-26 03:20:23 +08:00 |
|
PC-DOS
|
5b4d9c41ac
|
Replaced Chinese comments with English to avoid breaking MSVC compiling
|
2025-04-26 03:18:01 +08:00 |
|
chenht2022
|
f3d842a0ca
|
support AMX
|
2025-04-25 14:47:16 +00:00 |
|
ZiWei Yuan
|
a7b995365e
|
Merge pull request #1197 from jizhilong/jizhilong-patch-1
Book-CI / test (push) Has been cancelled
Deploy / deploy (macos-latest) (push) Has been cancelled
Deploy / deploy (ubuntu-latest) (push) Has been cancelled
Deploy / deploy (windows-latest) (push) Has been cancelled
fix: make cpufeature a local import
|
2025-04-25 14:50:58 +08:00 |
|
liam
|
82920e7943
|
:spakles: update requirements for cpufeature
|
2025-04-25 06:49:56 +00:00 |
|
wang jiahao
|
b90362b5e6
|
Merge pull request #1198 from kvcache-ai/fix-max_new_tokens
Book-CI / test (push) Waiting to run
Deploy / deploy (macos-latest) (push) Waiting to run
Deploy / deploy (ubuntu-latest) (push) Waiting to run
Deploy / deploy (windows-latest) (push) Waiting to run
fix load default max_new_tokens
|
2025-04-25 12:22:41 +08:00 |
|
qiyuxinlin
|
7af83f9efb
|
fix load default max_new_tokens
|
2025-04-25 04:20:12 +00:00 |
|
jzl
|
9a759e9fb8
|
fix: make cpufeature a local import
|
2025-04-25 11:42:38 +08:00 |
|
Atream
|
67042d11e3
|
Merge pull request #1193 from kvcache-ai/fix-chat-template-encoding
Book-CI / test (push) Waiting to run
Deploy / deploy (ubuntu-latest) (push) Waiting to run
Deploy / deploy (macos-latest) (push) Waiting to run
Deploy / deploy (windows-latest) (push) Waiting to run
fix chat template encoding
|
2025-04-23 22:44:46 -06:00 |
|
Atream
|
46493789eb
|
fix chat template encoding
|
2025-04-24 12:44:16 +08:00 |
|
wang jiahao
|
449a83dff6
|
Merge pull request #1183 from kvcache-ai/check-para
Book-CI / test (push) Waiting to run
Deploy / deploy (macos-latest) (push) Waiting to run
Deploy / deploy (ubuntu-latest) (push) Waiting to run
Deploy / deploy (windows-latest) (push) Waiting to run
add check-para
|
2025-04-23 16:27:18 +08:00 |
|
Alisehen
|
f7d939313b
|
Merge remote-tracking branch 'origin/main' into check-para
|
2025-04-23 02:40:14 +00:00 |
|
Alisehen
|
99540ad01f
|
add check parameters
|
2025-04-23 02:38:43 +00:00 |
|
wang jiahao
|
7e4813e8ad
|
Merge pull request #1184 from kvcache-ai/update_param
Book-CI / test (push) Failing after 3s
Deploy / deploy (ubuntu-latest) (push) Failing after 2s
Deploy / deploy (macos-latest) (push) Has been cancelled
Deploy / deploy (windows-latest) (push) Has been cancelled
change test
|
2025-04-22 20:55:11 +08:00 |
|
qiyuxinlin
|
3a044e6b14
|
change test
|
2025-04-22 12:50:39 +00:00 |
|
Alisehen
|
c995bdbbfa
|
add check-para
|
2025-04-22 09:30:08 +00:00 |
|
wang jiahao
|
739358789e
|
Merge pull request #1182 from kvcache-ai/fix-kill-balance_serve
Book-CI / test (push) Failing after 5s
Deploy / deploy (ubuntu-latest) (push) Failing after 3s
Deploy / deploy (macos-latest) (push) Has been cancelled
Deploy / deploy (windows-latest) (push) Has been cancelled
kill serve lead to kill sched and engine
|
2025-04-22 17:28:06 +08:00 |
|
qiyuxinlin
|
4f9950e30c
|
kill serve lead to kill sched and engine
|
2025-04-22 09:25:44 +00:00 |
|
wang jiahao
|
4c41f3a35f
|
Merge pull request #1180 from kvcache-ai/update_param
update speed test
|
2025-04-22 15:39:57 +08:00 |
|
qiyuxinlin
|
b17ab8653c
|
update speed test
|
2025-04-22 07:38:05 +00:00 |
|
wang jiahao
|
485588017b
|
Merge pull request #1177 from kvcache-ai/update_param
Book-CI / test (push) Failing after 4s
Deploy / deploy (ubuntu-latest) (push) Failing after 2s
Deploy / deploy (macos-latest) (push) Has been cancelled
Deploy / deploy (windows-latest) (push) Has been cancelled
Update param
|
2025-04-22 10:14:36 +08:00 |
|
qiyuxinlin
|
f5287e908a
|
fix no balance_serve import error
|
2025-04-22 02:11:18 +00:00 |
|
qiyuxinlin
|
03a65d6bea
|
roll back ktransformers backend, add max_tokens, max_completion_tokens param
|
2025-04-21 12:55:37 +00:00 |
|
wang jiahao
|
a1162eea01
|
Merge pull request #1158 from Creeper-MZ/function_call
Book-CI / test (push) Failing after 9s
Deploy / deploy (ubuntu-latest) (push) Failing after 2s
Deploy / deploy (windows-latest) (push) Has been cancelled
Deploy / deploy (macos-latest) (push) Has been cancelled
Update Function call
|
2025-04-19 16:31:37 +08:00 |
|
Creeper-MZ
|
133ba746e9
|
优化提示词,解决部分Deepseek r1的兼容性
优化提示词,解决部分Deepseek r1的兼容性
fix non stream
|
2025-04-19 01:20:27 -04:00 |
|
Atream
|
34c199403b
|
Merge pull request #1170 from onepick/fix-cmake-error
Deploy / deploy (ubuntu-latest) (push) Failing after 3s
Deploy / deploy (windows-latest) (push) Has been cancelled
Book-CI / test (push) Has been cancelled
Deploy / deploy (macos-latest) (push) Has been cancelled
Fix cmake config error
|
2025-04-18 07:51:03 -06:00 |
|