Commit graph

23 commits

Author SHA1 Message Date
ouqingliang
3b4a1c7532 add prefix cache support for kvc2. 2025-06-26 04:57:25 +00:00
Atream
753075728c update-custom-flashinfer 2025-04-30 10:45:25 +00:00
qiyuxinlin
74bb7fdcf6 Merge remote-tracking branch 'dev/support-amx-2' 2025-04-28 18:46:51 +00:00
djw
3f9bbf1181 support qwen3, dont speak human language 2025-04-28 08:44:47 +00:00
PC-DOS
5379e68f19 Replaced Chinese comments with English to avoid breaking MSVC compiling 2025-04-26 03:20:23 +08:00
PC-DOS
5b4d9c41ac Replaced Chinese comments with English to avoid breaking MSVC compiling 2025-04-26 03:18:01 +08:00
qiyuxinlin
03a65d6bea roll back ktransformers backend, add max_tokens, max_completion_tokens param 2025-04-21 12:55:37 +00:00
Iwan Kawrakow
99a247e167 Spelling 2025-04-11 10:15:42 +03:00
Iwan Kawrakow
c46b0c59d0 Add missing references to ik_llama.cpp 2025-04-11 09:39:57 +03:00
Atream
80c5cbecdd add nlohmann 2025-04-01 10:38:45 +08:00
Atream
9360d1e3c8 add submodules 2025-03-31 23:20:29 +08:00
liu.shen
26bd889ff8 fix #829: 兼容Intel Cascade Lake架构的CPU 2025-03-09 19:26:12 +08:00
liam
8eeb6dd432 update compile option for avx512vpopcntdq 2025-03-06 12:18:04 +08:00
Azure
dd390835ca Add compile condition 2025-03-06 03:25:39 +00:00
Azure
8068018504 fix gcc compilation 2025-03-05 15:59:56 +00:00
moonshadow-25
d24d369332 iq1s files 2025-03-01 22:44:06 +08:00
moonshadow-25
c513ae59c3 iq1s files 2025-03-01 22:38:04 +08:00
moonshadow-25
9781d1e6f4 iq1s core 2025-03-01 21:48:25 +08:00
godrosev
93c5b75716 rem 2025-03-01 21:25:18 +08:00
godrosev
e6349eb240 iq1s 2025-03-01 21:00:11 +08:00
chenxl
1db4a67dca [feature] add github action for pre compile 2024-08-14 16:54:50 +00:00
chenxl
f5f79f5c0e [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
chenxl
18c42e67df Initial commit 2024-07-27 16:06:58 +08:00