ouqingliang
|
3b4a1c7532
|
add prefix cache support for kvc2.
|
2025-06-26 04:57:25 +00:00 |
|
Atream
|
753075728c
|
update-custom-flashinfer
|
2025-04-30 10:45:25 +00:00 |
|
qiyuxinlin
|
74bb7fdcf6
|
Merge remote-tracking branch 'dev/support-amx-2'
|
2025-04-28 18:46:51 +00:00 |
|
djw
|
3f9bbf1181
|
support qwen3, dont speak human language
|
2025-04-28 08:44:47 +00:00 |
|
PC-DOS
|
5379e68f19
|
Replaced Chinese comments with English to avoid breaking MSVC compiling
|
2025-04-26 03:20:23 +08:00 |
|
PC-DOS
|
5b4d9c41ac
|
Replaced Chinese comments with English to avoid breaking MSVC compiling
|
2025-04-26 03:18:01 +08:00 |
|
qiyuxinlin
|
03a65d6bea
|
roll back ktransformers backend, add max_tokens, max_completion_tokens param
|
2025-04-21 12:55:37 +00:00 |
|
Iwan Kawrakow
|
99a247e167
|
Spelling
|
2025-04-11 10:15:42 +03:00 |
|
Iwan Kawrakow
|
c46b0c59d0
|
Add missing references to ik_llama.cpp
|
2025-04-11 09:39:57 +03:00 |
|
Atream
|
80c5cbecdd
|
add nlohmann
|
2025-04-01 10:38:45 +08:00 |
|
Atream
|
9360d1e3c8
|
add submodules
|
2025-03-31 23:20:29 +08:00 |
|
liu.shen
|
26bd889ff8
|
fix #829: 兼容Intel Cascade Lake架构的CPU
|
2025-03-09 19:26:12 +08:00 |
|
liam
|
8eeb6dd432
|
⚡ update compile option for avx512vpopcntdq
|
2025-03-06 12:18:04 +08:00 |
|
Azure
|
dd390835ca
|
Add compile condition
|
2025-03-06 03:25:39 +00:00 |
|
Azure
|
8068018504
|
fix gcc compilation
|
2025-03-05 15:59:56 +00:00 |
|
moonshadow-25
|
d24d369332
|
iq1s files
|
2025-03-01 22:44:06 +08:00 |
|
moonshadow-25
|
c513ae59c3
|
iq1s files
|
2025-03-01 22:38:04 +08:00 |
|
moonshadow-25
|
9781d1e6f4
|
iq1s core
|
2025-03-01 21:48:25 +08:00 |
|
godrosev
|
93c5b75716
|
rem
|
2025-03-01 21:25:18 +08:00 |
|
godrosev
|
e6349eb240
|
iq1s
|
2025-03-01 21:00:11 +08:00 |
|
chenxl
|
1db4a67dca
|
[feature] add github action for pre compile
|
2024-08-14 16:54:50 +00:00 |
|
chenxl
|
f5f79f5c0e
|
[ADD] support multi-gpu qlen>1 q5_k
|
2024-08-12 11:41:26 +00:00 |
|
chenxl
|
18c42e67df
|
Initial commit
|
2024-07-27 16:06:58 +08:00 |
|