ouqingliang
4d51831316
fix MPSC
2025-06-26 13:11:40 +00:00
ouqingliang
3b4a1c7532
add prefix cache support for kvc2.
2025-06-26 04:57:25 +00:00
chenht2022
66453981ff
Fix NaN bug
2025-05-21 03:39:49 +00:00
rnwang04
142fb7ce6c
Enable support for Intel XPU devices, add support for DeepSeek V2/V3 first
2025-05-14 19:37:27 +00:00
qiyuxinlin
c6aa379de2
support safetensor load, delete architectures argument
2025-05-09 10:38:29 +00:00
Atream
b22cded890
fix AMX
2025-05-07 19:12:19 +08:00
Atream
192746cf93
clean-up
2025-04-29 10:32:42 +08:00
qiyuxinlin
063c5489b3
fix can not compile amx
2025-04-28 21:52:14 +00:00
djw
33cbd47086
support qwen3
2025-04-28 18:15:35 +00:00
djw
68c2b2e6e6
support qwen3
2025-04-28 18:02:07 +00:00
djw
0da3792b27
support qwen3
2025-04-28 14:05:24 +00:00
chenht2022
f3d842a0ca
support AMX
2025-04-25 14:47:16 +00:00
onepick
c5edd3fdf0
Fix cmake config error
...
Signed-off-by: onepick <jiajuku12@163.com>
2025-04-18 15:43:03 +08:00
onepick
6a7624fe4a
Change the logic to build device since cuda is as default
...
Signed-off-by: onepick <jiajuku12@163.com>
2025-04-17 19:44:05 +08:00
onepick
97f1995696
Fix some build error for ROCM
...
1. Fix terrible logic in CMakeLists.txt
2. using the correct typedef for hip
Signed-off-by: onepick <jiajuku12@163.com>
2025-04-17 11:34:33 +08:00
Aubrey Li
63ca2fa84d
xxHash: fix link error due to non-position-independent code
...
Add PROPERTIES POSITION_INDEPENDENT_CODE option to fix the
following error:
/usr/bin/ld: ../../third_party/xxHash/libxxhash.a(xxhash.c.o):
relocation R_X86_64_32S against `.rodata' can not be used when
making a shared object; recompile with -fPIC
Trying to link a non-PIC static library libxxhash.a into a
.so shared library, which is not allowed. The object file
xxhash.c.o must be recompiled with explicit -fPIC support.
2025-04-10 21:50:23 +08:00
Aubrey Li
45d20fa87b
balance_serve: Add g++ to compiler list
...
In some OS distributions, g++ exists in the following form:
# ls -l /usr/bin/g++*
-rwxr-xr-x 4 root root 985784 Dec 9 12:51 /usr/bin/g++
So make sure to add g++ to the compiler list as well.
2025-04-09 11:25:35 +08:00
qiyuxinlin
64de784328
format kvc2, delete quant_configs, move model_configs to ~/.ktransformers
2025-04-08 10:06:07 +00:00
Atream
9dd24ecd72
fix compile, add abi check to setup.py
2025-04-08 06:18:30 +00:00
Atream
fa87c1aeea
Update CMakeLists.txt
2025-04-07 10:32:30 +08:00
Atream
ad2009425c
Update CMakeLists.txt
2025-04-07 10:32:12 +08:00
Atream
25cee5810e
add balance-serve, support concurrence
2025-03-31 22:55:32 +08:00
Atream
8d0292aa44
refactor folders
2025-03-31 22:45:37 +08:00