Commit graph

877 commits

Author SHA1 Message Date
ZiWei Yuan
4f22d726a5
Merge pull request #847 from isxylands/fix-avx512-popcount
fix #829: support older AVX512-capable CPUs
2025-03-11 21:07:51 +08:00
Lander-Hatsune
d166fb9f6e cpuinfer: filter repeated backend instantiation 2025-03-10 22:03:04 +08:00
Yuhao Tsui
e5694f91c0
Merge branch 'kvcache-ai:main' into main 2025-03-10 09:10:28 +08:00
liu.shen
26bd889ff8 fix #829: 兼容Intel Cascade Lake架构的CPU 2025-03-09 19:26:12 +08:00
Atream
09c043d8a6
Merge pull request #842 from BITcyman/fix-openai_chat_completion
[fix] thread context bug
2025-03-07 22:56:19 +08:00
BITcyman
08a8b553d6 [fix] thread context bug 2025-03-07 14:52:16 +00:00
Atream
7544eadd41
Merge pull request #840 from kvcache-ai/Atream-patch-3
release 0.2.3.post1
2025-03-07 22:09:09 +08:00
Atream
f8c1821f1d
Update __init__.py 2025-03-07 22:08:48 +08:00
Atream
0bc43e02bf
Merge pull request #839 from kvcache-ai/fix-precision-flashinfer
fix flashinfer precision
2025-03-07 22:08:06 +08:00
Atream
d453c320f1 fix flashinfer precision 2025-03-07 14:07:00 +00:00
wang jiahao
96d75d53df
Merge pull request #835 from BITcyman/fix-openai_chat_completion
[fix] support openai chat completion api
2025-03-07 17:22:00 +08:00
BITcyman
299c4dca64 [update] support openai chat completion api 2025-03-07 08:51:09 +00:00
ZiWei Yuan
63b1c8525b
Merge pull request #820 from kvcache-ai/develop-0.2.3
Develop 0.2.3 ready to release
2025-03-06 14:46:09 +08:00
liam
407e1b9ab2 🚑 Merge branch 'develop-0.2.3' of https://github.com/kvcache-ai/ktransformers into develop-0.2.3 2025-03-06 13:52:01 +08:00
Azure
27475cf8a5 fix compile 2025-03-06 05:34:37 +00:00
liam
8eeb6dd432 update compile option for avx512vpopcntdq 2025-03-06 12:18:04 +08:00
Azure
dd390835ca Add compile condition 2025-03-06 03:25:39 +00:00
Yuhao Tsui
d050d8655f
Update completions.py 2025-03-06 11:16:33 +08:00
Azure
1bcfce8cad
Merge pull request #815 from kvcache-ai/develop-0.2.3
[fix] fix gcc compilation
2025-03-06 00:07:17 +08:00
Azure
8068018504 fix gcc compilation 2025-03-05 15:59:56 +00:00
Atream
6f142a51e6
Merge pull request #813 from chenmz00/main
fix: list models API
2025-03-05 22:00:38 +08:00
chenmz00
b2ba795cfd
fix: list models API
Fix the list models API to match the corresponding OpenAI API format.
2025-03-05 21:49:27 +08:00
ZiWei Yuan
2753a4a654
Merge pull request #810 from kvcache-ai/v0.2.3
V0.2.3
2025-03-05 20:32:53 +08:00
liam
9c343b4f71 🔖 release v0.2.3 2025-03-05 20:24:11 +08:00
ZiWei Yuan
6c35ca75b3
Merge pull request #809 from KMSorSMS/develop-0.2.3
 release v0.2.3
2025-03-05 20:23:03 +08:00
liam
848fe8ab97 release v0.2.3 2025-03-05 20:21:04 +08:00
Atream
f03faa5376
Merge pull request #808 from kvcache-ai/Atream-patch-2
Update install.md
2025-03-05 19:02:45 +08:00
Atream
f17fb9d8fe
Update install.md 2025-03-05 19:02:32 +08:00
Atream
44dafa034b
Merge pull request #781 from 3wweiweiwu/main
update documentation to fix error in numa enablement
2025-03-05 19:00:59 +08:00
Azure
034a116365 update readme 2025-03-05 10:04:43 +00:00
Azure
d7becadcf7 Merge branch 'develop-0.2.3' of https://github.com/kvcache-ai/ktransformers into develop-0.2.3 2025-03-05 09:26:23 +00:00
Azure
662c1e4c14 small fix about max new token 2025-03-05 09:25:41 +00:00
Atream
52fa3003d1
Merge pull request #791 from KMSorSMS/develop-0.2.3
 add humaneval support
2025-03-05 11:28:36 +08:00
Atream
3a3021502e
Merge pull request #795 from hybcloud/main
fix minor typo
2025-03-05 11:27:26 +08:00
hybcloud
6b551a6ee3
fix minor typo 2025-03-05 11:09:29 +08:00
3wweiweiwu
b936bcc72e
use 'make dev_install' to make minimum change 2025-03-04 09:35:12 -05:00
liam
dc10480ef6 add humaneval support 2025-03-04 20:54:49 +08:00
Wix Woo
bbcc5481bb update documentation to fix error in numa enablement 2025-03-03 15:49:00 +00:00
Azure
5aee6c0446 Merge branch 'main' into develop-0.2.3 2025-03-03 12:19:47 +00:00
Atream
f46b3fd51c
Merge pull request #778 from Conless/fix-shape
[Fix] Shape error in KLinearMarlin
2025-03-03 19:58:52 +08:00
Yi Pan
01755a60c0
fix: wrong shape in KLinearMarlin. 2025-03-03 17:34:45 +08:00
Atream
4e43e8a4ee
Merge pull request #775 from kvcache-ai/Atream-patch-1
Update __init__.py
2025-03-03 16:50:02 +08:00
Atream
8963ae7817
Update __init__.py 2025-03-03 16:49:50 +08:00
Atream
659583a92c
Merge pull request #770 from SkqLiao/main
Introduce Testing Jobs for kTransformers Setup on Self-Hosted Runner
2025-03-03 14:35:26 +08:00
Azure
216a63b858
Merge pull request #754 from moonshadow-25/dev023
Support for IQ1_S(Dynamic 1.58-bit)
2025-03-03 14:19:33 +08:00
Jiaqi Liao
7d96e2eeba fix environment name 2025-03-03 14:01:40 +08:00
Jiaqi Liao
3d62579a6a fix environment name 2025-03-03 13:55:40 +08:00
Jiaqi Liao
5fe0d138ca add install job for self-host testing 2025-03-03 13:52:13 +08:00
wang jiahao
48b9800790
Merge pull request #759 from 3wweiweiwu/fix_top_p_typo
fix typo for top_p
2025-03-02 13:58:11 +08:00
wang jiahao
bb54b68e5c
Merge pull request #761 from kvcache-ai/fix-server-bug
fix ollama api temperature bug
2025-03-02 13:56:46 +08:00