ZiWei Yuan
|
4f22d726a5
|
Merge pull request #847 from isxylands/fix-avx512-popcount
fix #829: support older AVX512-capable CPUs
|
2025-03-11 21:07:51 +08:00 |
|
Lander-Hatsune
|
d166fb9f6e
|
cpuinfer: filter repeated backend instantiation
|
2025-03-10 22:03:04 +08:00 |
|
Yuhao Tsui
|
e5694f91c0
|
Merge branch 'kvcache-ai:main' into main
|
2025-03-10 09:10:28 +08:00 |
|
liu.shen
|
26bd889ff8
|
fix #829: 兼容Intel Cascade Lake架构的CPU
|
2025-03-09 19:26:12 +08:00 |
|
Atream
|
09c043d8a6
|
Merge pull request #842 from BITcyman/fix-openai_chat_completion
[fix] thread context bug
|
2025-03-07 22:56:19 +08:00 |
|
BITcyman
|
08a8b553d6
|
[fix] thread context bug
|
2025-03-07 14:52:16 +00:00 |
|
Atream
|
7544eadd41
|
Merge pull request #840 from kvcache-ai/Atream-patch-3
release 0.2.3.post1
|
2025-03-07 22:09:09 +08:00 |
|
Atream
|
f8c1821f1d
|
Update __init__.py
|
2025-03-07 22:08:48 +08:00 |
|
Atream
|
0bc43e02bf
|
Merge pull request #839 from kvcache-ai/fix-precision-flashinfer
fix flashinfer precision
|
2025-03-07 22:08:06 +08:00 |
|
Atream
|
d453c320f1
|
fix flashinfer precision
|
2025-03-07 14:07:00 +00:00 |
|
wang jiahao
|
96d75d53df
|
Merge pull request #835 from BITcyman/fix-openai_chat_completion
[fix] support openai chat completion api
|
2025-03-07 17:22:00 +08:00 |
|
BITcyman
|
299c4dca64
|
[update] support openai chat completion api
|
2025-03-07 08:51:09 +00:00 |
|
ZiWei Yuan
|
63b1c8525b
|
Merge pull request #820 from kvcache-ai/develop-0.2.3
Develop 0.2.3 ready to release
|
2025-03-06 14:46:09 +08:00 |
|
liam
|
407e1b9ab2
|
🚑 Merge branch 'develop-0.2.3' of https://github.com/kvcache-ai/ktransformers into develop-0.2.3
|
2025-03-06 13:52:01 +08:00 |
|
Azure
|
27475cf8a5
|
fix compile
|
2025-03-06 05:34:37 +00:00 |
|
liam
|
8eeb6dd432
|
⚡ update compile option for avx512vpopcntdq
|
2025-03-06 12:18:04 +08:00 |
|
Azure
|
dd390835ca
|
Add compile condition
|
2025-03-06 03:25:39 +00:00 |
|
Yuhao Tsui
|
d050d8655f
|
Update completions.py
|
2025-03-06 11:16:33 +08:00 |
|
Azure
|
1bcfce8cad
|
Merge pull request #815 from kvcache-ai/develop-0.2.3
[fix] fix gcc compilation
|
2025-03-06 00:07:17 +08:00 |
|
Azure
|
8068018504
|
fix gcc compilation
|
2025-03-05 15:59:56 +00:00 |
|
Atream
|
6f142a51e6
|
Merge pull request #813 from chenmz00/main
fix: list models API
|
2025-03-05 22:00:38 +08:00 |
|
chenmz00
|
b2ba795cfd
|
fix: list models API
Fix the list models API to match the corresponding OpenAI API format.
|
2025-03-05 21:49:27 +08:00 |
|
ZiWei Yuan
|
2753a4a654
|
Merge pull request #810 from kvcache-ai/v0.2.3
V0.2.3
|
2025-03-05 20:32:53 +08:00 |
|
liam
|
9c343b4f71
|
🔖 release v0.2.3
|
2025-03-05 20:24:11 +08:00 |
|
ZiWei Yuan
|
6c35ca75b3
|
Merge pull request #809 from KMSorSMS/develop-0.2.3
⚡ release v0.2.3
|
2025-03-05 20:23:03 +08:00 |
|
liam
|
848fe8ab97
|
⚡ release v0.2.3
|
2025-03-05 20:21:04 +08:00 |
|
Atream
|
f03faa5376
|
Merge pull request #808 from kvcache-ai/Atream-patch-2
Update install.md
|
2025-03-05 19:02:45 +08:00 |
|
Atream
|
f17fb9d8fe
|
Update install.md
|
2025-03-05 19:02:32 +08:00 |
|
Atream
|
44dafa034b
|
Merge pull request #781 from 3wweiweiwu/main
update documentation to fix error in numa enablement
|
2025-03-05 19:00:59 +08:00 |
|
Azure
|
034a116365
|
update readme
|
2025-03-05 10:04:43 +00:00 |
|
Azure
|
d7becadcf7
|
Merge branch 'develop-0.2.3' of https://github.com/kvcache-ai/ktransformers into develop-0.2.3
|
2025-03-05 09:26:23 +00:00 |
|
Azure
|
662c1e4c14
|
small fix about max new token
|
2025-03-05 09:25:41 +00:00 |
|
Atream
|
52fa3003d1
|
Merge pull request #791 from KMSorSMS/develop-0.2.3
⚡ add humaneval support
|
2025-03-05 11:28:36 +08:00 |
|
Atream
|
3a3021502e
|
Merge pull request #795 from hybcloud/main
fix minor typo
|
2025-03-05 11:27:26 +08:00 |
|
hybcloud
|
6b551a6ee3
|
fix minor typo
|
2025-03-05 11:09:29 +08:00 |
|
3wweiweiwu
|
b936bcc72e
|
use 'make dev_install' to make minimum change
|
2025-03-04 09:35:12 -05:00 |
|
liam
|
dc10480ef6
|
⚡ add humaneval support
|
2025-03-04 20:54:49 +08:00 |
|
Wix Woo
|
bbcc5481bb
|
update documentation to fix error in numa enablement
|
2025-03-03 15:49:00 +00:00 |
|
Azure
|
5aee6c0446
|
Merge branch 'main' into develop-0.2.3
|
2025-03-03 12:19:47 +00:00 |
|
Atream
|
f46b3fd51c
|
Merge pull request #778 from Conless/fix-shape
[Fix] Shape error in KLinearMarlin
|
2025-03-03 19:58:52 +08:00 |
|
Yi Pan
|
01755a60c0
|
fix: wrong shape in KLinearMarlin.
|
2025-03-03 17:34:45 +08:00 |
|
Atream
|
4e43e8a4ee
|
Merge pull request #775 from kvcache-ai/Atream-patch-1
Update __init__.py
|
2025-03-03 16:50:02 +08:00 |
|
Atream
|
8963ae7817
|
Update __init__.py
|
2025-03-03 16:49:50 +08:00 |
|
Atream
|
659583a92c
|
Merge pull request #770 from SkqLiao/main
Introduce Testing Jobs for kTransformers Setup on Self-Hosted Runner
|
2025-03-03 14:35:26 +08:00 |
|
Azure
|
216a63b858
|
Merge pull request #754 from moonshadow-25/dev023
Support for IQ1_S(Dynamic 1.58-bit)
|
2025-03-03 14:19:33 +08:00 |
|
Jiaqi Liao
|
7d96e2eeba
|
fix environment name
|
2025-03-03 14:01:40 +08:00 |
|
Jiaqi Liao
|
3d62579a6a
|
fix environment name
|
2025-03-03 13:55:40 +08:00 |
|
Jiaqi Liao
|
5fe0d138ca
|
add install job for self-host testing
|
2025-03-03 13:52:13 +08:00 |
|
wang jiahao
|
48b9800790
|
Merge pull request #759 from 3wweiweiwu/fix_top_p_typo
fix typo for top_p
|
2025-03-02 13:58:11 +08:00 |
|
wang jiahao
|
bb54b68e5c
|
Merge pull request #761 from kvcache-ai/fix-server-bug
fix ollama api temperature bug
|
2025-03-02 13:56:46 +08:00 |
|