Commit graph

439 commits

Author SHA1 Message Date
liu.shen
26bd889ff8 fix #829: compatibility with Intel Cascade Lake architecture CPUs 2025-03-09 19:26:12 +08:00
liam
407e1b9ab2 🚑 Merge branch 'develop-0.2.3' of https://github.com/kvcache-ai/ktransformers into develop-0.2.3 2025-03-06 13:52:01 +08:00
Azure
27475cf8a5 fix compile 2025-03-06 05:34:37 +00:00
liam
8eeb6dd432 update compile option for avx512vpopcntdq 2025-03-06 12:18:04 +08:00
Azure
dd390835ca Add compile condition 2025-03-06 03:25:39 +00:00
Azure
8068018504 fix gcc compilation 2025-03-05 15:59:56 +00:00
ZiWei Yuan
6c35ca75b3
Merge pull request #809 from KMSorSMS/develop-0.2.3
 release v0.2.3
2025-03-05 20:23:03 +08:00
liam
848fe8ab97 release v0.2.3 2025-03-05 20:21:04 +08:00
Azure
034a116365 update readme 2025-03-05 10:04:43 +00:00
Azure
d7becadcf7 Merge branch 'develop-0.2.3' of https://github.com/kvcache-ai/ktransformers into develop-0.2.3 2025-03-05 09:26:23 +00:00
Azure
662c1e4c14 small fix about max new token 2025-03-05 09:25:41 +00:00
Atream
52fa3003d1
Merge pull request #791 from KMSorSMS/develop-0.2.3
 add humaneval support
2025-03-05 11:28:36 +08:00
liam
dc10480ef6 add humaneval support 2025-03-04 20:54:49 +08:00
Azure
5aee6c0446 Merge branch 'main' into develop-0.2.3 2025-03-03 12:19:47 +00:00
Azure
216a63b858
Merge pull request #754 from moonshadow-25/dev023
Support for IQ1_S(Dynamic 1.58-bit)
2025-03-03 14:19:33 +08:00
wang jiahao
48b9800790
Merge pull request #759 from 3wweiweiwu/fix_top_p_typo
fix typo for top_p
2025-03-02 13:58:11 +08:00
wang jiahao
bb54b68e5c
Merge pull request #761 from kvcache-ai/fix-server-bug
fix ollama api temperature bug
2025-03-02 13:56:46 +08:00
1668068727@qq.com
7cdf8139f0 fix ollama api temperature bug 2025-03-02 13:55:26 +08:00
Wix Woo
3aa0cfc29d fix typo for top_p 2025-03-01 20:15:36 +00:00
Atream
69382e58f9
Merge pull request #313 from MuWinds/main
Update:Solve  `torch.backends.cuda.sdp_kernel()` is deprecated.
2025-03-01 23:24:44 +08:00
Atream
ca1dc1e7d1
Merge branch 'main' into main 2025-03-01 23:24:10 +08:00
Atream
505f4e2c35
Merge pull request #753 from ningpengtao-coder/main
Update local_chat.py
2025-03-01 23:22:13 +08:00
moonshadow-25
d24d369332 iq1s files 2025-03-01 22:44:06 +08:00
moonshadow-25
c513ae59c3 iq1s files 2025-03-01 22:38:04 +08:00
宁鹏涛
71286ec1c0
Update local_chat.py
Fix: config.architectures[0] == "DeepseekV2ForCausalLM" or "DeepseekV3ForCausalLM" always evaluates to true
2025-03-01 21:52:48 +08:00
moonshadow-25
9781d1e6f4 iq1s core 2025-03-01 21:48:25 +08:00
godrosev
93c5b75716 rem 2025-03-01 21:25:18 +08:00
godrosev
e6349eb240 iq1s 2025-03-01 21:00:11 +08:00
Atream
761de49843
Merge pull request #751 from kvcache-ai/Atream-patch-2
Update DeepseekR1_V3_tutorial.md
2025-03-01 19:57:00 +08:00
Atream
735873a32a
Update DeepseekR1_V3_tutorial.md 2025-03-01 19:56:46 +08:00
Atream
bd33a59ecf
Merge pull request #750 from kvcache-ai/feat-chunk-prefill-flashinfer
Support chunk prefill. Support 139K context for DeepSeek-R1 139K with in 24G VRAM.
2025-03-01 19:50:52 +08:00
Atream
fa03ea48dd Merge branch 'main' into feat-chunk-prefill-flashinfer 2025-03-01 11:35:09 +00:00
Atream
f35e8d41d8 support chunk prefill, support 139K context for 24G VRAM 2025-03-01 11:28:25 +00:00
ZiWei Yuan
511958d49c
Merge pull request #743 from KMSorSMS/main
fix cache_lens bug in server and rm test prompt.txt
2025-03-01 00:17:53 +08:00
liam
80e0536fb0 Merge branch 'main' of https://github.com/KMSorSMS/ktransformers into main 2025-03-01 00:12:21 +08:00
liam
8ddc990668 fix server cache lens 2025-03-01 00:09:57 +08:00
Atream
494469d4c5
Merge pull request #722 from ZhangShuaiyi/remove_unused
Delete duplicate code
2025-02-28 15:04:21 +08:00
liam
71f4599dee 📝 rm test_prompt 2025-02-28 11:44:49 +08:00
ZiWei Yuan
1264f9407b
Merge pull request #732 from KMSorSMS/main
 fox docker build
2025-02-28 11:28:06 +08:00
liam
a0e7afa432 fox docker build 2025-02-28 11:25:34 +08:00
Azure
add415124f
Merge pull request #731 from Azure-Tang/update-template
[fix] Fix template name
2025-02-28 11:19:52 +08:00
Azure
bc52969918 fix name 2025-02-28 03:17:33 +00:00
Azure
0439cb36d4
Merge pull request #730 from Azure-Tang/update-template
[UPDATE] Update ZH/EN issue template
2025-02-28 11:10:29 +08:00
Azure
31b01f5b99 update ZH/EN template 2025-02-28 03:09:06 +00:00
Shuaiyi
a34a25d5cc Delete unused code 2025-02-27 13:18:19 +00:00
wang jiahao
7a19f3b781
Merge pull request #721 from kvcache-ai/fix_temperature
fix temperature
2025-02-27 21:01:21 +08:00
qiyuxinlin
22df52e94e fix temperature 2025-02-27 21:00:44 +08:00
Atream
85e2cc7bf4
Merge pull request #719 from kvcache-ai/fix-use-generation-json
use generation config from json file in official repo
2025-02-27 19:49:41 +08:00
Atream
e645d84794 use generation config from json file in official repo 2025-02-27 11:48:34 +00:00
wang jiahao
5e3c6b4f97
Merge pull request #644 from wtdcode/temperature_top_p_from_request
Allow temperature and top_p from /v1/chat/completions
2025-02-27 18:13:13 +08:00
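
Note on commit 71286ec1c0 (Update local_chat.py): the condition quoted in that commit message is a common Python pitfall. The sketch below is only an illustration of why such a comparison is always true and one possible way to write it correctly. The function names and the "LlamaForCausalLM" sample value are hypothetical, and the actual change in local_chat.py may differ.

```python
def is_deepseek_buggy(arch: str) -> bool:
    # Hypothetical helper reproducing the faulty condition.
    # Python parses this as (arch == "DeepseekV2ForCausalLM") or "DeepseekV3ForCausalLM".
    # The right operand is a non-empty string, which is truthy, so the whole
    # expression is truthy no matter what `arch` is.
    return bool(arch == "DeepseekV2ForCausalLM" or "DeepseekV3ForCausalLM")


def is_deepseek_fixed(arch: str) -> bool:
    # Hypothetical helper showing one possible fix: test membership in a tuple
    # so that both names are actually compared against `arch`.
    return arch in ("DeepseekV2ForCausalLM", "DeepseekV3ForCausalLM")


if __name__ == "__main__":
    for name in ("DeepseekV2ForCausalLM", "DeepseekV3ForCausalLM", "LlamaForCausalLM"):
        print(name, is_deepseek_buggy(name), is_deepseek_fixed(name))
```

With the faulty form, every architecture name takes the DeepSeek branch; with the membership test, only the two listed names do.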