liu.shen
|
26bd889ff8
|
fix #829: 兼容Intel Cascade Lake架构的CPU
|
2025-03-09 19:26:12 +08:00 |
|
liam
|
407e1b9ab2
|
🚑 Merge branch 'develop-0.2.3' of https://github.com/kvcache-ai/ktransformers into develop-0.2.3
|
2025-03-06 13:52:01 +08:00 |
|
Azure
|
27475cf8a5
|
fix compile
|
2025-03-06 05:34:37 +00:00 |
|
liam
|
8eeb6dd432
|
⚡ update compile option for avx512vpopcntdq
|
2025-03-06 12:18:04 +08:00 |
|
Azure
|
dd390835ca
|
Add compile condition
|
2025-03-06 03:25:39 +00:00 |
|
Azure
|
8068018504
|
fix gcc compilation
|
2025-03-05 15:59:56 +00:00 |
|
ZiWei Yuan
|
6c35ca75b3
|
Merge pull request #809 from KMSorSMS/develop-0.2.3
⚡ release v0.2.3
|
2025-03-05 20:23:03 +08:00 |
|
liam
|
848fe8ab97
|
⚡ release v0.2.3
|
2025-03-05 20:21:04 +08:00 |
|
Azure
|
034a116365
|
update readme
|
2025-03-05 10:04:43 +00:00 |
|
Azure
|
d7becadcf7
|
Merge branch 'develop-0.2.3' of https://github.com/kvcache-ai/ktransformers into develop-0.2.3
|
2025-03-05 09:26:23 +00:00 |
|
Azure
|
662c1e4c14
|
small fix about max new token
|
2025-03-05 09:25:41 +00:00 |
|
Atream
|
52fa3003d1
|
Merge pull request #791 from KMSorSMS/develop-0.2.3
⚡ add humaneval support
|
2025-03-05 11:28:36 +08:00 |
|
liam
|
dc10480ef6
|
⚡ add humaneval support
|
2025-03-04 20:54:49 +08:00 |
|
Azure
|
5aee6c0446
|
Merge branch 'main' into develop-0.2.3
|
2025-03-03 12:19:47 +00:00 |
|
Azure
|
216a63b858
|
Merge pull request #754 from moonshadow-25/dev023
Support for IQ1_S(Dynamic 1.58-bit)
|
2025-03-03 14:19:33 +08:00 |
|
wang jiahao
|
48b9800790
|
Merge pull request #759 from 3wweiweiwu/fix_top_p_typo
fix typo for top_p
|
2025-03-02 13:58:11 +08:00 |
|
wang jiahao
|
bb54b68e5c
|
Merge pull request #761 from kvcache-ai/fix-server-bug
fix ollama api temperature bug
|
2025-03-02 13:56:46 +08:00 |
|
1668068727@qq.com
|
7cdf8139f0
|
fix ollama api temperature bug
|
2025-03-02 13:55:26 +08:00 |
|
Wix Woo
|
3aa0cfc29d
|
fix typo for top_p
|
2025-03-01 20:15:36 +00:00 |
|
Atream
|
69382e58f9
|
Merge pull request #313 from MuWinds/main
Update:Solve `torch.backends.cuda.sdp_kernel()` is deprecated.
|
2025-03-01 23:24:44 +08:00 |
|
Atream
|
ca1dc1e7d1
|
Merge branch 'main' into main
|
2025-03-01 23:24:10 +08:00 |
|
Atream
|
505f4e2c35
|
Merge pull request #753 from ningpengtao-coder/main
Update local_chat.py
|
2025-03-01 23:22:13 +08:00 |
|
moonshadow-25
|
d24d369332
|
iq1s files
|
2025-03-01 22:44:06 +08:00 |
|
moonshadow-25
|
c513ae59c3
|
iq1s files
|
2025-03-01 22:38:04 +08:00 |
|
宁鹏涛
|
71286ec1c0
|
Update local_chat.py
修复config.architectures[0] == "DeepseekV2ForCausalLM" or "DeepseekV3ForCausalLM" 永远为真
|
2025-03-01 21:52:48 +08:00 |
|
moonshadow-25
|
9781d1e6f4
|
iq1s core
|
2025-03-01 21:48:25 +08:00 |
|
godrosev
|
93c5b75716
|
rem
|
2025-03-01 21:25:18 +08:00 |
|
godrosev
|
e6349eb240
|
iq1s
|
2025-03-01 21:00:11 +08:00 |
|
Atream
|
761de49843
|
Merge pull request #751 from kvcache-ai/Atream-patch-2
Update DeepseekR1_V3_tutorial.md
|
2025-03-01 19:57:00 +08:00 |
|
Atream
|
735873a32a
|
Update DeepseekR1_V3_tutorial.md
|
2025-03-01 19:56:46 +08:00 |
|
Atream
|
bd33a59ecf
|
Merge pull request #750 from kvcache-ai/feat-chunk-prefill-flashinfer
Support chunk prefill. Support 139K context for DeepSeek-R1 139K with in 24G VRAM.
|
2025-03-01 19:50:52 +08:00 |
|
Atream
|
fa03ea48dd
|
Merge branch 'main' into feat-chunk-prefill-flashinfer
|
2025-03-01 11:35:09 +00:00 |
|
Atream
|
f35e8d41d8
|
support chunk prefill, support 139K context for 24G VRAM
|
2025-03-01 11:28:25 +00:00 |
|
ZiWei Yuan
|
511958d49c
|
Merge pull request #743 from KMSorSMS/main
fix cache_lens bug in server and rm test prompt.txt
|
2025-03-01 00:17:53 +08:00 |
|
liam
|
80e0536fb0
|
Merge branch 'main' of https://github.com/KMSorSMS/ktransformers into main
|
2025-03-01 00:12:21 +08:00 |
|
liam
|
8ddc990668
|
⚡ fix server cache lens
|
2025-03-01 00:09:57 +08:00 |
|
Atream
|
494469d4c5
|
Merge pull request #722 from ZhangShuaiyi/remove_unused
Delete duplicate code
|
2025-02-28 15:04:21 +08:00 |
|
liam
|
71f4599dee
|
📝 rm test_prompt
|
2025-02-28 11:44:49 +08:00 |
|
ZiWei Yuan
|
1264f9407b
|
Merge pull request #732 from KMSorSMS/main
⚡ fox docker build
|
2025-02-28 11:28:06 +08:00 |
|
liam
|
a0e7afa432
|
⚡ fox docker build
|
2025-02-28 11:25:34 +08:00 |
|
Azure
|
add415124f
|
Merge pull request #731 from Azure-Tang/update-template
[fix] Fix template name
|
2025-02-28 11:19:52 +08:00 |
|
Azure
|
bc52969918
|
fix name
|
2025-02-28 03:17:33 +00:00 |
|
Azure
|
0439cb36d4
|
Merge pull request #730 from Azure-Tang/update-template
[UPDATE] Update ZH/EN issue template
|
2025-02-28 11:10:29 +08:00 |
|
Azure
|
31b01f5b99
|
update ZH/EN template
|
2025-02-28 03:09:06 +00:00 |
|
Shuaiyi
|
a34a25d5cc
|
Delete unused code
|
2025-02-27 13:18:19 +00:00 |
|
wang jiahao
|
7a19f3b781
|
Merge pull request #721 from kvcache-ai/fix_temperature
fix temperature
|
2025-02-27 21:01:21 +08:00 |
|
qiyuxinlin
|
22df52e94e
|
fix temperature
|
2025-02-27 21:00:44 +08:00 |
|
Atream
|
85e2cc7bf4
|
Merge pull request #719 from kvcache-ai/fix-use-generation-json
use generation config from json file in official repo
|
2025-02-27 19:49:41 +08:00 |
|
Atream
|
e645d84794
|
use generation config from json file in official repo
|
2025-02-27 11:48:34 +00:00 |
|
wang jiahao
|
5e3c6b4f97
|
Merge pull request #644 from wtdcode/temperature_top_p_from_request
Allow temperature and top_p from /v1/chat/completions
|
2025-02-27 18:13:13 +08:00 |
|