Commit graph

877 commits

Author SHA1 Message Date
Jesse
e204a0bb6b
Merge 8c8cb207aa into ee2ede0412 2025-08-05 15:24:17 +08:00
Jianwei Dong
ee2ede0412
Merge pull request #1466 from kvcache-ai/update-readme-djw
update smallthinker and glm4 readme
2025-07-31 11:15:28 +08:00
djw
5771990a07 update smallthinker and glm4 readme 2025-07-31 03:14:49 +00:00
Jianwei Dong
757add1a39
Merge pull request #1456 from kvcache-ai/support-smt-glm4
Support SmallThinker and GLM4-MoE
2025-07-27 17:20:00 +08:00
qiyuxinlin
1334ddc833 update readme 2025-07-25 17:02:36 +00:00
qiyuxinlin
9e1560bb82 GLM4 and SmallThinker 2025-07-25 16:56:36 +00:00
djw
c7307aa0ae support smt and glm4 2025-07-25 16:24:38 +00:00
djw
17246bf84f support smt and glm4 2025-07-25 15:03:27 +00:00
djw
48bc6185b5 support smt and qlm4 2025-07-25 12:48:51 +00:00
qiyuxinlin
712ad1fa3c smallthinker right 2025-07-25 12:46:14 +00:00
Qiu Chengyu
f8719ee7b9 Add use_silu in MOEConfig in python and hard-determine smallthinker 2025-07-25 11:22:31 +00:00
Qiu Chengyu
cb808979fa Add use_silu in MOEConfig on cpu 2025-07-25 10:57:01 +00:00
qiyuxinlin
71c1d4eed7 smallthink run 2025-07-24 15:08:29 +00:00
djw
590fcb41cd support smt and glm4 2025-07-24 12:31:01 +00:00
djw
613f0b7c37 support smt and glm4 2025-07-24 09:39:19 +00:00
djw
b66d96db97 support smt and glm4 2025-07-24 08:40:58 +00:00
wang jiahao
1677e90092
Merge pull request #1439 from kvcache-ai/qiyuxinlin-patch-3
Update balance_serve.py
2025-07-12 13:14:54 +08:00
wang jiahao
a2e95e467a
Update balance_serve.py 2025-07-12 13:14:35 +08:00
UnicornChan
dc59af6167
Merge pull request #1438 from kvcache-ai/update-readme
Update Kimi-K2 Readme
2025-07-12 12:52:52 +08:00
chenxl
b5024f62a4 Update Kimi-K2 Readme 2025-07-12 12:51:00 +08:00
Atream
4fb367542b
Merge pull request #1437 from kvcache-ai/Atream-patch-5
Update Kimi-K2.md
2025-07-12 12:44:52 +08:00
Atream
34d2829f24
Update Kimi-K2.md 2025-07-12 12:44:41 +08:00
Atream
df19681ec4
Merge pull request #1436 from kvcache-ai/Atream-patch-4
Update Kimi-K2.md
2025-07-12 11:58:25 +08:00
Atream
90245d8a6b
Update Kimi-K2.md 2025-07-12 11:57:51 +08:00
Atream
8e2c67d655
Merge pull request #1435 from kvcache-ai/Atream-patch-3
Update Kimi-K2.md
2025-07-12 11:48:08 +08:00
Atream
378e4fc035
Update Kimi-K2.md 2025-07-12 11:47:42 +08:00
Atream
5d4a644456
Merge pull request #1434 from kvcache-ai/Atream-patch-2
Update Kimi-K2.md
2025-07-11 23:26:35 +08:00
Atream
b4ed8b6ded
Update Kimi-K2.md 2025-07-11 23:26:18 +08:00
UnicornChan
83c8e7928e
Merge pull request #1432 from kvcache-ai/UnicornChan-patch-1
Update Kimi-K2.md
2025-07-11 19:32:42 +08:00
UnicornChan
7800a413a2
Update Kimi-K2.md 2025-07-11 19:31:58 +08:00
Atream
2303889709
Merge pull request #1431 from kvcache-ai/support-kimi-k2
Support kimi k2
2025-07-11 09:36:01 +08:00
Atream
cf79c93fae
Update README.md 2025-07-11 09:35:12 +08:00
Atream
18690d819f
Update README.md 2025-07-11 09:34:07 +08:00
Atream
b4ac21454b
Create Kimi-K2.md 2025-07-11 09:31:47 +08:00
Jesse CreateThis
8c8cb207aa Apply magikRUKKOLA's patch from issue #1417 2025-07-06 19:45:06 +00:00
Atream
890b0f1622
Merge pull request #1410 from kvcache-ai/Atream-patch-1
Update __init__.py
2025-07-01 16:43:42 +08:00
Atream
5bd40c33eb
Update __init__.py 2025-07-01 16:43:19 +08:00
aubreyli
f96aab3c85
Merge pull request #1409 from rnwang04/fix_fp16
revert using FP16 in XPU
2025-07-01 15:00:41 +08:00
rnwang04
5b5deda420 revert using FP16 2025-07-01 14:24:27 +08:00
ErvinXie
495ae37478
Merge pull request #1407 from kvcache-ai/v0.3.2
V0.3.2
2025-07-01 10:26:56 +08:00
ouqingliang
90cff820cf update kvc disk path config. 2025-06-30 15:09:35 +00:00
ErvinXie
aadf31b35d
Update README.md 2025-06-30 17:55:49 +08:00
ErvinXie
5a73aaf652
Update prefix_cache.md 2025-06-30 15:04:37 +08:00
ErvinXie
a9a72e52c3
Update README.md 2025-06-30 14:56:46 +08:00
ErvinXie
d3fae09252
Merge pull request #1405 from kvcache-ai/prefix-cache
Prefix cache
2025-06-30 14:39:26 +08:00
ouqingliang
cc822df65d add prefix cache documentation 2025-06-28 07:13:33 +00:00
ouqingliang
4d51831316 fix MPSC 2025-06-26 13:11:40 +00:00
ouqingliang
3b4a1c7532 add prefix cache support for kvc2. 2025-06-26 04:57:25 +00:00
ouqingliang
b154441072 add prefix cache to kvc2. 2025-06-26 04:56:43 +00:00
ZiWei Yuan
ee5ee1103b
Merge pull request #1399 from KMSorSMS/main
update vendor ZTE name
2025-06-23 21:10:04 +08:00