Commit graph

17 commits

Author SHA1 Message Date
Jianwei Dong
027832c590
[feat](kt-kernel): CPU-GPU experts sched (#1796)
Some checks failed
Book-CI / test (push) Has been cancelled
Book-CI / test-1 (push) Has been cancelled
Book-CI / test-2 (push) Has been cancelled
Deploy / deploy (macos-latest) (push) Has been cancelled
Deploy / deploy (ubuntu-latest) (push) Has been cancelled
Deploy / deploy (windows-latest) (push) Has been cancelled
2026-01-16 17:01:15 +08:00
ZiWei Yuan
b096b01fbc
[docs]: add kt-cli doc and update corresponding website (#1768) 2025-12-29 23:06:22 +08:00
Atream
4c5fcf9774 add kt-kernel 2025-10-12 05:13:00 +00:00
djw
b66d96db97 support smt and glm4 2025-07-24 08:40:58 +00:00
liam Yuan
0e8a36770a update ignore 2025-04-29 13:24:14 +08:00
liam
a0e7afa432 fox docker build 2025-02-28 11:25:34 +08:00
liam
0ca0b99fab update git ignore add docker dev container 2025-02-25 17:22:11 +08:00
Xiaodong Ye
2207f6cd14 feat: Support Moore Threads GPU
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2025-02-19 18:26:55 +08:00
liam
592e13d453 add mmlu_pro test 2025-02-18 14:43:38 +08:00
liam
07a0555016 fix device and add test 2025-02-18 12:52:17 +08:00
liam
c74453d8ca 📝 add doc support and fix bug in qwen2 2025-02-13 16:37:43 +08:00
liam
098602b08f v0.2 ongoing 2025-02-09 22:41:14 +08:00
liam
a148da2cfe : rm sensitive info in config.yaml, add readme of makefile. support old model_path config 2024-11-04 14:02:19 +08:00
TangJingqi
abd4214b56 fix readme; adjust param 2024-08-29 10:40:08 +08:00
chenxl
4d1d561d28 [feature] release 0.1.3 2024-08-28 16:11:43 +00:00
chenxl
f5f79f5c0e [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
chenxl
18c42e67df Initial commit 2024-07-27 16:06:58 +08:00