Commit graph

531 commits

Author SHA1 Message Date
Atream
8b51b0f058
Merge pull request #915 from kvcache-ai/Atream-patch-4
Atream patch 4
2025-03-17 17:05:39 +08:00
Atream
167506b779
Update DeepSeek-V3-Chat-multi-gpu-marlin.yaml 2025-03-17 17:05:01 +08:00
Atream
c9a0c44213
Update DeepSeek-V3-Chat-multi-gpu-fp8-linear-ggml-experts.yaml 2025-03-17 17:03:52 +08:00
Atream
3aee0fa099
Merge pull request #913 from kvcache-ai/Atream-patch-3
Add files via upload
2025-03-17 17:00:28 +08:00
Atream
094ac8f3a4
Add files via upload 2025-03-17 16:59:57 +08:00
ZiWei Yuan
8a8311cb04
Merge pull request #911 from kvcache-ai/patch_v0.2.3post2
🔧 update multi-gpu-fp8-linear and multi-gpu marlin yaml
2025-03-17 15:09:11 +08:00
liam
19f058ec9e 🔧 update multi-gpu-fp8-linear and multi-gpu marlin yaml 2025-03-17 15:08:12 +08:00
Azure
0e93a09d67
Merge pull request #906 from Azure-Tang/main
[Fix] Fix rocm example yaml
2025-03-16 10:27:59 +08:00
Azure-Tang
85c32fdd10 Fix rocm example yaml 2025-03-15 22:27:02 -04:00
Azure
63604cac59
Merge pull request #904 from Azure-Tang/main
[fix]Fix rocm compilation
2025-03-16 00:36:16 +08:00
Azure-Tang
4a31237346 fix rocm compilation 2025-03-15 12:34:03 -04:00
Atream
c51818c39a
Merge pull request #902 from kvcache-ai/rollback-triton-prefill
rollback-triton-prefill
2025-03-15 23:09:30 +08:00
Atream
3934b9dfc1 rollback-triton-prefill 2025-03-15 14:21:21 +00:00
ZiWei Yuan
bda9cf15e7
Merge pull request #899 from kvcache-ai/develop-0.2.3post2
 fix readme path
2025-03-15 19:20:52 +08:00
liam
ee02a111d7 fix readme path 2025-03-15 19:20:04 +08:00
ZiWei Yuan
9b76cab1a5
Merge pull request #898 from kvcache-ai/develop-0.2.3post2
Release 0.2.3post2
2025-03-15 18:11:42 +08:00
liam
b5ef7c26dc 🔖 release v0.2.3post2 2025-03-15 18:04:10 +08:00
Jiaqi Liao
dfe09b05dd
Merge pull request #897 from SkqLiao/main
Add Unit Test for Local Chat
2025-03-15 17:42:48 +08:00
SkqLiao
c66ca65778 write to log 2025-03-15 17:10:44 +08:00
SkqLiao
a1891b845d remove unsupprted paramters, add force think 2025-03-15 17:04:42 +08:00
SkqLiao
4e23a4c024 split two test 2025-03-15 11:32:43 +08:00
Azure
117a8d2f2a fix compilation 2025-03-14 19:49:20 +00:00
SkqLiao
0899b7dde6 remove file output est 2025-03-15 03:17:35 +08:00
SkqLiao
570c98c52d remove output test 2025-03-15 03:17:17 +08:00
SkqLiao
6385308ff0 replace sed with awk 2025-03-15 03:11:26 +08:00
SkqLiao
9d19b7b4d4 fix sed 2025-03-15 03:03:38 +08:00
SkqLiao
336b5dd590 fix sed command 2025-03-15 02:55:36 +08:00
SkqLiao
2ed4dff85d fix command typo 2025-03-15 02:51:03 +08:00
SkqLiao
f21ea700f3 fix term 2025-03-15 02:45:35 +08:00
SkqLiao
0be19c39e9 change cicd option default 2025-03-15 02:37:54 +08:00
SkqLiao
a31e09969f fix typo 2025-03-15 02:37:08 +08:00
SkqLiao
129e013b41 rename cicd 2025-03-15 02:36:37 +08:00
SkqLiao
57cf449a97 fix command 2025-03-15 02:35:56 +08:00
SkqLiao
9812d57c11 fix typo, logging to file 2025-03-15 02:31:49 +08:00
SkqLiao
0f1684c28d local chat for cicd test 2025-03-15 02:31:19 +08:00
Azure
3986e2d2cf
Merge pull request #178 from fxzjshm/hip
[Feat] Port to ROCm/HIP
2025-03-15 02:31:07 +08:00
Azure-Tang
e5b001d76f Update readme; Format code; Add example yaml. 2025-03-14 14:25:52 -04:00
SkqLiao
12949c8acd fix default options 2025-03-15 01:47:14 +08:00
Jiaqi Liao
8320ae7d43
Merge pull request #893 from SkqLiao/main
Add Local Chat Test for CI/CD
2025-03-15 01:46:39 +08:00
SkqLiao
bd9dc55a8d fix option typos 2025-03-15 01:45:49 +08:00
SkqLiao
6c98bb6009 modify job name 2025-03-15 01:45:07 +08:00
SkqLiao
8c51f520dd fix install issue & add test 2025-03-15 01:43:38 +08:00
SkqLiao
4e1a7630aa fix cicd script 2025-03-14 23:13:39 +08:00
Jiaqi Liao
6a77a3d396
Merge pull request #892 from SkqLiao/main
fix flash_attn whl path in install job
2025-03-14 23:10:36 +08:00
SkqLiao
a8d159771e fix flash_attn whl path 2025-03-14 23:09:37 +08:00
Atream
b4ad815ef0
Merge pull request #891 from kvcache-ai/performance-optimize-gpu
use compile for gate, slight performance improvement
2025-03-14 20:45:24 +08:00
Atream
a889288fc1 use compile for gate, slight performance improvement 2025-03-14 12:43:28 +00:00
Azure-Tang
c38e77de6b Merge branch 'hip' of https://github.com/fxzjshm/ktransformers into hip 2025-03-14 05:54:02 -04:00
Azure-Tang
ed8437413b merge main; Add torch q8 linear 2025-03-14 05:52:07 -04:00
Atream
6c4ed59175
Merge pull request #886 from kvcache-ai/fix-singleton-zbx
fix-singleton
2025-03-14 12:18:30 +08:00