Atream
|
8b51b0f058
|
Merge pull request #915 from kvcache-ai/Atream-patch-4
Atream patch 4
|
2025-03-17 17:05:39 +08:00 |
|
Atream
|
167506b779
|
Update DeepSeek-V3-Chat-multi-gpu-marlin.yaml
|
2025-03-17 17:05:01 +08:00 |
|
Atream
|
c9a0c44213
|
Update DeepSeek-V3-Chat-multi-gpu-fp8-linear-ggml-experts.yaml
|
2025-03-17 17:03:52 +08:00 |
|
Atream
|
3aee0fa099
|
Merge pull request #913 from kvcache-ai/Atream-patch-3
Add files via upload
|
2025-03-17 17:00:28 +08:00 |
|
Atream
|
094ac8f3a4
|
Add files via upload
|
2025-03-17 16:59:57 +08:00 |
|
ZiWei Yuan
|
8a8311cb04
|
Merge pull request #911 from kvcache-ai/patch_v0.2.3post2
🔧 update multi-gpu-fp8-linear and multi-gpu marlin yaml
|
2025-03-17 15:09:11 +08:00 |
|
liam
|
19f058ec9e
|
🔧 update multi-gpu-fp8-linear and multi-gpu marlin yaml
|
2025-03-17 15:08:12 +08:00 |
|
Azure
|
0e93a09d67
|
Merge pull request #906 from Azure-Tang/main
[Fix] Fix rocm example yaml
|
2025-03-16 10:27:59 +08:00 |
|
Azure-Tang
|
85c32fdd10
|
Fix rocm example yaml
|
2025-03-15 22:27:02 -04:00 |
|
Azure
|
63604cac59
|
Merge pull request #904 from Azure-Tang/main
[fix]Fix rocm compilation
|
2025-03-16 00:36:16 +08:00 |
|
Azure-Tang
|
4a31237346
|
fix rocm compilation
|
2025-03-15 12:34:03 -04:00 |
|
Atream
|
c51818c39a
|
Merge pull request #902 from kvcache-ai/rollback-triton-prefill
rollback-triton-prefill
|
2025-03-15 23:09:30 +08:00 |
|
Atream
|
3934b9dfc1
|
rollback-triton-prefill
|
2025-03-15 14:21:21 +00:00 |
|
ZiWei Yuan
|
bda9cf15e7
|
Merge pull request #899 from kvcache-ai/develop-0.2.3post2
⚡ fix readme path
|
2025-03-15 19:20:52 +08:00 |
|
liam
|
ee02a111d7
|
⚡ fix readme path
|
2025-03-15 19:20:04 +08:00 |
|
ZiWei Yuan
|
9b76cab1a5
|
Merge pull request #898 from kvcache-ai/develop-0.2.3post2
Release 0.2.3post2
|
2025-03-15 18:11:42 +08:00 |
|
liam
|
b5ef7c26dc
|
🔖 release v0.2.3post2
|
2025-03-15 18:04:10 +08:00 |
|
Jiaqi Liao
|
dfe09b05dd
|
Merge pull request #897 from SkqLiao/main
Add Unit Test for Local Chat
|
2025-03-15 17:42:48 +08:00 |
|
SkqLiao
|
c66ca65778
|
write to log
|
2025-03-15 17:10:44 +08:00 |
|
SkqLiao
|
a1891b845d
|
remove unsupprted paramters, add force think
|
2025-03-15 17:04:42 +08:00 |
|
SkqLiao
|
4e23a4c024
|
split two test
|
2025-03-15 11:32:43 +08:00 |
|
Azure
|
117a8d2f2a
|
fix compilation
|
2025-03-14 19:49:20 +00:00 |
|
SkqLiao
|
0899b7dde6
|
remove file output est
|
2025-03-15 03:17:35 +08:00 |
|
SkqLiao
|
570c98c52d
|
remove output test
|
2025-03-15 03:17:17 +08:00 |
|
SkqLiao
|
6385308ff0
|
replace sed with awk
|
2025-03-15 03:11:26 +08:00 |
|
SkqLiao
|
9d19b7b4d4
|
fix sed
|
2025-03-15 03:03:38 +08:00 |
|
SkqLiao
|
336b5dd590
|
fix sed command
|
2025-03-15 02:55:36 +08:00 |
|
SkqLiao
|
2ed4dff85d
|
fix command typo
|
2025-03-15 02:51:03 +08:00 |
|
SkqLiao
|
f21ea700f3
|
fix term
|
2025-03-15 02:45:35 +08:00 |
|
SkqLiao
|
0be19c39e9
|
change cicd option default
|
2025-03-15 02:37:54 +08:00 |
|
SkqLiao
|
a31e09969f
|
fix typo
|
2025-03-15 02:37:08 +08:00 |
|
SkqLiao
|
129e013b41
|
rename cicd
|
2025-03-15 02:36:37 +08:00 |
|
SkqLiao
|
57cf449a97
|
fix command
|
2025-03-15 02:35:56 +08:00 |
|
SkqLiao
|
9812d57c11
|
fix typo, logging to file
|
2025-03-15 02:31:49 +08:00 |
|
SkqLiao
|
0f1684c28d
|
local chat for cicd test
|
2025-03-15 02:31:19 +08:00 |
|
Azure
|
3986e2d2cf
|
Merge pull request #178 from fxzjshm/hip
[Feat] Port to ROCm/HIP
|
2025-03-15 02:31:07 +08:00 |
|
Azure-Tang
|
e5b001d76f
|
Update readme; Format code; Add example yaml.
|
2025-03-14 14:25:52 -04:00 |
|
SkqLiao
|
12949c8acd
|
fix default options
|
2025-03-15 01:47:14 +08:00 |
|
Jiaqi Liao
|
8320ae7d43
|
Merge pull request #893 from SkqLiao/main
Add Local Chat Test for CI/CD
|
2025-03-15 01:46:39 +08:00 |
|
SkqLiao
|
bd9dc55a8d
|
fix option typos
|
2025-03-15 01:45:49 +08:00 |
|
SkqLiao
|
6c98bb6009
|
modify job name
|
2025-03-15 01:45:07 +08:00 |
|
SkqLiao
|
8c51f520dd
|
fix install issue & add test
|
2025-03-15 01:43:38 +08:00 |
|
SkqLiao
|
4e1a7630aa
|
fix cicd script
|
2025-03-14 23:13:39 +08:00 |
|
Jiaqi Liao
|
6a77a3d396
|
Merge pull request #892 from SkqLiao/main
fix flash_attn whl path in install job
|
2025-03-14 23:10:36 +08:00 |
|
SkqLiao
|
a8d159771e
|
fix flash_attn whl path
|
2025-03-14 23:09:37 +08:00 |
|
Atream
|
b4ad815ef0
|
Merge pull request #891 from kvcache-ai/performance-optimize-gpu
use compile for gate, slight performance improvement
|
2025-03-14 20:45:24 +08:00 |
|
Atream
|
a889288fc1
|
use compile for gate, slight performance improvement
|
2025-03-14 12:43:28 +00:00 |
|
Azure-Tang
|
c38e77de6b
|
Merge branch 'hip' of https://github.com/fxzjshm/ktransformers into hip
|
2025-03-14 05:54:02 -04:00 |
|
Azure-Tang
|
ed8437413b
|
merge main; Add torch q8 linear
|
2025-03-14 05:52:07 -04:00 |
|
Atream
|
6c4ed59175
|
Merge pull request #886 from kvcache-ai/fix-singleton-zbx
fix-singleton
|
2025-03-14 12:18:30 +08:00 |
|