Commit graph

5 commits

Author  SHA1        Message                                                 Date
Atream  f4c198bd42  support absorb for prefill long context                 2025-02-25 08:52:02 +00:00
Atream  7e1fe256c8  optimize GPU                                            2025-02-21 05:06:57 +00:00
liam    83401dbb3b  ready to publish                                        2025-02-10 12:29:23 +08:00
Azure   ee24a27001  update v3 single gpu rule yaml;                         2025-02-04 16:14:35 +00:00
Azure   476b1d8dc6  support deepseekv3; runable but have precition problem  2025-01-31 08:27:24 +00:00