| Name | Last commit message | Last commit date |
|---|---|---|
| models | optimize gguf dequant, save mem, support Q2_K | 2025-02-22 06:13:01 +00:00 |
| operators | fix: wrong shape in KLinearMarlin. | 2025-03-03 17:34:45 +08:00 |
| optimize | Update DeepSeek-V3-Chat-multi-gpu-marlin.yaml | 2025-02-26 21:53:50 +08:00 |
| server | Merge pull request #759 from 3wweiweiwu/fix_top_p_typo | 2025-03-02 13:58:11 +08:00 |
| tests | ⚡ update git ignore add docker dev container | 2025-02-25 17:22:11 +08:00 |
| __init__.py | Update __init__.py | 2025-03-03 16:49:50 +08:00 |
| local_chat.py | Update local_chat.py | 2025-03-01 21:52:48 +08:00 |