models
|
optimize gguf dequant, save mem, support Q2_K
|
2025-02-22 06:13:01 +00:00 |
operators
|
⚡ fix experts torch
|
2025-02-26 15:04:40 +08:00 |
optimize
|
Update DeepSeek-V3-Chat-multi-gpu-marlin.yaml
|
2025-02-26 21:53:50 +08:00 |
server
|
Merge pull request #532 from xv44586/fix-sse-formatting
|
2025-02-27 12:19:23 +08:00 |
tests
|
⚡ update git ignore add docker dev container
|
2025-02-25 17:22:11 +08:00 |
util
|
Merge pull request #670 from akemimadoka/fix-win
|
2025-02-27 17:40:27 +08:00 |
__init__.py
|
⚡ release v0.2.2rc1
|
2025-02-25 22:06:36 +08:00 |