configs | Initial commit | 2024-07-27 16:06:58 +08:00
models | [ADD] support multi-gpu qlen>1 q5_k | 2024-08-12 11:41:26 +00:00
operators | [fix] format classes and files name | 2024-08-15 10:44:59 +08:00
tests | [fix] format classes and files name | 2024-08-15 10:44:59 +08:00
util | [feature] experts can be injected using CPUInfer | 2024-08-14 16:10:54 +08:00
website | Initial commit | 2024-07-27 16:06:58 +08:00
__init__.py | [feature] add github action for pre compile | 2024-08-14 16:54:50 +00:00
local_chat.py | [ADD] support multi-gpu qlen>1 q5_k | 2024-08-12 11:41:26 +00:00