ktransformers_ext | fix rocm compilation | 2025-03-15 12:34:03 -04:00 |
models | optimize gguf dequant, save mem, support Q2_K | 2025-02-22 06:13:01 +00:00 |
operators | Update gate.py | 2025-03-20 14:54:01 +08:00 |
optimize | Update DeepSeek-V3-Chat-multi-gpu-marlin.yaml | 2025-03-17 17:05:01 +08:00 |
tests | Merge pull request #943 from SkqLiao/main | 2025-03-20 18:49:34 +08:00 |
util | merge main; Add torch q8 linear | 2025-03-14 05:52:07 -04:00 |
__init__.py | 🔖 release v0.2.3post2 | 2025-03-15 18:04:10 +08:00 |
local_chat.py | merge main; Add torch q8 linear | 2025-03-14 05:52:07 -04:00 |
local_chat_test.py | local chat for cicd test | 2025-03-15 02:31:19 +08:00 |