mirror of
https://github.com/LostRuins/koboldcpp.git
synced 2026-05-23 12:45:01 +00:00
* hmx-mm: update debug logging in hmx-mm * hmx-mm: update dequant logic to use HVX_vector_x2/4 * hmx-mm: remove non-pipelined version of the quantize matmul It seems that we don't reall need non-pipelined version * hmx-mm: use activation depth mode and update naming Co-authored-by: Kim-Chyan Gan <kgan@qti.qualcomm.com> * hex-mm: minor hmx matmul naming updates * hmx-mm: remove unused vars * snapdragon: scripts bump default ubatch-size to 1K * hexagon: combine HMX and power and clock settings into a single set_power call * hmx-mm: remove leftover of the scale repl helper * hexagon: fix editconf error --------- Co-authored-by: Kim-Chyan Gan <kgan@qti.qualcomm.com> |
||
|---|---|---|
| .. | ||
| apple | ||
| hip | ||
| jinja | ||
| snapdragon | ||
| bench-models.sh | ||
| build-info.sh | ||
| check-requirements.sh | ||
| compare-commits.sh | ||
| compare-llama-bench.py | ||
| compare-logprobs.py | ||
| create_ops_docs.py | ||
| debug-test.sh | ||
| fetch_server_test_models.py | ||
| gen-authors.sh | ||
| gen-unicode-data.py | ||
| get-flags.mk | ||
| get-hellaswag.sh | ||
| get-pg.sh | ||
| get-wikitext-2.sh | ||
| get-winogrande.sh | ||
| get_chat_template.py | ||
| git-bisect-run.sh | ||
| git-bisect.sh | ||
| hf.sh | ||
| install-oneapi.bat | ||
| pr2wt.sh | ||
| serve-static.js | ||
| server-bench.py | ||
| server-test-function-call.py | ||
| server-test-model.py | ||
| server-test-parallel-tc.py | ||
| server-test-structured.py | ||
| sync-ggml-am.sh | ||
| sync-ggml.last | ||
| sync-ggml.sh | ||
| sync_vendor.py | ||
| tool_bench.py | ||
| tool_bench.sh | ||
| ui-download.cmake | ||
| verify-checksum-models.py | ||
| wc2wt.sh | ||
| xxd.cmake | ||