koboldcpp/scripts/snapdragon/adb
Trivikram Reddy 856c3adac1
hexagon: eliminate scalar VTCM loads via HVX splat helpers (#22993)
* hexagon: add hvx_vec_repl helpers and use those for splat-from-vtcm usecase

* hmx-mm: optimize per-group scale handling

* hmx-fa: optimize slope load from vtcm

* hmx-fa: use aligned access where possible in hmx-utils

* hexagon: add hvx_vec_repl_2x_f16 helper and consolidate repl helpers

---------

Co-authored-by: Max Krasnyansky <maxk@qti.qualcomm.com>
2026-05-12 17:28:02 -07:00
..
llama-cli.farf Add experimental ggml-hexagon backend for the Hexagon NPU (#16547) 2025-10-22 13:47:09 -07:00
run-bench.sh hexagon: add support for basic and extended Op profiling (#22269) 2026-04-23 14:17:21 -07:00
run-cli.sh hexagon: make vmem and buffer-size configurable (#22487) 2026-04-29 11:51:21 -07:00
run-completion.sh hexagon: eliminate scalar VTCM loads via HVX splat helpers (#22993) 2026-05-12 17:28:02 -07:00
run-mtmd.sh hexagon: add support for basic and extended Op profiling (#22269) 2026-04-23 14:17:21 -07:00
run-tool.sh hexagon: add support for basic and extended Op profiling (#22269) 2026-04-23 14:17:21 -07:00