koboldcpp/scripts/snapdragon
Trivikram Reddy 856c3adac1
hexagon: eliminate scalar VTCM loads via HVX splat helpers (#22993)
* hexagon: add hvx_vec_repl helpers and use those for the splat-from-vtcm use case

* hmx-mm: optimize per-group scale handling

* hmx-fa: optimize slope load from vtcm

* hmx-fa: use aligned access where possible in hmx-utils

* hexagon: add hvx_vec_repl_2x_f16 helper and consolidate repl helpers

---------

Co-authored-by: Max Krasnyansky <maxk@qti.qualcomm.com>
2026-05-12 17:28:02 -07:00
adb                      hexagon: eliminate scalar VTCM loads via HVX splat helpers (#22993)  2026-05-12 17:28:02 -07:00
qdc                      Enable testing on Snapdragon devices (#21051)                        2026-04-23 13:08:10 -07:00
windows                  hexagon: add support for basic and extended Op profiling (#22269)    2026-04-23 14:17:21 -07:00
ggml-hexagon-profile.py  hexagon: add support for basic and extended Op profiling (#22269)    2026-04-23 14:17:21 -07:00