koboldcpp/tools/mtmd
Concedo 718dc159b6 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	CMakeLists.txt
#	docs/speculative.md
#	ggml/src/ggml-cuda/CMakeLists.txt
#	ggml/src/ggml-hexagon/ggml-hexagon.cpp
#	ggml/src/ggml-hexagon/htp/hmx-matmul-ops.c
#	ggml/src/ggml-hexagon/htp/hmx-ops.h
#	ggml/src/ggml-hexagon/htp/main.c
#	ggml/src/ggml-hexagon/htp/matmul-ops.c
#	ggml/src/ggml-hexagon/htp/rope-ops.c
#	ggml/src/ggml-hexagon/htp/ssm-conv.c
#	ggml/src/ggml-opencl/ggml-opencl.cpp
#	scripts/snapdragon/adb/run-bench.sh
#	scripts/snapdragon/adb/run-cli.sh
#	scripts/snapdragon/adb/run-completion.sh
#	scripts/snapdragon/adb/run-mtmd.sh
#	scripts/snapdragon/windows/run-bench.ps1
#	scripts/snapdragon/windows/run-cli.ps1
#	scripts/snapdragon/windows/run-completion.ps1
#	scripts/snapdragon/windows/run-mtmd.ps1
#	src/llama-vocab.cpp
#	tests/test-backend-ops.cpp
#	tools/batched-bench/CMakeLists.txt
#	tools/batched-bench/batched-bench.cpp
#	tools/cli/CMakeLists.txt
#	tools/cli/README.md
#	tools/cli/cli.cpp
#	tools/completion/CMakeLists.txt
#	tools/completion/README.md
#	tools/llama-bench/CMakeLists.txt
#	tools/llama-bench/llama-bench.cpp
#	tools/mtmd/CMakeLists.txt
#	tools/mtmd/tests/test-deepseek-ocr.py
#	tools/mtmd/tests/tests-requirements.txt
#	tools/perplexity/CMakeLists.txt
#	tools/perplexity/perplexity.cpp
#	tools/quantize/CMakeLists.txt
#	tools/server/CMakeLists.txt
#	tools/server/README.md
#	ty.toml
2026-05-21 23:47:21 +08:00
debug Merge remote-tracking branch 'origin/upstream' into concedo_experimental 2026-05-06 21:20:06 +08:00
legacy-models chore : correct typos [no ci] (#20041) 2026-03-05 08:50:21 +01:00
models mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (#23329) 2026-05-21 00:35:37 +02:00
tests Merge branch 'upstream' into concedo_experimental 2026-05-21 23:47:21 +08:00
clip-graph.h mtmd: add chunks and fix preproc for qwen3a (#23073) 2026-05-15 19:32:47 +02:00
clip-impl.h Merge branch 'upstream' into concedo_experimental 2026-05-21 23:47:21 +08:00
clip-model.h mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (#23329) 2026-05-21 00:35:37 +02:00
clip.cpp Merge branch 'upstream' into concedo_experimental 2026-05-21 23:47:21 +08:00
clip.h Merge commit 'e6b4acfe86' into concedo_experimental 2026-05-21 23:31:32 +08:00
deprecation-warning.cpp Fix locale-dependent float printing in GGUF metadata (#17331) 2026-03-04 09:30:40 +01:00
llava.cpp updated to handle changes for clip_is_mrope 2026-04-21 19:34:32 +08:00
llava.h updated to handle changes for clip_is_mrope 2026-04-21 19:34:32 +08:00
mtmd-audio.cpp mtmd: add chunks and fix preproc for qwen3a (#23073) 2026-05-15 19:32:47 +02:00
mtmd-audio.h mtmd: add chunks and fix preproc for qwen3a (#23073) 2026-05-15 19:32:47 +02:00
mtmd-cli.cpp common : only load backends when required (#22290) 2026-05-05 09:23:50 +02:00
mtmd-helper.cpp Merge branch 'upstream' into concedo_experimental 2026-04-20 20:19:11 +08:00
mtmd-helper.h mtmd: add pos_0 to mtmd_image_tokens_get_decoder_pos (breaking change) (#22082) 2026-04-19 11:57:21 +02:00
mtmd-image.cpp mtmd : DeepSeek-OCR image processing fixes, img_tool::resize padding refactor (#23345) 2026-05-20 17:37:10 +02:00
mtmd-image.h model : support step3-vl-10b (#21287) 2026-04-08 09:51:31 +02:00
mtmd.cpp mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (#23329) 2026-05-21 00:35:37 +02:00
mtmd.h mtmd: fit_params now take into account mmproj (#21489) 2026-05-20 11:27:44 +02:00
requirements.txt requirements : update transformers/torch for Embedding Gemma (#15828) 2025-09-09 06:06:52 +02:00
test-1.jpeg mtmd : rename llava directory to mtmd (#13311) 2025-05-05 16:02:55 +02:00
test-2.mp3 mtmd : support Qwen 2.5 Omni (input audio+vision, no audio output) (#13784) 2025-05-27 14:06:10 +02:00
tests.sh mtmd, llama : Update HunyuanVL vision-language model support (#22037) 2026-04-22 11:58:43 +02:00