koboldcpp/tools/mtmd
Concedo ac29e6f0c0 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.devops/vulkan.Dockerfile
#	.github/workflows/build-self-hosted.yml
#	.github/workflows/build.yml
#	.github/workflows/release.yml
#	.github/workflows/server-self-hosted.yml
#	docs/build.md
#	ggml/src/ggml-hexagon/htp/CMakeLists.txt
#	ggml/src/ggml-hexagon/htp/hex-utils.h
#	ggml/src/ggml-hexagon/htp/hmx-matmul-ops.c
#	ggml/src/ggml-hexagon/htp/hmx-utils.h
#	ggml/src/ggml-hexagon/htp/htp-ctx.h
#	ggml/src/ggml-hexagon/htp/htp-ops.h
#	ggml/src/ggml-hexagon/htp/hvx-base.h
#	ggml/src/ggml-hexagon/htp/main.c
#	ggml/src/ggml-webgpu/ggml-webgpu.cpp
#	tests/test-backend-ops.cpp
#	tests/test-mtmd-c-api.c
2026-04-15 15:15:19 +08:00
..
debug fixed mtmd build 2026-04-03 14:41:24 +08:00
legacy-models chore : correct typos [no ci] (#20041) 2026-03-05 08:50:21 +01:00
models mtmd: qwen3 audio support (qwen3-omni and qwen3-asr) (#19441) 2026-04-12 23:57:25 +02:00
clip-graph.h model, mtmd: fix gguf conversion for audio/vision mmproj (#21309) 2026-04-02 17:10:32 +02:00
clip-impl.h Merge branch 'upstream' into concedo_experimental 2026-04-14 20:06:04 +08:00
clip-model.h mtmd: qwen3 audio support (qwen3-omni and qwen3-asr) (#19441) 2026-04-12 23:57:25 +02:00
clip.cpp Merge branch 'upstream' into concedo_experimental 2026-04-14 20:06:04 +08:00
clip.h fixed gemma4v image crashing on encode, however images are not yet working correctly 2026-04-03 15:56:35 +08:00
deprecation-warning.cpp Fix locale-dependent float printing in GGUF metadata (#17331) 2026-03-04 09:30:40 +01:00
llava.cpp fixed gemma4v image crashing on encode, however images are not yet working correctly 2026-04-03 15:56:35 +08:00
llava.h track clip img patch nx and ny 2025-12-18 22:58:10 +08:00
mtmd-audio.cpp mtmd: add Gemma 4 audio conformer encoder support (#21421) 2026-04-12 14:15:26 +02:00
mtmd-audio.h mtmd: add Gemma 4 audio conformer encoder support (#21421) 2026-04-12 14:15:26 +02:00
mtmd-cli.cpp common : move up common_init() and fix Windows UTF-8 logs (#21176) 2026-03-31 12:53:41 +02:00
mtmd-helper.cpp Merge branch 'upstream' into concedo_experimental 2026-04-15 15:15:19 +08:00
mtmd-helper.h mtmd: add mtmd_image_tokens_get_decoder_pos() API (#21851) 2026-04-14 16:07:41 +02:00
mtmd-image.cpp mtmd: fix crash when sending image under 2x2 pixels (#21711) 2026-04-12 23:59:21 +02:00
mtmd-image.h model : support step3-vl-10b (#21287) 2026-04-08 09:51:31 +02:00
mtmd.cpp mtmd: add mtmd_image_tokens_get_decoder_pos() API (#21851) 2026-04-14 16:07:41 +02:00
mtmd.h mtmd: add mtmd_image_tokens_get_decoder_pos() API (#21851) 2026-04-14 16:07:41 +02:00
requirements.txt requirements : update transformers/torch for Embedding Gemma (#15828) 2025-09-09 06:06:52 +02:00
test-1.jpeg mtmd : rename llava directory to mtmd (#13311) 2025-05-05 16:02:55 +02:00
test-2.mp3 mtmd : support Qwen 2.5 Omni (input audio+vision, no audio output) (#13784) 2025-05-27 14:06:10 +02:00
tests.sh docs: listing qwen3-asr and qwen3-omni as supported (#21857) 2026-04-13 22:28:17 +02:00