koboldcpp/tools/mtmd
Concedo 19a12bb080 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	CODEOWNERS
#	common/CMakeLists.txt
#	ggml/CMakeLists.txt
#	ggml/src/ggml-webgpu/ggml-webgpu-shader-lib.hpp
#	ggml/src/ggml-webgpu/ggml-webgpu.cpp
#	ggml/src/ggml-webgpu/wgsl-shaders/common_decls.tmpl
#	ggml/src/ggml-webgpu/wgsl-shaders/mul_mat_vec.wgsl
#	scripts/sync-ggml.last
#	tools/cli/cli.cpp
#	tools/llama-bench/llama-bench.cpp
#	tools/perplexity/perplexity.cpp
2026-04-21 18:53:03 +08:00
..
debug fixed mtmd build 2026-04-03 14:41:24 +08:00
legacy-models chore : correct typos [no ci] (#20041) 2026-03-05 08:50:21 +01:00
models mtmd: qwen3 audio support (qwen3-omni and qwen3-asr) (#19441) 2026-04-12 23:57:25 +02:00
clip-graph.h model, mtmd: fix gguf conversion for audio/vision mmproj (#21309) 2026-04-02 17:10:32 +02:00
clip-impl.h Merge branch 'upstream' into concedo_experimental 2026-04-14 20:06:04 +08:00
clip-model.h mtmd: qwen3 audio support (qwen3-omni and qwen3-asr) (#19441) 2026-04-12 23:57:25 +02:00
clip.cpp Merge branch 'upstream' into concedo_experimental 2026-04-14 20:06:04 +08:00
clip.h fixed gemma4v image crashing on encode, however images are not yet working correctly 2026-04-03 15:56:35 +08:00
deprecation-warning.cpp Fix locale-dependent float printing in GGUF metadata (#17331) 2026-03-04 09:30:40 +01:00
llava.cpp fixed gemma4v image crashing on encode, however images are not yet working correctly 2026-04-03 15:56:35 +08:00
llava.h track clip img patch nx and ny 2025-12-18 22:58:10 +08:00
mtmd-audio.cpp mtmd: add Gemma 4 audio conformer encoder support (#21421) 2026-04-12 14:15:26 +02:00
mtmd-audio.h mtmd: add Gemma 4 audio conformer encoder support (#21421) 2026-04-12 14:15:26 +02:00
mtmd-cli.cpp common : move up common_init() and fix Windows UTF-8 logs (#21176) 2026-03-31 12:53:41 +02:00
mtmd-helper.cpp Merge branch 'upstream' into concedo_experimental 2026-04-20 20:19:11 +08:00
mtmd-helper.h mtmd: add pos_0 to mtmd_image_tokens_get_decoder_pos (breaking change) (#22082) 2026-04-19 11:57:21 +02:00
mtmd-image.cpp mtmd: fix crash when sending image under 2x2 pixels (#21711) 2026-04-12 23:59:21 +02:00
mtmd-image.h model : support step3-vl-10b (#21287) 2026-04-08 09:51:31 +02:00
mtmd.cpp mtmd: correct mtmd_decode_use_mrope() (#22188) 2026-04-21 10:53:37 +02:00
mtmd.h mtmd: correct get_n_pos / get_decoder_pos (#22175) 2026-04-20 23:29:19 +02:00
requirements.txt requirements : update transformers/torch for Embedding Gemma (#15828) 2025-09-09 06:06:52 +02:00
test-1.jpeg mtmd : rename llava directory to mtmd (#13311) 2025-05-05 16:02:55 +02:00
test-2.mp3 mtmd : support Qwen 2.5 Omni (input audio+vision, no audio output) (#13784) 2025-05-27 14:06:10 +02:00
tests.sh docs: listing qwen3-asr and qwen3-omni as supported (#21857) 2026-04-13 22:28:17 +02:00