Concedo
8855a7f52b
Merge commit 'c945aaaef2' into concedo_experimental
# Conflicts:
# .devops/cann.Dockerfile
# .github/workflows/build.yml
# .github/workflows/release.yml
# README.md
# common/CMakeLists.txt
# common/chat.cpp
# docs/function-calling.md
# ggml/src/ggml-cann/aclnn_ops.cpp
# ggml/src/ggml-cann/aclnn_ops.h
# ggml/src/ggml-cann/common.h
# ggml/src/ggml-cann/ggml-cann.cpp
# models/templates/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16.jinja
# scripts/sync_vendor.py
# tests/CMakeLists.txt
# tests/peg-parser/tests.h
# tests/test-chat-peg-parser.cpp
# tests/test-chat-template.cpp
# tests/test-chat.cpp
# tests/testing.h
# tools/llama-bench/llama-bench.cpp
2026-01-17 10:24:03 +08:00
Concedo
d15bd212c5
cleanup
2026-01-17 00:57:33 +08:00
Concedo
cde4791e36
fix tools building
2025-12-19 12:08:29 +08:00
Concedo
a01b49098c
fix tool builds
2025-12-18 23:26:31 +08:00
Concedo
1daeed5d4d
Merge commit '9963b81f63' into concedo_experimental
# Conflicts:
# .github/workflows/server.yml
# SECURITY.md
# docs/backend/SYCL.md
# examples/model-conversion/README.md
# examples/model-conversion/scripts/embedding/compare-embeddings-logits.sh
# ggml/src/ggml-hexagon/ggml-hexagon.cpp
# ggml/src/ggml-hexagon/htp/matmul-ops.c
# tests/CMakeLists.txt
# tests/test-chat.cpp
# tests/test-json-schema-to-grammar.cpp
2025-12-17 20:30:34 +08:00
Concedo
cacfa37611
wip
2025-12-17 16:04:45 +08:00
Wagner Bruna
78bbe89956
sd: sync to master-417-43a70e8 (#1889)
* sd: sync to master-417-43a70e8
* fix sdmain build
* switch to upstream apply_loras()
* refactor u8 path conversions and add it to the gguf reader
2025-12-16 16:16:48 +08:00
Concedo
010995c967
Merge commit '4df6e859e9' into concedo_experimental
# Conflicts:
# .github/workflows/build.yml
# README.md
# ci/run.sh
# examples/gen-docs/gen-docs.cpp
# scripts/snapdragon/adb/run-cli.sh
# tests/test-lora-conversion-inference.sh
# tools/CMakeLists.txt
# tools/completion/CMakeLists.txt
# tools/completion/README.md
# tools/server/CMakeLists.txt
2025-12-12 17:23:25 +08:00
Concedo
cd73613136
moved volta onto tile kernels, so building for cc7.0 can be avoided
this shouldn't do anything (+2 squashed commit)
Squashed commit:
[1cdcb302a] another attempt to tip the scales, part 2
[8f647b709] another attempt to tip the scales (volta)
2025-12-08 19:51:54 +08:00
Concedo
d27949f22a
Revert "try remove volta as a dedicated target b (+1 squashed commits)"
This reverts commit ddba580f00.
2025-12-06 21:31:44 +08:00
Concedo
ddba580f00
try remove volta as a dedicated target b (+1 squashed commits)
Squashed commits:
[2df689a03] try remove volta as a dedicated target
2025-12-06 21:31:06 +08:00
Concedo
e570478275
limit cuda arches + scale tweaks
2025-11-28 13:05:11 +08:00
Wagner Bruna
3318b73c94
sd: sync to master-355-694f0d9
2025-11-23 19:28:34 -03:00
LostRuins Concedo
5751c30790
add vulkan for whisper
2025-11-13 15:37:58 +08:00
LostRuins Concedo
d6a2ad8455
still not really working right
2025-11-09 01:57:48 +08:00
LostRuins Concedo
cfb22b5c9d
rename a missed BLAS -> batch
2025-11-06 16:11:26 +08:00
Concedo
b5d3dcb6c0
add workflow for older pc
2025-10-29 17:35:04 +08:00
Wagner Bruna
d7da1eb35c
invert KCPP_BAKE_SD_VOCAB logic, move define to sdtype_adapter.cpp (#1803)
Using KCPP_BAKE_SD_VOCAB to turn off the change to not embed the
vocabulary files makes testing new upstream merges harder, because
we then need to set that macro on the sd.cpp original build.
So, revert the tests, making the define turn the change on. Also,
since model.cpp is always built by Koboldcpp as part of the
sdtype_adapter.cpp, it's enough to set the macro on that file.
2025-10-20 10:07:37 +08:00
Concedo
59aa1529dc
add embeddings vulkan to makefile
2025-10-13 11:05:45 +08:00
Concedo
e0ba01c65e
fix cuda builds
2025-10-12 20:09:16 +08:00
Concedo
f282362414
added qwen image support (+1 squashed commits)
Squashed commits:
[92df28061] added qwen image support (+1 squashed commits)
Squashed commits:
[1485c71ed] wip adding qwen image
2025-10-03 18:58:48 +08:00
Concedo
4f8f0e5949
move embeds into their own dir, detach sd vocab into separate files
2025-10-03 14:21:09 +08:00
Concedo
c00ae93421
makefile fix vulkan noext compile (+1 squashed commits)
Squashed commits:
[eae88fd49] makefile fix vulkan noext compile
2025-10-02 23:19:45 +08:00
Concedo
1a4f54dd11
update for cu13 builds (no ci will be provided)
2025-09-26 16:01:43 +08:00
Concedo
326f6f3fad
not sure if working on metal
2025-09-21 11:35:02 +08:00
tsite
04498a345a
update makefile to clone llguidance if the directory does not exist (#1743)
also remove llguidance when running 'make clean'
2025-09-21 08:40:55 +08:00
Concedo
fddd046f9d
metal common
2025-09-15 01:58:32 +08:00
Concedo
a5580a32fb
fix cuda and macos compile issues
2025-09-12 20:53:42 +08:00
tsite
27c443f01e
add support for llguidance (#1728)
* add llguidance
remove tab indentation for makefile if statements - these are dangerous
fix broken tool compilation commands
add USE_LLGUIDANCE env var to enable llguidance for faster structured
output generation
add llguidance as an optional submodule
* rm submodule
2025-09-11 16:46:03 +08:00
Concedo
f7fa283bb6
indentation fix for makefile
2025-09-11 16:41:51 +08:00
Concedo
52ff99805c
fixed windows 7 compat builds
2025-08-25 10:36:13 +08:00
Concedo
ed5e7a3062
fix for some old android devices
2025-08-24 01:34:54 +08:00
Concedo
8b8396c30c
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# README.md
# docs/build-s390x.md
# examples/llama.vim
# ggml/src/ggml-cann/aclnn_ops.cpp
# ggml/src/ggml-cann/common.h
# scripts/compare-llama-bench.py
# src/CMakeLists.txt
# tests/test-backend-ops.cpp
# tools/llama-bench/README.md
# tools/llama-bench/llama-bench.cpp
# tools/server/README.md
2025-08-23 11:35:28 +08:00
Concedo
b50f94ae27
this commit removes ggml_cuda_f16 targets. Merge commit '7a6e91ad26' into concedo_experimental
# Conflicts:
# docs/build.md
# docs/multimodal/MobileVLM.md
# ggml/CMakeLists.txt
# ggml/src/ggml-cuda/CMakeLists.txt
# ggml/src/ggml-musa/CMakeLists.txt
2025-08-21 19:25:29 +08:00
Daniel Bevenius
37f10f955f
make : remove make in favor of CMake (#15449)
This commit removes the content from the Makefile and updates the
current deprecation message to state that `make` has been replaced
by CMake.
The message when `make` is invoked will now be the following:
```console
$ make
Makefile:6: *** Build system changed:
The Makefile build has been replaced by CMake.
For build instructions see:
https://github.com/ggml-org/llama.cpp/blob/master/docs/build.md
. Stop.
```
The motivation for this is that many, if not all, targets fail to build
now, after changes to the system, and `make` has also been deprecated for
some time now.
2025-08-20 13:31:16 +03:00
Concedo
35707f4e97
split vulkan into two compilation units for faster build
2025-08-20 12:12:47 +08:00
Concedo
67ef5e6c02
phonemizer fixes, now kokoro works very well
2025-08-18 16:13:16 +08:00
Concedo
52606e9b1d
tts cpp model is now loadable in kcpp
2025-08-17 15:47:22 +08:00
Concedo
9935ac093f
standardize tts linting and formatting
2025-08-17 14:11:30 +08:00
Concedo
cfc1a0d4ef
tts cpp cli builds and runs fine.
2025-08-17 13:53:27 +08:00
Concedo
bc04366a65
builds but crashes
2025-08-17 00:09:03 +08:00
Concedo
67e0072245
fixed clblast repacking
2025-08-09 01:08:02 +08:00
Concedo
d37529c0cd
add sanitize flag
2025-08-04 22:19:23 +08:00
Concedo
4db8ba6228
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# ggml/src/ggml-sycl/gemm.hpp
# ggml/src/ggml-sycl/ggml-sycl.cpp
# ggml/src/ggml-sycl/set_rows.cpp
2025-07-14 23:16:44 +08:00
Concedo
dca49de059
fixed qwen2 audio issues, works fine now (+3 squashed commit)
Squashed commit:
[b3053a1ba] updated lite
[5071630d6] fixed mtmd issues, audio works
[06efa5af4] fix mtmd compile
2025-07-12 18:54:41 +08:00
Concedo
e9473305d0
wip2 (+1 squashed commits)
Squashed commits:
[4628777b6] wip
2025-07-12 18:54:40 +08:00
Concedo
f8a49aa8e6
fixed a typo
2025-07-08 11:41:09 +08:00
Concedo
18cd46a6db
allow people to manually override gfx12 fa
2025-07-05 11:33:30 +08:00
Concedo
abc1d8ac25
better way of checking for avx2 support
2025-06-22 22:56:50 +08:00
Concedo
45f589b78d
test gfx1200 again
2025-06-21 17:56:04 +08:00