Rose
2f67e9f096
new baseconfig setting that works in router mode (#2130)
...
* new baseconfig setting that works in router mode
* re-added fix that prevents unnecessary model reload
* fixed the fix
* swapped order of baseconfig <-> override
* fix indent
* simplify baseconfig, if specified AND restart_override_config_target is NOT, it simply replaces the field (+1 squashed commits)
Squashed commits:
[95e816b16] simplify baseconfig, if specified AND restart_override_config_target is NOT, it simply replaces the field
---------
Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>
2026-04-15 22:50:47 +08:00
Concedo
c6b59fc2c7
autoswap some edge conditions
2026-04-14 23:02:29 +08:00
Concedo
3f810dc8c7
fixed preload story import for large stories
2026-04-13 23:27:55 +08:00
Concedo
c984147c84
fix quotes
2026-04-13 22:50:08 +08:00
Concedo
5a3369fd2a
support for gpt oss jinja
2026-04-12 16:13:51 +08:00
Concedo
4084917cab
fixed token counting limit (+1 squashed commits)
...
Squashed commits:
[314528eb2] fixed token counting limit, set to max supported ctx of 256k
2026-04-12 15:36:03 +08:00
Concedo
f07dcbf7af
allow tokencount to handle messages
2026-04-12 11:46:37 +08:00
Concedo
6556161804
jinja tool streaming is now finally working
2026-04-12 02:05:39 +08:00
Concedo
c4abba8868
almost working
2026-04-12 01:44:41 +08:00
Concedo
3175da0873
cleanup - do not use tool calls from kai api, only
2026-04-11 12:19:48 +08:00
Wagner Bruna
f4fbd94129
sdapi: add job_timestamp field to info result (#2110)
2026-04-11 09:28:53 +08:00
Concedo
0f278d93b3
better image handling in jinja
2026-04-10 22:19:01 +08:00
Concedo
b962335c99
fix rosie toolcall issue
2026-04-10 21:23:33 +08:00
Concedo
bcf499e5bf
fix gemma tool calling
2026-04-10 20:51:40 +08:00
Concedo
ffdc3ba49e
tool coercion fixes
2026-04-10 18:25:07 +08:00
Concedo
618db91e3d
should pass tc08 now
2026-04-09 23:15:34 +08:00
Concedo
cfcbfd571a
fix think leaking in sync mode
2026-04-09 21:29:56 +08:00
Concedo
c82c0b463a
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .github/labeler.yml
# .github/workflows/release.yml
# examples/debug/debug.cpp
# ggml/src/ggml-cuda/common.cuh
# ggml/src/ggml-cuda/mmq.cuh
# ggml/src/ggml-webgpu/ggml-webgpu.cpp
# src/llama-vocab.cpp
# tests/test-backend-ops.cpp
# tests/test-chat.cpp
# tests/test-json-schema-to-grammar.cpp
# tools/mtmd/CMakeLists.txt
2026-04-09 17:45:04 +08:00
Concedo
f6199d42e1
tool response type coercion
2026-04-09 12:59:57 +08:00
Concedo
77d0ddb486
even better tool calls
2026-04-08 23:40:42 +08:00
Concedo
d9ed4b444b
multiuser default 10
2026-04-07 23:42:29 +08:00
Concedo
5e16453f0c
fixed a bug in chat completions think handling
2026-04-07 00:16:34 +08:00
Concedo
a395af65db
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .github/workflows/build-riscv.yml
# .github/workflows/build.yml
# ggml/src/ggml-hexagon/htp/argsort-ops.c
# ggml/src/ggml-sycl/fattn-tile.hpp
# tools/mtmd/CMakeLists.txt
2026-04-06 20:56:02 +08:00
Concedo
a309086735
Revert "increase debug mode truncation limit"
...
This reverts commit 59f863746d.
2026-04-06 18:51:12 +08:00
Concedo
59f863746d
increase debug mode truncation limit
2026-04-06 17:57:44 +08:00
Concedo
63ca37e62a
fix assistant prefill logic (+1 squashed commits)
...
Squashed commits:
[f4963baf5] fix prefills
2026-04-05 23:25:44 +08:00
Concedo
53b3bf46e4
fixed a typo
2026-04-05 18:46:30 +08:00
Concedo
49941b6268
handle think streaming for gemma4
2026-04-05 13:48:07 +08:00
Concedo
13e932b241
more fixes for gemma4
2026-04-05 10:34:40 +08:00
Eso
11bc83229a
fix: Autoswap with override configs (#2091)
...
* fix: Autoswap with overrides
* fix: Autoswap with overrides
2026-04-05 00:43:19 +08:00
Concedo
f7c9029668
rename env var KOBOLDCPP_PASSWORD to KCPP_PASSWORD for consistency, and likewise KOBOLDCPP_ADMINPASSWORD to KCPP_ADMINPASSWORD
2026-04-04 23:36:30 +08:00
Concedo
d3d50a7b3c
fixed reasoning content response in fakestreaming tools
2026-04-04 23:03:33 +08:00
Concedo
ac92ac22d7
tool call fix
2026-04-04 22:35:03 +08:00
Concedo
eb3422996a
BOS fix for gemma4
2026-04-04 22:15:01 +08:00
Concedo
a33eda3842
more template fixes for the gemma4 31b
2026-04-04 14:23:16 +08:00
Concedo
1c834fcbd3
try to match template more closely (+2 squashed commit)
...
Squashed commit:
[466808010] try to match template more closely
[9f805e753] try to match template more closely
2026-04-04 13:50:04 +08:00
Concedo
f39bfc39e9
gemma4 jinja tool calls fixed, added gemma4 <eos> token handling
2026-04-03 17:39:21 +08:00
Concedo
11b384aa6a
tweak tooltip
2026-04-03 14:30:02 +08:00
Shoaib
f05f680203
[admin]password defaults to env var (#2086)
2026-04-03 14:28:03 +08:00
Concedo
5dee1a1cbb
Merge commit 'fbd441c379' into concedo_experimental
...
# Conflicts:
# .github/workflows/build.yml
# AGENTS.md
# ci/run.sh
# docs/build.md
# embd_res/templates/LFM2.5-Instruct.jinja
# ggml/CMakeLists.txt
# ggml/src/ggml-cuda/fattn.cu
# ggml/src/ggml-hexagon/ggml-hexagon.cpp
# ggml/src/ggml-hexagon/htp/CMakeLists.txt
# ggml/src/ggml-hexagon/htp/htp-msg.h
# ggml/src/ggml-hexagon/htp/htp-ops.h
# ggml/src/ggml-hexagon/htp/hvx-div.h
# ggml/src/ggml-hexagon/htp/main.c
# ggml/src/ggml-hexagon/htp/unary-ops.c
# ggml/src/ggml-opencl/ggml-opencl.cpp
# ggml/src/ggml-sycl/common.hpp
# ggml/src/ggml-sycl/convert.cpp
# ggml/src/ggml-sycl/dequantize.hpp
# ggml/src/ggml-sycl/mmvq.cpp
# ggml/src/ggml-sycl/vecdotq.hpp
# ggml/src/ggml-webgpu/ggml-webgpu-shader-lib.hpp
# ggml/src/ggml-webgpu/ggml-webgpu.cpp
# ggml/src/ggml-webgpu/wgsl-shaders/common_decls.tmpl
# ggml/src/ggml-webgpu/wgsl-shaders/flash_attn.wgsl
# ggml/src/ggml-webgpu/wgsl-shaders/mul_mat_decls.tmpl
# ggml/src/ggml-webgpu/wgsl-shaders/mul_mat_vec.wgsl
# scripts/hip/gcn-cdna-vgpr-check.py
# scripts/sync-ggml.last
# tests/test-chat.cpp
2026-04-03 01:06:02 +08:00
Concedo
be6ff2602d
added config overwriting
2026-04-03 00:21:14 +08:00
Concedo
5f0038cdc9
this section wasn't needed for qwen3.5 jinja and actually breaks.
2026-04-02 21:39:24 +08:00
Concedo
feac72cb05
improve jinja tool calling
2026-04-01 17:01:37 +08:00
Concedo
08cf75ce4b
jinja tool calls for qwen3.5 and glm
2026-04-01 16:26:20 +08:00
Concedo
133b1f2654
fix returned names
2026-04-01 10:47:40 +08:00
Concedo
f91de9dcb0
jinja tool calls - parse body if parsable
2026-03-31 23:14:52 +08:00
Concedo
088c01e2a7
add jaxxks mutex lock for proxy during request
2026-03-31 17:03:38 +08:00
Concedo
2acf209972
minor gui cleanup
2026-03-31 16:59:10 +08:00
Eso
79209a14a8
feat: Autoswap functionality (#2080)
...
* feat: Autoswap mode (cherry-picked from remoteManagement)
Co-authored-by: esolithe <65901558+esolithe@users.noreply.github.com>
* fix: Remove modelOverride, add triggered_sleeping to autoswap unload timeout branch
Agent-Logs-Url: https://github.com/esolithe/esobold/sessions/1ddb3f88-43b4-4234-aa41-0fe6c9976db4
Co-authored-by: esolithe <65901558+esolithe@users.noreply.github.com>
* fix: Remove esobold-specific GUI elements from admin tab, renumber remaining rows
Agent-Logs-Url: https://github.com/esolithe/esobold/sessions/6a2e4ec3-cb19-4f98-b00f-bdb13749ead3
Co-authored-by: esolithe <65901558+esolithe@users.noreply.github.com>
* fix: Removed unneeded changes
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
2026-03-31 16:49:59 +08:00
Concedo
9fe8027ed3
try fix actions
2026-03-31 15:46:46 +08:00