Commit graph

1396 commits

Author SHA1 Message Date
Concedo
13e932b241 more fixes for gemma4 2026-04-05 10:34:40 +08:00
Eso
11bc83229a
fix: Autoswap with override configs (#2091)
* fix: Autoswap with overrides

* fix: Autoswap with overrides
2026-04-05 00:43:19 +08:00
Concedo
f7c9029668 change env var KOBOLDCPP_PASSWORD to KCPP_PASSWORD names for consistency, same for KOBOLDCPP_ADMINPASSWORD to KCPP_ADMINPASSWORD 2026-04-04 23:36:30 +08:00
Concedo
d3d50a7b3c fixed reasoning content response in fakestreaming tools 2026-04-04 23:03:33 +08:00
Concedo
ac92ac22d7 tool call fix 2026-04-04 22:35:03 +08:00
Concedo
eb3422996a BOS fix for gemma4 2026-04-04 22:15:01 +08:00
Concedo
a33eda3842 more template fixes for the gemma4 31b 2026-04-04 14:23:16 +08:00
Concedo
1c834fcbd3 try to match template more closely (+2 squashed commit)
Squashed commit:

[466808010] try to match template more closely

[9f805e753] try to match template more closely
2026-04-04 13:50:04 +08:00
Concedo
f39bfc39e9 gemma4 jina tool calls fixed, added gemma4 <eos> token handling 2026-04-03 17:39:21 +08:00
Concedo
11b384aa6a tweak tooltip 2026-04-03 14:30:02 +08:00
Shoaib
f05f680203
[admin]password defaults to env var (#2086) 2026-04-03 14:28:03 +08:00
Concedo
5dee1a1cbb Merge commit 'fbd441c379' into concedo_experimental
# Conflicts:
#	.github/workflows/build.yml
#	AGENTS.md
#	ci/run.sh
#	docs/build.md
#	embd_res/templates/LFM2.5-Instruct.jinja
#	ggml/CMakeLists.txt
#	ggml/src/ggml-cuda/fattn.cu
#	ggml/src/ggml-hexagon/ggml-hexagon.cpp
#	ggml/src/ggml-hexagon/htp/CMakeLists.txt
#	ggml/src/ggml-hexagon/htp/htp-msg.h
#	ggml/src/ggml-hexagon/htp/htp-ops.h
#	ggml/src/ggml-hexagon/htp/hvx-div.h
#	ggml/src/ggml-hexagon/htp/main.c
#	ggml/src/ggml-hexagon/htp/unary-ops.c
#	ggml/src/ggml-opencl/ggml-opencl.cpp
#	ggml/src/ggml-sycl/common.hpp
#	ggml/src/ggml-sycl/convert.cpp
#	ggml/src/ggml-sycl/dequantize.hpp
#	ggml/src/ggml-sycl/mmvq.cpp
#	ggml/src/ggml-sycl/vecdotq.hpp
#	ggml/src/ggml-webgpu/ggml-webgpu-shader-lib.hpp
#	ggml/src/ggml-webgpu/ggml-webgpu.cpp
#	ggml/src/ggml-webgpu/wgsl-shaders/common_decls.tmpl
#	ggml/src/ggml-webgpu/wgsl-shaders/flash_attn.wgsl
#	ggml/src/ggml-webgpu/wgsl-shaders/mul_mat_decls.tmpl
#	ggml/src/ggml-webgpu/wgsl-shaders/mul_mat_vec.wgsl
#	scripts/hip/gcn-cdna-vgpr-check.py
#	scripts/sync-ggml.last
#	tests/test-chat.cpp
2026-04-03 01:06:02 +08:00
Concedo
be6ff2602d added config overwriting 2026-04-03 00:21:14 +08:00
Concedo
5f0038cdc9 this section wasnt needed for qwen3.5 jinja and actually breaks. 2026-04-02 21:39:24 +08:00
Concedo
feac72cb05 improve jinja tool calling 2026-04-01 17:01:37 +08:00
Concedo
08cf75ce4b jinja tool calls for qwen3.5 and glm 2026-04-01 16:26:20 +08:00
Concedo
133b1f2654 fix returned names 2026-04-01 10:47:40 +08:00
Concedo
f91de9dcb0 jinja tool calls - parse body if parsable 2026-03-31 23:14:52 +08:00
Concedo
088c01e2a7 add jaxxks mutex lock for proxy during request 2026-03-31 17:03:38 +08:00
Concedo
2acf209972 minor gui cleanup 2026-03-31 16:59:10 +08:00
Eso
79209a14a8
feat: Autoswap functionality (#2080)
* feat: Autoswap mode (cherry-picked from remoteManagement)

Co-authored-by: esolithe <65901558+esolithe@users.noreply.github.com>

* fix: Remove modelOverride, add triggered_sleeping to autoswap unload timeout branch

Agent-Logs-Url: https://github.com/esolithe/esobold/sessions/1ddb3f88-43b4-4234-aa41-0fe6c9976db4

Co-authored-by: esolithe <65901558+esolithe@users.noreply.github.com>

* fix: Remove esobold-specific GUI elements from admin tab, renumber remaining rows

Agent-Logs-Url: https://github.com/esolithe/esobold/sessions/6a2e4ec3-cb19-4f98-b00f-bdb13749ead3

Co-authored-by: esolithe <65901558+esolithe@users.noreply.github.com>

* fix: Removed unneeded changes

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
2026-03-31 16:49:59 +08:00
Concedo
9fe8027ed3 try fix actions 2026-03-31 15:46:46 +08:00
Concedo
894591da7c increase ctx size slider 2026-03-30 21:41:31 +08:00
Concedo
4fc3c28f1a reasoning output parsing improvements 2026-03-29 23:35:24 +08:00
Concedo
674b7f5eee indicate support for claude messages api 2026-03-29 00:57:58 +08:00
Concedo
e3b7905e1c added anthropic messages api support 2026-03-29 00:55:32 +08:00
Concedo
5ad9e3ee31 crude openai responses streaming 2026-03-29 00:16:30 +08:00
Concedo
1e787cd03a improve responses api 2026-03-28 18:42:15 +08:00
Wagner Bruna
e3c6227d46
sd: report back image generation parameters and metadata (#2062)
* sd: refactor image generation result handling

* sd: report back image generation metadata
2026-03-28 00:49:03 +08:00
Concedo
0c2b679ea3 support bf16 quantkv cache type 2026-03-28 00:01:17 +08:00
Concedo
326542f480 rudimentary responses api, not usable yet 2026-03-27 23:38:08 +08:00
scottf007
f0818e1eae
Add socket timeout to is_port_in_use() to fix ~280s startup delay on WSL2 (#2077)
On WSL2 with networkingMode=mirrored, connect_ex() to non-listening ports
gets black-holed through the Windows host networking stack instead of
returning ECONNREFUSED. Without a timeout, TCP SYN retransmits with
exponential backoff (1+2+4+8+16+32+64 ≈ 127s per port), causing Router
Mode's port scan of 15001-15010 to stall for ~280 seconds on startup.

Adding a 1-second timeout makes connect_ex() fail fast, reducing startup
from ~303s to ~23s on affected systems.

Tested on WSL2 Ubuntu 24.04 with mirrored networking, KoboldCpp v1.110,
RTX 3090 Ti, Qwen3.5-27B Q4_K_M.
2026-03-27 22:50:59 +08:00
Concedo
a03998bed6 added jinja kwargs support 2026-03-27 00:28:59 +08:00
Concedo
c91f350ed5 increase max images, take images from the end instead of beginning if too many images 2026-03-26 23:03:52 +08:00
Concedo
39938e19d3 allow router mode to auto-wake other endpoints if put to sleep by auto unload 2026-03-25 23:17:20 +08:00
Concedo
24ab1c1451 upgrade musicui to do tts, show musicui for tts models (+1 squashed commits)
Squashed commits:

[975630b15] upgrade musicui to do tts
2026-03-25 00:24:44 +08:00
Concedo
8437c346a7 fixed tts instruction regex, encapsulate thinking by default 2026-03-24 13:53:46 +08:00
Concedo
9e9028b1a9 fixed cpu mis-selection 2026-03-23 21:30:57 +08:00
Concedo
0d50cafd8b added CustomVoice support 2026-03-23 18:50:08 +08:00
Concedo
0aa6f21c88 jinja prefill fixed 2026-03-22 14:55:44 +08:00
Concedo
79e39e1989 fixed a help menu bug, updated colab (+1 squashed commits)
Squashed commits:

[618478e00] fixed a help menu bug, updated colab
2026-03-22 01:00:30 +08:00
Concedo
89e2397014 updatede lite, up ver (+1 squashed commits)
Squashed commits:

[f1f899070] up version
2026-03-21 17:42:58 +08:00
Concedo
fdfb713d91 added --sdmaingpu allowing image models to be independently placed on any gpu 2026-03-21 17:34:12 +08:00
Concedo
a3d3800f3e added passthrough mode for esrgan upscale, triggered by img2img denoise 0.0 with 1 step 2026-03-21 16:19:10 +08:00
Concedo
58a585d0e7 popular templates section in help menu 2026-03-21 15:37:07 +08:00
Concedo
c4b1a17e1a tools debug 2026-03-19 23:13:02 +08:00
Concedo
2f63f94fd8 fix router nocertify mode 2026-03-19 12:45:19 +08:00
Concedo
8cf9ba34e9 fixed SSL in routermode 2026-03-19 12:43:11 +08:00
Concedo
15e86010d8 autofit will clear moecpu and overridetensors 2026-03-18 21:20:57 +08:00
Concedo
d85272a958 fixed wrong encoding (+1 squashed commits)
Squashed commits:

[a87d059a8] fixed wrong encoding
2026-03-17 15:54:54 +08:00