Concedo
|
f07dcbf7af
|
allow tokencount to handle messages
|
2026-04-12 11:46:37 +08:00 |
|
Concedo
|
6556161804
|
jinja tool streaming is now finally working
|
2026-04-12 02:05:39 +08:00 |
|
Concedo
|
c4abba8868
|
almost working
|
2026-04-12 01:44:41 +08:00 |
|
Concedo
|
3175da0873
|
cleanup - do not use tool calls from kai api, only
|
2026-04-11 12:19:48 +08:00 |
|
Wagner Bruna
|
f4fbd94129
|
sdapi: add job_timestamp field to info result (#2110)
|
2026-04-11 09:28:53 +08:00 |
|
Concedo
|
0f278d93b3
|
better image handling in jinja
|
2026-04-10 22:19:01 +08:00 |
|
Concedo
|
b962335c99
|
fix rosie toolcall issue
|
2026-04-10 21:23:33 +08:00 |
|
Concedo
|
bcf499e5bf
|
fix gemma tool calling
|
2026-04-10 20:51:40 +08:00 |
|
Concedo
|
ffdc3ba49e
|
tool coercion fixes
|
2026-04-10 18:25:07 +08:00 |
|
Concedo
|
618db91e3d
|
should pass tc08 now
|
2026-04-09 23:15:34 +08:00 |
|
Concedo
|
cfcbfd571a
|
fix think leaking in sync mode
|
2026-04-09 21:29:56 +08:00 |
|
Concedo
|
c82c0b463a
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# .github/labeler.yml
# .github/workflows/release.yml
# examples/debug/debug.cpp
# ggml/src/ggml-cuda/common.cuh
# ggml/src/ggml-cuda/mmq.cuh
# ggml/src/ggml-webgpu/ggml-webgpu.cpp
# src/llama-vocab.cpp
# tests/test-backend-ops.cpp
# tests/test-chat.cpp
# tests/test-json-schema-to-grammar.cpp
# tools/mtmd/CMakeLists.txt
|
2026-04-09 17:45:04 +08:00 |
|
Concedo
|
f6199d42e1
|
tool response type coercion
|
2026-04-09 12:59:57 +08:00 |
|
Concedo
|
77d0ddb486
|
even better tool calls
|
2026-04-08 23:40:42 +08:00 |
|
Concedo
|
d9ed4b444b
|
multiuser default 10
|
2026-04-07 23:42:29 +08:00 |
|
Concedo
|
5e16453f0c
|
fixed a bug in chat completions think handling
|
2026-04-07 00:16:34 +08:00 |
|
Concedo
|
a395af65db
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# .github/workflows/build-riscv.yml
# .github/workflows/build.yml
# ggml/src/ggml-hexagon/htp/argsort-ops.c
# ggml/src/ggml-sycl/fattn-tile.hpp
# tools/mtmd/CMakeLists.txt
|
2026-04-06 20:56:02 +08:00 |
|
Concedo
|
a309086735
|
Revert "increase debug mode truncation limit"
This reverts commit 59f863746d.
|
2026-04-06 18:51:12 +08:00 |
|
Concedo
|
59f863746d
|
increase debug mode truncation limit
|
2026-04-06 17:57:44 +08:00 |
|
Concedo
|
63ca37e62a
|
fix assistant prefill logic (+1 squashed commits)
Squashed commits:
[f4963baf5] fix prefills
|
2026-04-05 23:25:44 +08:00 |
|
Concedo
|
53b3bf46e4
|
fixed a typo
|
2026-04-05 18:46:30 +08:00 |
|
Concedo
|
49941b6268
|
handle think streaming for gemma4
|
2026-04-05 13:48:07 +08:00 |
|
Concedo
|
13e932b241
|
more fixes for gemma4
|
2026-04-05 10:34:40 +08:00 |
|
Eso
|
11bc83229a
|
fix: Autoswap with override configs (#2091)
* fix: Autoswap with overrides
* fix: Autoswap with overrides
|
2026-04-05 00:43:19 +08:00 |
|
Concedo
|
f7c9029668
|
rename env vars for consistency: KOBOLDCPP_PASSWORD to KCPP_PASSWORD, and KOBOLDCPP_ADMINPASSWORD to KCPP_ADMINPASSWORD
|
2026-04-04 23:36:30 +08:00 |
|
Concedo
|
d3d50a7b3c
|
fixed reasoning content response in fakestreaming tools
|
2026-04-04 23:03:33 +08:00 |
|
Concedo
|
ac92ac22d7
|
tool call fix
|
2026-04-04 22:35:03 +08:00 |
|
Concedo
|
eb3422996a
|
BOS fix for gemma4
|
2026-04-04 22:15:01 +08:00 |
|
Concedo
|
a33eda3842
|
more template fixes for the gemma4 31b
|
2026-04-04 14:23:16 +08:00 |
|
Concedo
|
1c834fcbd3
|
try to match template more closely (+2 squashed commit)
Squashed commit:
[466808010] try to match template more closely
[9f805e753] try to match template more closely
|
2026-04-04 13:50:04 +08:00 |
|
Concedo
|
f39bfc39e9
|
gemma4 jinja tool calls fixed, added gemma4 <eos> token handling
|
2026-04-03 17:39:21 +08:00 |
|
Concedo
|
11b384aa6a
|
tweak tooltip
|
2026-04-03 14:30:02 +08:00 |
|
Shoaib
|
f05f680203
|
[admin]password defaults to env var (#2086)
|
2026-04-03 14:28:03 +08:00 |
|
Concedo
|
5dee1a1cbb
|
Merge commit 'fbd441c379' into concedo_experimental
# Conflicts:
# .github/workflows/build.yml
# AGENTS.md
# ci/run.sh
# docs/build.md
# embd_res/templates/LFM2.5-Instruct.jinja
# ggml/CMakeLists.txt
# ggml/src/ggml-cuda/fattn.cu
# ggml/src/ggml-hexagon/ggml-hexagon.cpp
# ggml/src/ggml-hexagon/htp/CMakeLists.txt
# ggml/src/ggml-hexagon/htp/htp-msg.h
# ggml/src/ggml-hexagon/htp/htp-ops.h
# ggml/src/ggml-hexagon/htp/hvx-div.h
# ggml/src/ggml-hexagon/htp/main.c
# ggml/src/ggml-hexagon/htp/unary-ops.c
# ggml/src/ggml-opencl/ggml-opencl.cpp
# ggml/src/ggml-sycl/common.hpp
# ggml/src/ggml-sycl/convert.cpp
# ggml/src/ggml-sycl/dequantize.hpp
# ggml/src/ggml-sycl/mmvq.cpp
# ggml/src/ggml-sycl/vecdotq.hpp
# ggml/src/ggml-webgpu/ggml-webgpu-shader-lib.hpp
# ggml/src/ggml-webgpu/ggml-webgpu.cpp
# ggml/src/ggml-webgpu/wgsl-shaders/common_decls.tmpl
# ggml/src/ggml-webgpu/wgsl-shaders/flash_attn.wgsl
# ggml/src/ggml-webgpu/wgsl-shaders/mul_mat_decls.tmpl
# ggml/src/ggml-webgpu/wgsl-shaders/mul_mat_vec.wgsl
# scripts/hip/gcn-cdna-vgpr-check.py
# scripts/sync-ggml.last
# tests/test-chat.cpp
|
2026-04-03 01:06:02 +08:00 |
|
Concedo
|
be6ff2602d
|
added config overwriting
|
2026-04-03 00:21:14 +08:00 |
|
Concedo
|
5f0038cdc9
|
this section wasn't needed for qwen3.5 jinja and actually breaks.
|
2026-04-02 21:39:24 +08:00 |
|
Concedo
|
feac72cb05
|
improve jinja tool calling
|
2026-04-01 17:01:37 +08:00 |
|
Concedo
|
08cf75ce4b
|
jinja tool calls for qwen3.5 and glm
|
2026-04-01 16:26:20 +08:00 |
|
Concedo
|
133b1f2654
|
fix returned names
|
2026-04-01 10:47:40 +08:00 |
|
Concedo
|
f91de9dcb0
|
jinja tool calls - parse body if parsable
|
2026-03-31 23:14:52 +08:00 |
|
Concedo
|
088c01e2a7
|
add jaxxks mutex lock for proxy during request
|
2026-03-31 17:03:38 +08:00 |
|
Concedo
|
2acf209972
|
minor gui cleanup
|
2026-03-31 16:59:10 +08:00 |
|
Eso
|
79209a14a8
|
feat: Autoswap functionality (#2080)
* feat: Autoswap mode (cherry-picked from remoteManagement)
Co-authored-by: esolithe <65901558+esolithe@users.noreply.github.com>
* fix: Remove modelOverride, add triggered_sleeping to autoswap unload timeout branch
Agent-Logs-Url: https://github.com/esolithe/esobold/sessions/1ddb3f88-43b4-4234-aa41-0fe6c9976db4
Co-authored-by: esolithe <65901558+esolithe@users.noreply.github.com>
* fix: Remove esobold-specific GUI elements from admin tab, renumber remaining rows
Agent-Logs-Url: https://github.com/esolithe/esobold/sessions/6a2e4ec3-cb19-4f98-b00f-bdb13749ead3
Co-authored-by: esolithe <65901558+esolithe@users.noreply.github.com>
* fix: Removed unneeded changes
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
|
2026-03-31 16:49:59 +08:00 |
|
Concedo
|
9fe8027ed3
|
try fix actions
|
2026-03-31 15:46:46 +08:00 |
|
Concedo
|
894591da7c
|
increase ctx size slider
|
2026-03-30 21:41:31 +08:00 |
|
Concedo
|
4fc3c28f1a
|
reasoning output parsing improvements
|
2026-03-29 23:35:24 +08:00 |
|
Concedo
|
674b7f5eee
|
indicate support for claude messages api
|
2026-03-29 00:57:58 +08:00 |
|
Concedo
|
e3b7905e1c
|
added anthropic messages api support
|
2026-03-29 00:55:32 +08:00 |
|
Concedo
|
5ad9e3ee31
|
crude openai responses streaming
|
2026-03-29 00:16:30 +08:00 |
|
Concedo
|
1e787cd03a
|
improve responses api
|
2026-03-28 18:42:15 +08:00 |
|