Commit graph

1165 commits

Author SHA1 Message Date
Concedo
7527f1eff0 handle media for jinja path (+1 squashed commits)
Squashed commits:

[29d47d6b7] handle media for jinja path
2025-11-27 11:40:08 +08:00
Concedo
d68f4a5ae5 disable clip fa for now 2025-11-27 10:20:38 +08:00
Concedo
2b00292bfe display path on 404 2025-11-27 10:07:08 +08:00
Concedo
c12f9e3b7c bump version 2025-11-27 01:04:09 +08:00
Wagner Bruna
998dfcd1be
sd: add an API endpoint to list the available schedulers (#1856) 2025-11-26 22:49:36 +08:00
Concedo
d9b9c54393 added another alias to jinjatools 2025-11-26 18:52:08 +08:00
Concedo
9b6320cd71 adjust launcher scaling behavior 2025-11-25 21:32:03 +08:00
Concedo
9a7f749f7c minor tweak for sd 2025-11-24 22:31:03 +08:00
Wagner Bruna
3a7dd1a97f sd: sync to master-358-347710f
Also adapt Koboldcpp LoRA loading function, and add
backend support for lora_apply_mode.
2025-11-23 19:28:54 -03:00
Concedo
1cc4403cba updated llama.cpp web ui (+2 squashed commit)
Squashed commit:

[9b22ac6e4] more fixes for lcpp web ui,. will be squashed

[522b59b4c] henky tries using svelte or something
2025-11-24 00:43:27 +08:00
Rose
eeb7363985
improvements to tool calling logic (merged changes from old PR branch) (#1855)
* improvements to tool calling logic (merged changes from old PR branch)

* added some tweaks for improved tool calls to reuse old ctx, but needs testing. refer to PR.

* fixes to some stuff that concedo's modifications broke

* fixed error in reasoning

* extremely hacky way to cache tool list please fix

* oops forgot to add this

* slightly less hacky way to preserve the tool list in context

* prevented unintended toolcalls from happening when LLM states something irrelevant to toolcall decision

* fixed something that broke koboldlite

* fixed bug added by concedo that broke jinja tools

* experimental further compression of tools array, needs testing

* reverted experimental further compression of tools array

* final cleanup

* add newline after memory insert

* changed tool reasoning to always be in json format to enforce including final decision

* used new json format to skip extra llm call when not necessary

* more catching of possible bad llm output

* further cleanup

* got it down to just one llm call!

* better json format

* even better json format

* further refinement to json format

* further refinement to json format

* fixed broken tool calling

* single-call enforced json method now seems to work well. removed fallbacks as they are no longer required.

---------

Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>
2025-11-23 22:41:31 +08:00
Concedo
b281d2554a add a resume after horde worker pauses 2025-11-18 22:48:18 +08:00
Concedo
8631bbcee3 linting 2025-11-18 18:56:31 +08:00
LostRuins Concedo
281542aa0d add smoothing curve, not tested 2025-11-17 23:07:35 +08:00
LostRuins Concedo
ea22e04320 filename insenstive search for adapters 2025-11-15 09:48:22 +08:00
LostRuins Concedo
357bef3082 add toggle for jinja tools 2025-11-12 17:29:42 +08:00
LostRuins Concedo
95291a93df
rosie fixes: add format normalization for tools and tool call streaming fixes (#1842) 2025-11-11 23:06:27 +08:00
LostRuins Concedo
cdc18f0945 linting (+1 squashed commits)
Squashed commits:

[994427d3c] linting
2025-11-10 20:54:44 +08:00
Wagner Bruna
2ae6bff5bd
split memory detection functions and add debug command (#1832) 2025-11-10 18:07:15 +08:00
LostRuins Concedo
60a74bdd89 make tool calling work with jinja. but still need to fix qwen omni first (+1 squashed commits)
Squashed commits:

[e394da61e] make tool calling work with jinja. but still need to fix qwen omni first
2025-11-09 16:56:14 +08:00
LostRuins Concedo
055fdcef63 update model path
jinja tojson
2025-11-08 21:51:50 +08:00
LostRuins Concedo
af94884971 update props 2025-11-08 10:15:13 +08:00
LostRuins Concedo
92b5afc019 flag to show if jinja is enabled 2025-11-08 00:49:50 +08:00
LostRuins Concedo
462a34ed5b jinja is now working 2025-11-07 23:46:22 +08:00
LostRuins Concedo
cfb22b5c9d rename a missed BLAS -> batch 2025-11-06 16:11:26 +08:00
LostRuins Concedo
978d755ddc escape clause for tool calling 2025-11-05 22:02:24 +08:00
LostRuins Concedo
3e4a33499f updated lite 2025-11-05 20:52:47 +08:00
LostRuins Concedo
6ddacb62a0 serve gzipped versions of files. added a modded lcpp gui with modified path handling and proper stream termination, see https://github.com/ggml-org/llama.cpp/pull/14839#issuecomment-3490987929 2025-11-05 20:40:30 +08:00
Concedo
333e2bb30b fix for qwen image crashing due to ref images being too big, trial and error shows it happens after 512x512 2025-11-02 01:31:01 +08:00
xzuyn
988baa544e
add JobRate and JobCost to worker log (#1820)
- adds average jobs per hour
- adds average kudos earned per job
- change EarnRate to show 2 decimal places
2025-11-01 10:01:13 +08:00
Concedo
d229774e11 added compatibility endpoint for VITS api 2025-10-26 17:35:10 +08:00
Concedo
b730c99ecb fixed a typo 2025-10-26 10:06:59 +08:00
Concedo
57e1d9c822 rename blasbatchsize to batchsize 2025-10-24 18:16:54 +08:00
Concedo
68c9d955d2 support multiple override kv 2025-10-24 17:28:54 +08:00
Concedo
7446e03851 send logprobs in streaming for oai 2025-10-21 18:23:56 +08:00
Concedo
7d20e6bdb3 updated layer count to be more accurate +1 instead of +3 2025-10-18 15:29:07 +08:00
Concedo
f6916ba864 updated sdui 2025-10-17 13:56:45 +08:00
Concedo
45a02ae534 rename blas to just batching 2025-10-16 16:27:51 +08:00
Concedo
4eaf05dfeb handle oai without v1 prefix 2025-10-16 02:16:49 +08:00
Concedo
dfeccea3a1 added shitty fractional scaling support for GNOME. but really just use KDE 2025-10-15 22:28:04 +08:00
Concedo
8b787866c6 fixed a typo 2025-10-13 11:14:38 +08:00
Concedo
1a360b8458 sdcpp: optimize the handling of the FeedForward precision fix (+1 squashed commits)
Squashed commits:

[621ff6392] sdcpp: optimize the handling of the FeedForward precision fix (+1 squashed commits)

Squashed commits:

[05b16906c] sdcpp: optimize the handling of the FeedForward precision fix
2025-10-12 17:49:38 +08:00
Concedo
a0ed446e61 handle numbers outside int32 range with wrapping 2025-10-12 12:46:45 +08:00
Wagner Bruna
9f9494cf3f
sd: add 'default' to the list of supported samplers (#1788) 2025-10-12 12:35:56 +08:00
Concedo
5396e62b56 allow resetting missing fields to default 2025-10-09 23:36:38 +08:00
Concedo
96dfa7a038 sdgendefaults follow all other params 2025-10-09 14:57:34 +08:00
Concedo
c1a246c1de fixed typo 2025-10-07 21:51:15 +08:00
Concedo
2fa28fdcf8 wrap sd_parse_meta_field in trycatch 2025-10-06 00:05:19 +08:00
Wagner Bruna
c48999f7c0
additional options for image generation (#1765)
* sd: add backend support for choosing the default sampler

* use the default sampler on the API

* sd: add backend support for the scheduler

* sd: add backend support for distilled guidance

* sd: add backend support for timestep-shift

* sd: add a config field to set default image gen options
2025-10-05 23:36:20 +08:00
Concedo
a09d8333b5 allow lowvram (nkvo) to be used with vulkan. 2025-10-05 16:18:58 +08:00