Commit graph

1059 commits

Author SHA1 Message Date
Concedo
f125e724eb fix off-by-one npast during some instances of fast forwarding 2025-05-22 19:51:21 +08:00
Concedo
440350327c set random range for seed 2025-05-21 23:47:18 +08:00
Wagner Bruna
5d0cfc9db3 store on the image the actual random seed, for reproducibility (#1549) 2025-05-21 23:40:47 +08:00
Concedo
8b6dfbd1be disabling the gMask prefix for glm-4 completions 2025-05-21 17:29:24 +08:00
Concedo
49305942ab try disabling the gMask prefix for glm-4 completions 2025-05-21 16:47:08 +08:00
Concedo
5f4923bf24 backend tag replacement for endtags. view results with debug mode. 2025-05-19 23:14:43 +08:00
Concedo
710c747b60 minor noscript edit 2025-05-19 17:51:44 +08:00
Concedo
c546cb638e disable showgui if skiplauncher is used 2025-05-18 01:42:14 +08:00
Concedo
ca4274e384 added size info into HF searcher 2025-05-17 00:31:54 +08:00
Concedo
5ccd4b2bf5 horde default max ctx matches main ctx 2025-05-15 10:26:20 +08:00
Concedo
c5ea7fad93 updated lite, only show processed input in debugmode 2025-05-14 17:46:54 +08:00
Concedo
21e31e255b Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/build.yml
#	.github/workflows/docker.yml
#	README.md
#	build-xcframework.sh
#	common/CMakeLists.txt
#	examples/CMakeLists.txt
#	ggml/src/ggml-cpu/CMakeLists.txt
#	ggml/src/ggml-cuda/CMakeLists.txt
#	ggml/src/ggml-metal/ggml-metal.m
#	ggml/src/ggml-metal/ggml-metal.metal
#	ggml/src/ggml-sycl/CMakeLists.txt
#	ggml/src/ggml-sycl/backend.hpp
#	ggml/src/ggml-sycl/common.hpp
#	ggml/src/ggml-sycl/ggml-sycl.cpp
#	ggml/src/ggml-sycl/mmvq.cpp
#	ggml/src/ggml-sycl/vecdotq.hpp
#	scripts/compare-llama-bench.py
#	src/CMakeLists.txt
#	src/llama-model.cpp
#	src/llama.cpp
#	tests/test-backend-ops.cpp
#	tests/test-opt.cpp
#	tools/llama-bench/README.md
#	tools/llama-bench/llama-bench.cpp
#	tools/mtmd/CMakeLists.txt
#	tools/mtmd/README.md
#	tools/mtmd/clip.cpp
#	tools/rpc/rpc-server.cpp
#	tools/server/CMakeLists.txt
#	tools/server/README.md
2025-05-13 00:28:35 +08:00
Concedo
40eb3a54c4 rename some toolip texts 2025-05-11 22:50:40 +08:00
Concedo
1eb6d25010 truncate middle instead of end for long strings 2025-05-11 20:26:17 +08:00
Concedo
48c3682c2c improve search 2025-05-10 19:25:26 +08:00
Concedo
50e1064ffe better passthrough handling 2025-05-10 19:11:09 +08:00
Concedo
c4a0b323f0 remove fa restrictions for vulkan 2025-05-09 17:34:14 +08:00
Concedo
b6220669f4 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/docker.yml
#	Makefile
#	examples/CMakeLists.txt
#	ggml/CMakeLists.txt
#	ggml/src/CMakeLists.txt
#	ggml/src/ggml-sycl/common.hpp
#	ggml/src/ggml-sycl/convert.cpp
#	ggml/src/ggml-sycl/convert.hpp
#	ggml/src/ggml-sycl/ggml-sycl.cpp
#	scripts/sync-ggml.last
2025-05-08 23:07:33 +08:00
Concedo
7c5d47f688 multigpu warning only once 2025-05-08 00:55:09 +08:00
Concedo
fa22c1a5a4 fixed cfg scale, but turns out it sucks. embedded aria2c into pyinstaller 2025-05-07 18:30:36 +08:00
Concedo
a5b6f372a3 cfg scale wip 2025-05-07 00:36:00 +08:00
Concedo
0fa435b2a6 Merge commit '9b61acf060' into concedo_experimental
# Conflicts:
#	Makefile
#	docs/multimodal/MobileVLM.md
#	docs/multimodal/glmedge.md
#	docs/multimodal/llava.md
#	docs/multimodal/minicpmo2.6.md
#	docs/multimodal/minicpmv2.5.md
#	docs/multimodal/minicpmv2.6.md
#	requirements/requirements-all.txt
#	tools/mtmd/CMakeLists.txt
#	tools/mtmd/README.md
#	tools/mtmd/android/adb_run.sh
#	tools/mtmd/android/build_64.sh
#	tools/mtmd/clip-quantize-cli.cpp
2025-05-06 23:34:21 +08:00
Concedo
38a8778f24 wip cfg scale 2025-05-06 23:06:25 +08:00
Concedo
13cee48740 embed aria2c for windows, add slowness check with highpriority recommendation (+1 squashed commits)
Squashed commits:

[b9b695217] embed aria2c for windows, add slowness check with highpriority recommendation (+1 squashed commits)

Squashed commits:

[90b5d389d] embed aria2c for windows, add slowness check with highpriority recommendation (+1 squashed commits)

Squashed commits:

[fbbaa989f] embed aria2c for windows
2025-05-06 18:56:02 +08:00
Concedo
f59b5eb561 added toggle for guidance 2025-05-05 22:21:46 +08:00
Concedo
1228f91ccb even better comfyui handling, dynamic node ids 2025-05-03 11:21:22 +08:00
Concedo
6cb36ce1ae better zenity checks for multilingual 2025-05-03 10:09:47 +08:00
Concedo
423a68c45d multipart downloading up to 9 parts 2025-05-02 22:34:20 +08:00
Concedo
d8f1f73dd7 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/build-linux-cross.yml
#	.github/workflows/build.yml
#	cmake/build-info.cmake
#	common/CMakeLists.txt
#	examples/llava/README.md
#	examples/server/README.md
#	ggml/CMakeLists.txt
#	ggml/src/ggml-cuda/CMakeLists.txt
#	ggml/src/ggml-rpc/ggml-rpc.cpp
#	ggml/src/ggml-vulkan/CMakeLists.txt
#	ggml/src/ggml-vulkan/vulkan-shaders/CMakeLists.txt
#	scripts/sync-ggml.last
#	tests/test-backend-ops.cpp
#	tests/test-chat-template.cpp
2025-05-02 16:54:15 +08:00
Concedo
7694cf9bfb fix rope bug (+1 squashed commits)
Squashed commits:

[5bf69efe0] fix rope bug
2025-05-02 16:35:01 +08:00
Concedo
bc452da452 improved comfyui compatibility, tweaked hf search 2025-05-02 16:18:31 +08:00
Concedo
803b3e1070 added HF model search tool (+1 squashed commits)
Squashed commits:

[cbd925d59] added HF model search tool
2025-05-02 11:44:01 +08:00
Concedo
80da6af931 wip comfyui basic websocket 2025-05-02 01:25:28 +08:00
Concedo
449382d4df use --file for yad 2025-05-01 23:48:23 +08:00
Concedo
fc255cf50c fixed null stop 2025-05-01 17:07:17 +08:00
Concedo
ed938a2fc6 increase defaultgemamt range 2025-04-30 23:13:55 +08:00
Concedo
fda682fa12 updated lite 2025-04-30 19:49:54 +08:00
Concedo
621cc8f33f think tags handling fixed 2025-04-30 14:18:37 +08:00
Concedo
c2802af9e8 fix qwen3, fixed sd, fixed glm4 2025-04-29 20:50:46 +08:00
Concedo
e659cadf48 more sanitization for user inputs 2025-04-28 15:01:50 +08:00
Concedo
a9bc1a2ee2 do not use shell true instead 2025-04-28 14:26:55 +08:00
Concedo
ca281bd5ba fix sanity check 2025-04-28 00:00:07 +08:00
Concedo
5fa9e02bc3 add debugging info to zenity check 2025-04-27 23:48:23 +08:00
Concedo
4dcd215b27 handle explicit null 2025-04-26 13:06:38 +08:00
Concedo
cb1c182673 add more warmup (+1 squashed commits)
Squashed commits:

[9578d5352] updated lite
2025-04-26 10:22:09 +08:00
kallewoof
7cb815b727 AutoGuess: GLM-4 (#1502)
* AutoGuess: GLM-4

* add 'chat_start' field to adapters

* GLM-4 fix
2025-04-26 08:47:42 +08:00
Concedo
5e87c04056 improved memory estimation (+2 squashed commit)
Squashed commit:

[3319540f9] mem estimation

[43bad21db] mem estimation
2025-04-26 02:03:09 +08:00
Concedo
6b6597ebf1 allow for single token prompt processing (actual batch size 1) 2025-04-25 16:54:46 +08:00
Concedo
25e747e9d8 up version 2025-04-24 18:44:29 +08:00
Concedo
3e8b84b8e5 added support for structured output in chat completions 2025-04-22 22:23:36 +08:00