Concedo
|
e68a5f448c
|
add ddim sampler
|
2025-05-22 21:28:01 +08:00 |
|
Concedo
|
f125e724eb
|
fix off-by-one npast during some instances of fast forwarding
|
2025-05-22 19:51:21 +08:00 |
|
Concedo
|
440350327c
|
set random range for seed
|
2025-05-21 23:47:18 +08:00 |
|
Wagner Bruna
|
5d0cfc9db3
|
store on the image the actual random seed, for reproducibility (#1549)
|
2025-05-21 23:40:47 +08:00 |
|
Concedo
|
8b6dfbd1be
|
disabling the gMask prefix for glm-4 completions
|
2025-05-21 17:29:24 +08:00 |
|
Concedo
|
49305942ab
|
try disabling the gMask prefix for glm-4 completions
|
2025-05-21 16:47:08 +08:00 |
|
Concedo
|
5f4923bf24
|
backend tag replacement for endtags. view results with debug mode.
|
2025-05-19 23:14:43 +08:00 |
|
Concedo
|
710c747b60
|
minor noscript edit
|
2025-05-19 17:51:44 +08:00 |
|
Concedo
|
c546cb638e
|
disable showgui if skiplauncher is used
|
2025-05-18 01:42:14 +08:00 |
|
Concedo
|
ca4274e384
|
added size info into HF searcher
|
2025-05-17 00:31:54 +08:00 |
|
Concedo
|
5ccd4b2bf5
|
horde default max ctx matches main ctx
|
2025-05-15 10:26:20 +08:00 |
|
Concedo
|
c5ea7fad93
|
updated lite, only show processed input in debugmode
|
2025-05-14 17:46:54 +08:00 |
|
Concedo
|
21e31e255b
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# .github/workflows/build.yml
# .github/workflows/docker.yml
# README.md
# build-xcframework.sh
# common/CMakeLists.txt
# examples/CMakeLists.txt
# ggml/src/ggml-cpu/CMakeLists.txt
# ggml/src/ggml-cuda/CMakeLists.txt
# ggml/src/ggml-metal/ggml-metal.m
# ggml/src/ggml-metal/ggml-metal.metal
# ggml/src/ggml-sycl/CMakeLists.txt
# ggml/src/ggml-sycl/backend.hpp
# ggml/src/ggml-sycl/common.hpp
# ggml/src/ggml-sycl/ggml-sycl.cpp
# ggml/src/ggml-sycl/mmvq.cpp
# ggml/src/ggml-sycl/vecdotq.hpp
# scripts/compare-llama-bench.py
# src/CMakeLists.txt
# src/llama-model.cpp
# src/llama.cpp
# tests/test-backend-ops.cpp
# tests/test-opt.cpp
# tools/llama-bench/README.md
# tools/llama-bench/llama-bench.cpp
# tools/mtmd/CMakeLists.txt
# tools/mtmd/README.md
# tools/mtmd/clip.cpp
# tools/rpc/rpc-server.cpp
# tools/server/CMakeLists.txt
# tools/server/README.md
|
2025-05-13 00:28:35 +08:00 |
|
Concedo
|
40eb3a54c4
|
rename some toolip texts
|
2025-05-11 22:50:40 +08:00 |
|
Concedo
|
1eb6d25010
|
truncate middle instead of end for long strings
|
2025-05-11 20:26:17 +08:00 |
|
Concedo
|
48c3682c2c
|
improve search
|
2025-05-10 19:25:26 +08:00 |
|
Concedo
|
50e1064ffe
|
better passthrough handling
|
2025-05-10 19:11:09 +08:00 |
|
Concedo
|
c4a0b323f0
|
remove fa restrictions for vulkan
|
2025-05-09 17:34:14 +08:00 |
|
Concedo
|
b6220669f4
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# .github/workflows/docker.yml
# Makefile
# examples/CMakeLists.txt
# ggml/CMakeLists.txt
# ggml/src/CMakeLists.txt
# ggml/src/ggml-sycl/common.hpp
# ggml/src/ggml-sycl/convert.cpp
# ggml/src/ggml-sycl/convert.hpp
# ggml/src/ggml-sycl/ggml-sycl.cpp
# scripts/sync-ggml.last
|
2025-05-08 23:07:33 +08:00 |
|
Concedo
|
7c5d47f688
|
multigpu warning only once
|
2025-05-08 00:55:09 +08:00 |
|
Concedo
|
fa22c1a5a4
|
fixed cfg scale, but turns out it sucks. embedded aria2c into pyinstaller
|
2025-05-07 18:30:36 +08:00 |
|
Concedo
|
a5b6f372a3
|
cfg scale wip
|
2025-05-07 00:36:00 +08:00 |
|
Concedo
|
0fa435b2a6
|
Merge commit '9b61acf060 ' into concedo_experimental
# Conflicts:
# Makefile
# docs/multimodal/MobileVLM.md
# docs/multimodal/glmedge.md
# docs/multimodal/llava.md
# docs/multimodal/minicpmo2.6.md
# docs/multimodal/minicpmv2.5.md
# docs/multimodal/minicpmv2.6.md
# requirements/requirements-all.txt
# tools/mtmd/CMakeLists.txt
# tools/mtmd/README.md
# tools/mtmd/android/adb_run.sh
# tools/mtmd/android/build_64.sh
# tools/mtmd/clip-quantize-cli.cpp
|
2025-05-06 23:34:21 +08:00 |
|
Concedo
|
38a8778f24
|
wip cfg scale
|
2025-05-06 23:06:25 +08:00 |
|
Concedo
|
13cee48740
|
embed aria2c for windows, add slowness check with highpriority recommendation (+1 squashed commits)
Squashed commits:
[b9b695217] embed aria2c for windows, add slowness check with highpriority recommendation (+1 squashed commits)
Squashed commits:
[90b5d389d] embed aria2c for windows, add slowness check with highpriority recommendation (+1 squashed commits)
Squashed commits:
[fbbaa989f] embed aria2c for windows
|
2025-05-06 18:56:02 +08:00 |
|
Concedo
|
f59b5eb561
|
added toggle for guidance
|
2025-05-05 22:21:46 +08:00 |
|
Concedo
|
1228f91ccb
|
even better comfyui handling, dynamic node ids
|
2025-05-03 11:21:22 +08:00 |
|
Concedo
|
6cb36ce1ae
|
better zenity checks for multilingual
|
2025-05-03 10:09:47 +08:00 |
|
Concedo
|
423a68c45d
|
multipart downloading up to 9 parts
|
2025-05-02 22:34:20 +08:00 |
|
Concedo
|
d8f1f73dd7
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# .github/workflows/build-linux-cross.yml
# .github/workflows/build.yml
# cmake/build-info.cmake
# common/CMakeLists.txt
# examples/llava/README.md
# examples/server/README.md
# ggml/CMakeLists.txt
# ggml/src/ggml-cuda/CMakeLists.txt
# ggml/src/ggml-rpc/ggml-rpc.cpp
# ggml/src/ggml-vulkan/CMakeLists.txt
# ggml/src/ggml-vulkan/vulkan-shaders/CMakeLists.txt
# scripts/sync-ggml.last
# tests/test-backend-ops.cpp
# tests/test-chat-template.cpp
|
2025-05-02 16:54:15 +08:00 |
|
Concedo
|
7694cf9bfb
|
fix rope bug (+1 squashed commits)
Squashed commits:
[5bf69efe0] fix rope bug
|
2025-05-02 16:35:01 +08:00 |
|
Concedo
|
bc452da452
|
improved comfyui compatibility, tweaked hf search
|
2025-05-02 16:18:31 +08:00 |
|
Concedo
|
803b3e1070
|
added HF model search tool (+1 squashed commits)
Squashed commits:
[cbd925d59] added HF model search tool
|
2025-05-02 11:44:01 +08:00 |
|
Concedo
|
80da6af931
|
wip comfyui basic websocket
|
2025-05-02 01:25:28 +08:00 |
|
Concedo
|
449382d4df
|
use --file for yad
|
2025-05-01 23:48:23 +08:00 |
|
Concedo
|
fc255cf50c
|
fixed null stop
|
2025-05-01 17:07:17 +08:00 |
|
Concedo
|
ed938a2fc6
|
increase defaultgemamt range
|
2025-04-30 23:13:55 +08:00 |
|
Concedo
|
fda682fa12
|
updated lite
|
2025-04-30 19:49:54 +08:00 |
|
Concedo
|
621cc8f33f
|
think tags handling fixed
|
2025-04-30 14:18:37 +08:00 |
|
Concedo
|
c2802af9e8
|
fix qwen3, fixed sd, fixed glm4
|
2025-04-29 20:50:46 +08:00 |
|
Concedo
|
e659cadf48
|
more sanitization for user inputs
|
2025-04-28 15:01:50 +08:00 |
|
Concedo
|
a9bc1a2ee2
|
do not use shell true instead
|
2025-04-28 14:26:55 +08:00 |
|
Concedo
|
ca281bd5ba
|
fix sanity check
|
2025-04-28 00:00:07 +08:00 |
|
Concedo
|
5fa9e02bc3
|
add debugging info to zenity check
|
2025-04-27 23:48:23 +08:00 |
|
Concedo
|
4dcd215b27
|
handle explicit null
|
2025-04-26 13:06:38 +08:00 |
|
Concedo
|
cb1c182673
|
add more warmup (+1 squashed commits)
Squashed commits:
[9578d5352] updated lite
|
2025-04-26 10:22:09 +08:00 |
|
kallewoof
|
7cb815b727
|
AutoGuess: GLM-4 (#1502)
* AutoGuess: GLM-4
* add 'chat_start' field to adapters
* GLM-4 fix
|
2025-04-26 08:47:42 +08:00 |
|
Concedo
|
5e87c04056
|
improved memory estimation (+2 squashed commit)
Squashed commit:
[3319540f9] mem estimation
[43bad21db] mem estimation
|
2025-04-26 02:03:09 +08:00 |
|
Concedo
|
6b6597ebf1
|
allow for single token prompt processing (actual batch size 1)
|
2025-04-25 16:54:46 +08:00 |
|
Concedo
|
25e747e9d8
|
up version
|
2025-04-24 18:44:29 +08:00 |
|