Concedo
|
5ccd4b2bf5
|
horde default max ctx matches main ctx
|
2025-05-15 10:26:20 +08:00 |
|
Concedo
|
c5ea7fad93
|
updated lite, only show processed input in debugmode
|
2025-05-14 17:46:54 +08:00 |
|
Concedo
|
21e31e255b
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# .github/workflows/build.yml
# .github/workflows/docker.yml
# README.md
# build-xcframework.sh
# common/CMakeLists.txt
# examples/CMakeLists.txt
# ggml/src/ggml-cpu/CMakeLists.txt
# ggml/src/ggml-cuda/CMakeLists.txt
# ggml/src/ggml-metal/ggml-metal.m
# ggml/src/ggml-metal/ggml-metal.metal
# ggml/src/ggml-sycl/CMakeLists.txt
# ggml/src/ggml-sycl/backend.hpp
# ggml/src/ggml-sycl/common.hpp
# ggml/src/ggml-sycl/ggml-sycl.cpp
# ggml/src/ggml-sycl/mmvq.cpp
# ggml/src/ggml-sycl/vecdotq.hpp
# scripts/compare-llama-bench.py
# src/CMakeLists.txt
# src/llama-model.cpp
# src/llama.cpp
# tests/test-backend-ops.cpp
# tests/test-opt.cpp
# tools/llama-bench/README.md
# tools/llama-bench/llama-bench.cpp
# tools/mtmd/CMakeLists.txt
# tools/mtmd/README.md
# tools/mtmd/clip.cpp
# tools/rpc/rpc-server.cpp
# tools/server/CMakeLists.txt
# tools/server/README.md
|
2025-05-13 00:28:35 +08:00 |
|
Concedo
|
40eb3a54c4
|
rename some toolip texts
|
2025-05-11 22:50:40 +08:00 |
|
Concedo
|
1eb6d25010
|
truncate middle instead of end for long strings
|
2025-05-11 20:26:17 +08:00 |
|
Concedo
|
48c3682c2c
|
improve search
|
2025-05-10 19:25:26 +08:00 |
|
Concedo
|
50e1064ffe
|
better passthrough handling
|
2025-05-10 19:11:09 +08:00 |
|
Concedo
|
c4a0b323f0
|
remove fa restrictions for vulkan
|
2025-05-09 17:34:14 +08:00 |
|
Concedo
|
b6220669f4
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# .github/workflows/docker.yml
# Makefile
# examples/CMakeLists.txt
# ggml/CMakeLists.txt
# ggml/src/CMakeLists.txt
# ggml/src/ggml-sycl/common.hpp
# ggml/src/ggml-sycl/convert.cpp
# ggml/src/ggml-sycl/convert.hpp
# ggml/src/ggml-sycl/ggml-sycl.cpp
# scripts/sync-ggml.last
|
2025-05-08 23:07:33 +08:00 |
|
Concedo
|
7c5d47f688
|
multigpu warning only once
|
2025-05-08 00:55:09 +08:00 |
|
Concedo
|
fa22c1a5a4
|
fixed cfg scale, but turns out it sucks. embedded aria2c into pyinstaller
|
2025-05-07 18:30:36 +08:00 |
|
Concedo
|
a5b6f372a3
|
cfg scale wip
|
2025-05-07 00:36:00 +08:00 |
|
Concedo
|
0fa435b2a6
|
Merge commit '9b61acf060 ' into concedo_experimental
# Conflicts:
# Makefile
# docs/multimodal/MobileVLM.md
# docs/multimodal/glmedge.md
# docs/multimodal/llava.md
# docs/multimodal/minicpmo2.6.md
# docs/multimodal/minicpmv2.5.md
# docs/multimodal/minicpmv2.6.md
# requirements/requirements-all.txt
# tools/mtmd/CMakeLists.txt
# tools/mtmd/README.md
# tools/mtmd/android/adb_run.sh
# tools/mtmd/android/build_64.sh
# tools/mtmd/clip-quantize-cli.cpp
|
2025-05-06 23:34:21 +08:00 |
|
Concedo
|
38a8778f24
|
wip cfg scale
|
2025-05-06 23:06:25 +08:00 |
|
Concedo
|
13cee48740
|
embed aria2c for windows, add slowness check with highpriority recommendation (+1 squashed commits)
Squashed commits:
[b9b695217] embed aria2c for windows, add slowness check with highpriority recommendation (+1 squashed commits)
Squashed commits:
[90b5d389d] embed aria2c for windows, add slowness check with highpriority recommendation (+1 squashed commits)
Squashed commits:
[fbbaa989f] embed aria2c for windows
|
2025-05-06 18:56:02 +08:00 |
|
Concedo
|
f59b5eb561
|
added toggle for guidance
|
2025-05-05 22:21:46 +08:00 |
|
Concedo
|
1228f91ccb
|
even better comfyui handling, dynamic node ids
|
2025-05-03 11:21:22 +08:00 |
|
Concedo
|
6cb36ce1ae
|
better zenity checks for multilingual
|
2025-05-03 10:09:47 +08:00 |
|
Concedo
|
423a68c45d
|
multipart downloading up to 9 parts
|
2025-05-02 22:34:20 +08:00 |
|
Concedo
|
d8f1f73dd7
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# .github/workflows/build-linux-cross.yml
# .github/workflows/build.yml
# cmake/build-info.cmake
# common/CMakeLists.txt
# examples/llava/README.md
# examples/server/README.md
# ggml/CMakeLists.txt
# ggml/src/ggml-cuda/CMakeLists.txt
# ggml/src/ggml-rpc/ggml-rpc.cpp
# ggml/src/ggml-vulkan/CMakeLists.txt
# ggml/src/ggml-vulkan/vulkan-shaders/CMakeLists.txt
# scripts/sync-ggml.last
# tests/test-backend-ops.cpp
# tests/test-chat-template.cpp
|
2025-05-02 16:54:15 +08:00 |
|
Concedo
|
7694cf9bfb
|
fix rope bug (+1 squashed commits)
Squashed commits:
[5bf69efe0] fix rope bug
|
2025-05-02 16:35:01 +08:00 |
|
Concedo
|
bc452da452
|
improved comfyui compatibility, tweaked hf search
|
2025-05-02 16:18:31 +08:00 |
|
Concedo
|
803b3e1070
|
added HF model search tool (+1 squashed commits)
Squashed commits:
[cbd925d59] added HF model search tool
|
2025-05-02 11:44:01 +08:00 |
|
Concedo
|
80da6af931
|
wip comfyui basic websocket
|
2025-05-02 01:25:28 +08:00 |
|
Concedo
|
449382d4df
|
use --file for yad
|
2025-05-01 23:48:23 +08:00 |
|
Concedo
|
fc255cf50c
|
fixed null stop
|
2025-05-01 17:07:17 +08:00 |
|
Concedo
|
ed938a2fc6
|
increase defaultgemamt range
|
2025-04-30 23:13:55 +08:00 |
|
Concedo
|
fda682fa12
|
updated lite
|
2025-04-30 19:49:54 +08:00 |
|
Concedo
|
621cc8f33f
|
think tags handling fixed
|
2025-04-30 14:18:37 +08:00 |
|
Concedo
|
c2802af9e8
|
fix qwen3, fixed sd, fixed glm4
|
2025-04-29 20:50:46 +08:00 |
|
Concedo
|
e659cadf48
|
more sanitization for user inputs
|
2025-04-28 15:01:50 +08:00 |
|
Concedo
|
a9bc1a2ee2
|
do not use shell true instead
|
2025-04-28 14:26:55 +08:00 |
|
Concedo
|
ca281bd5ba
|
fix sanity check
|
2025-04-28 00:00:07 +08:00 |
|
Concedo
|
5fa9e02bc3
|
add debugging info to zenity check
|
2025-04-27 23:48:23 +08:00 |
|
Concedo
|
4dcd215b27
|
handle explicit null
|
2025-04-26 13:06:38 +08:00 |
|
Concedo
|
cb1c182673
|
add more warmup (+1 squashed commits)
Squashed commits:
[9578d5352] updated lite
|
2025-04-26 10:22:09 +08:00 |
|
kallewoof
|
7cb815b727
|
AutoGuess: GLM-4 (#1502)
* AutoGuess: GLM-4
* add 'chat_start' field to adapters
* GLM-4 fix
|
2025-04-26 08:47:42 +08:00 |
|
Concedo
|
5e87c04056
|
improved memory estimation (+2 squashed commit)
Squashed commit:
[3319540f9] mem estimation
[43bad21db] mem estimation
|
2025-04-26 02:03:09 +08:00 |
|
Concedo
|
6b6597ebf1
|
allow for single token prompt processing (actual batch size 1)
|
2025-04-25 16:54:46 +08:00 |
|
Concedo
|
25e747e9d8
|
up version
|
2025-04-24 18:44:29 +08:00 |
|
Concedo
|
3e8b84b8e5
|
added support for structured output in chat completions
|
2025-04-22 22:23:36 +08:00 |
|
Concedo
|
e8b3aeaa28
|
update some defaults for max length and max ctx
|
2025-04-22 15:47:01 +08:00 |
|
Concedo
|
6dbee2f2f8
|
more robust glslc checks, increase default denoise str
|
2025-04-22 15:19:47 +08:00 |
|
Concedo
|
6494dce405
|
handle estimation for multipart gguf (+1 squashed commits)
Squashed commits:
[c7b4af92] handle estimation for multipart gguf
|
2025-04-21 22:07:22 +08:00 |
|
Concedo
|
9cd6a1add2
|
allow mmproj to be run on cpu
|
2025-04-21 21:03:10 +08:00 |
|
Concedo
|
f968079290
|
randomize image names to prevent caching in noscript
|
2025-04-21 13:24:40 +08:00 |
|
Concedo
|
2ed6850c0b
|
added override tensor
|
2025-04-20 20:56:17 +08:00 |
|
Concedo
|
75dfad2bb0
|
fixed noscript (+1 squashed commits)
Squashed commits:
[dba28399] fixed noscript
|
2025-04-19 23:16:08 +08:00 |
|
Concedo
|
12c2efdadd
|
noscript image gen
|
2025-04-19 18:56:52 +08:00 |
|
Concedo
|
305e533dc6
|
i already knew zenity would cause issues
|
2025-04-19 13:04:41 +08:00 |
|