Concedo
|
6494dce405
|
handle estimation for multipart gguf (+1 squashed commits)
Squashed commits:
[c7b4af92] handle estimation for multipart gguf
|
2025-04-21 22:07:22 +08:00 |
|
Concedo
|
9cd6a1add2
|
allow mmproj to be run on cpu
|
2025-04-21 21:03:10 +08:00 |
|
Concedo
|
f968079290
|
randomize image names to prevent caching in noscript
|
2025-04-21 13:24:40 +08:00 |
|
Concedo
|
2ed6850c0b
|
added override tensor
|
2025-04-20 20:56:17 +08:00 |
|
Concedo
|
75dfad2bb0
|
fixed noscript (+1 squashed commits)
Squashed commits:
[dba28399] fixed noscript
|
2025-04-19 23:16:08 +08:00 |
|
Concedo
|
12c2efdadd
|
noscript image gen
|
2025-04-19 18:56:52 +08:00 |
|
Concedo
|
305e533dc6
|
i already knew zenity would cause issues
|
2025-04-19 13:04:41 +08:00 |
|
Concedo
|
78a910be26
|
noscript chat mode tweaks
|
2025-04-19 12:40:13 +08:00 |
|
Concedo
|
a5b5d21cca
|
added chat mode to noscript
|
2025-04-19 00:59:00 +08:00 |
|
Concedo
|
5d57d62665
|
add a timeout for zenity check
|
2025-04-18 13:07:26 +08:00 |
|
Concedo
|
1a09d9cf0e
|
increase to 10 save slots
|
2025-04-18 11:30:32 +08:00 |
|
Concedo
|
c67510718e
|
kv override option (+1 squashed commits)
Squashed commits:
[e615fc01] kv override option
|
2025-04-17 14:22:30 +08:00 |
|
Concedo
|
06159939d9
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# .github/workflows/build.yml
# Makefile
# docs/build.md
# examples/rpc/rpc-server.cpp
# examples/sycl/build.sh
# ggml/CMakeLists.txt
# ggml/src/ggml-cann/aclnn_ops.cpp
# ggml/src/ggml-cann/ggml-cann.cpp
# ggml/src/ggml-hip/CMakeLists.txt
# scripts/sync-ggml.last
|
2025-04-17 00:52:37 +08:00 |
|
Concedo
|
fbf039966c
|
debugmode has debug in cli
|
2025-04-15 23:42:46 +08:00 |
|
Concedo
|
c168b063e5
|
cli fix
|
2025-04-15 23:30:18 +08:00 |
|
Concedo
|
fe12b1cbd4
|
fixed lora, now works quanted too
|
2025-04-14 23:44:42 +08:00 |
|
Concedo
|
6bc2ca4803
|
added more sanity checks on zenity
|
2025-04-14 15:06:08 +08:00 |
|
Concedo
|
3d31d75c8f
|
clamp and display detected GPU memory
|
2025-04-14 14:19:23 +08:00 |
|
Concedo
|
a6149ad0fc
|
fixed g3 adapter back
|
2025-04-12 23:17:54 +08:00 |
|
Concedo
|
6302709fbb
|
discourage but dont prevent vulkan FA (it's occasionally still useful)
|
2025-04-12 16:23:52 +08:00 |
|
Concedo
|
5908f2ca19
|
based on occam and henky advice, disabled flash attention entirely on vulkan.
|
2025-04-12 12:30:48 +08:00 |
|
Concedo
|
7a7bdeab6d
|
json to gbnf endpoint added
|
2025-04-12 11:41:11 +08:00 |
|
Concedo
|
a56cc72bd0
|
added handling for remembering file paths, added gui option to disable zenity in GUI
|
2025-04-12 00:42:26 +08:00 |
|
henk717
|
f6b7fea979
|
zentk - folder select workaround (#1478)
* zentk - folder select workaround
* kcppt extention fix
|
2025-04-11 22:37:07 +08:00 |
|
henk717
|
8fd70f37bd
|
Zentk integration (Zenity/yad support) (#1475)
* Zentk integration (Zenity/yad support)
* Escape incompatible dependencies in zentk
* Properly clean env
|
2025-04-11 18:23:23 +08:00 |
|
Concedo
|
27f575dc83
|
inpaining support completed, invert mask added
|
2025-04-09 23:50:17 +08:00 |
|
Concedo
|
23339ace9b
|
inpainting works in kcpp!
|
2025-04-09 23:01:05 +08:00 |
|
Concedo
|
26e1653255
|
fixed templates not setting gpu when swapped with admin mode
|
2025-04-08 21:45:18 +08:00 |
|
Concedo
|
b99ee451f8
|
Merge commit '4ccea213bc ' into concedo_experimental
# Conflicts:
# .devops/cpu.Dockerfile
# .devops/cuda.Dockerfile
# .devops/intel.Dockerfile
# .devops/musa.Dockerfile
# .devops/rocm.Dockerfile
# .github/workflows/bench.yml.disabled
# .github/workflows/build.yml
# .github/workflows/server.yml
# CMakeLists.txt
# build-xcframework.sh
# ci/run.sh
# common/CMakeLists.txt
# examples/llama.android/llama/build.gradle.kts
# examples/perplexity/perplexity.cpp
# examples/run/CMakeLists.txt
# examples/server/tests/README.md
# examples/sycl/win-build-sycl.bat
# ggml/src/ggml-cann/aclnn_ops.cpp
# ggml/src/ggml-cann/aclnn_ops.h
# ggml/src/ggml-cpu/CMakeLists.txt
# ggml/src/ggml-cpu/ggml-cpu.c
# licenses/LICENSE-linenoise
# scripts/sync-ggml.last
# tests/CMakeLists.txt
|
2025-04-08 21:26:23 +08:00 |
|
Concedo
|
8e23a087e7
|
updated readme, memory detection prints
|
2025-04-08 20:23:52 +08:00 |
|
Concedo
|
11c4e7c2c4
|
automatic memory detection for vulkan
|
2025-04-07 22:56:12 +08:00 |
|
Concedo
|
11f993ca10
|
added flag to adjust max request size
|
2025-04-06 00:13:00 +08:00 |
|
Concedo
|
65cd25d3a1
|
some data sanitization
|
2025-04-05 18:42:26 +08:00 |
|
Concedo
|
93a226d9e4
|
added prefix for llava, reverted system role in template as it degreaded gemma3. truncated debug logs
|
2025-04-05 18:06:41 +08:00 |
|
Concedo
|
34ddd874fe
|
try containerized ci (+3 squashed commit)
Squashed commit:
[f0600744 ] troubleshooting
[fe11073c ] cap auto threads at 32 due to diminishing returns
[0c7f8a1d ] troubleshooting
|
2025-04-05 01:51:03 +08:00 |
|
Concedo
|
c48a4a73d4
|
try fix file open
|
2025-04-04 14:38:17 +08:00 |
|
Concedo
|
43e9b049d6
|
another silly bug silly silly silly (tavern)
|
2025-04-04 14:16:42 +08:00 |
|
Concedo
|
07a96d63fa
|
try to ensure correct file extension
|
2025-04-03 20:13:53 +08:00 |
|
Concedo
|
6e086bd309
|
fixed savedatafile bug, try remove unneeded old clblast code path
|
2025-04-03 19:11:27 +08:00 |
|
Concedo
|
7f1003be44
|
warning for max tokens being too high
|
2025-04-02 18:58:38 +08:00 |
|
Concedo
|
fbf5c04c3c
|
silly me
|
2025-04-02 00:51:05 +08:00 |
|
Concedo
|
30e3d24ead
|
embd include name
|
2025-04-02 00:40:38 +08:00 |
|
Concedo
|
e37f27632f
|
clear cpu flag manually for templates, added truncation for embeddings
|
2025-04-02 00:18:30 +08:00 |
|
Concedo
|
0fd94e19f3
|
made tool calls more robust and allowed tool call template customization
|
2025-04-01 19:16:45 +08:00 |
|
henk717
|
4291e1575b
|
Fix tool spec, this spec is kinda.... (#1458)
|
2025-04-01 10:39:02 +08:00 |
|
Concedo
|
c0adaabfa4
|
Revert "try fix owui"
This reverts commit 12e5b8abdb .
|
2025-04-01 00:27:31 +08:00 |
|
Concedo
|
12e5b8abdb
|
try fix owui
|
2025-04-01 00:23:45 +08:00 |
|
Concedo
|
0ed95fcccc
|
fixed l3 template, add index
|
2025-03-31 23:59:06 +08:00 |
|
Concedo
|
1ebadc515e
|
add streaming support for oai tools (+2 squashed commit)
Squashed commit:
[4d080b37] qwen2.5vl surgery script
[4bebe7e5] add streaming support for oai tools
|
2025-03-31 16:49:15 +08:00 |
|
henk717
|
091eb367fc
|
More robust tool calling prompt (#1455)
* More robust tool checking prompt
* Inform UI we want a tool
|
2025-03-31 14:43:03 +08:00 |
|