Concedo
e9d2332dd8
improved tool calls and whisper
2024-12-06 14:34:31 +08:00
Concedo
2ba5949054
updated sdcpp, also set euler as default sampler
2024-12-01 17:00:20 +08:00
Concedo
409e393d10
fixed critical bug in image model loader
2024-11-30 23:28:24 +08:00
Concedo
f75bbb945f
speculative decoding initial impl completed (+6 squashed commit)
...
Squashed commit:
[0a6306ca0] draft wip dont use (will be squashed)
[a758a1c9c] wip dont use (will be squashed)
[e1994d3ce] wip dont use
[f59690d68] wip
[77228147d] wip on spec decoding. dont use yet
[2445bca54] wip adding speculative decoding (+1 squashed commits)
Squashed commits:
[50e341bb7] wip adding speculative decoding
2024-11-30 10:41:10 +08:00
Concedo
116879144c
better error messages
2024-11-23 18:55:01 +08:00
Concedo
70aee82552
attempts a backflip, but does he stick the landing?
2024-11-16 17:05:45 +08:00
Concedo
8a7d53d838
declutter sdcpp adapter removed useless string literals
2024-11-14 00:27:39 +08:00
Concedo
dd95f88c19
add e5m2 support for use in Kobo, also made a separate contribution PR https://github.com/leejet/stable-diffusion.cpp/pull/460
2024-11-13 11:35:49 +08:00
Concedo
3813f6c517
added new flag nofastforward allowing users to disable fast forwarding
2024-11-13 10:59:01 +08:00
bebopkim
bb6c8bc095
Correct compilation errors on macOS Metal ( #1204 )
2024-11-08 11:42:31 +08:00
Concedo
262437f393
fallback flux loader
2024-11-07 15:55:43 +08:00
Concedo
ccbd630a42
allow custom t5, clipl and clipg
2024-11-06 19:05:48 +08:00
Concedo
3cfc4dc581
avoid euler a for flux (+4 squashed commit)
...
Squashed commit:
[5a4b72385] fix cuda build
[5f969a645] add vulkan information
[6849e7398] fixed flux
[740e80419] update readme
2024-11-05 22:50:14 +08:00
Concedo
5b90eeaf17
fixed sd to work on larger images by adding tiling, also limit res for sd1.5
2024-11-04 23:26:15 +08:00
Concedo
847689e74c
fixed incorrect makefile flags
2024-11-04 20:39:10 +08:00
Concedo
bb13925f39
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# CMakePresets.json
# Makefile
# Package.swift
# ci/run.sh
# common/CMakeLists.txt
# examples/CMakeLists.txt
# flake.lock
# ggml/src/CMakeLists.txt
# ggml/src/ggml-backend.cpp
# ggml/src/ggml.c
# pocs/vdot/q8dot.cpp
# pocs/vdot/vdot.cpp
# tests/test-backend-ops.cpp
# tests/test-grad0.cpp
# tests/test-quantize-fns.cpp
# tests/test-quantize-perf.cpp
# tests/test-rope.cpp
2024-11-04 16:54:53 +08:00
Concedo
5233e8ed1d
sd 3.5 medium
2024-11-03 23:27:06 +08:00
Concedo
f32a874966
resync and updated sdcpp for flux and sd3 support
2024-11-03 22:03:16 +08:00
Concedo
6da5a63852
fix for uploaded wav files being incomplete due to fragmentation when converting to b64
2024-10-20 17:47:19 +08:00
Concedo
e692a79aab
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .github/workflows/docker.yml
# CMakeLists.txt
# CONTRIBUTING.md
# docs/android.md
# docs/docker.md
# examples/embedding/embedding.cpp
# examples/imatrix/imatrix.cpp
# examples/infill/infill.cpp
# examples/llama-bench/llama-bench.cpp
# examples/main/README.md
# examples/parallel/parallel.cpp
# examples/perplexity/perplexity.cpp
# examples/quantize-stats/quantize-stats.cpp
# examples/save-load-state/save-load-state.cpp
# examples/server/README.md
# examples/simple/CMakeLists.txt
# examples/speculative/speculative.cpp
# flake.lock
# ggml/src/CMakeLists.txt
# ggml/src/ggml-blas.cpp
# pocs/vdot/q8dot.cpp
# pocs/vdot/vdot.cpp
# scripts/debug-test.sh
# scripts/sync-ggml.last
# src/llama.cpp
# tests/test-backend-ops.cpp
# tests/test-chat-template.cpp
# tests/test-quantize-fns.cpp
# tests/test-quantize-perf.cpp
# tests/test-tokenizer-0.cpp
# tests/test-tokenizer-1-bpe.cpp
# tests/test-tokenizer-1-spm.cpp
2024-10-11 11:59:59 +08:00
bebopkim
7dac9982f9
Metal: remove ggml_backend_metal_log_set_callback due to backend logging mechanism unificification d6fe7ab
( #1144 )
2024-10-06 14:54:33 +08:00
Concedo
2d57f80af9
Fix compilation on macos
2024-10-03 15:10:30 +08:00
Concedo
0b0e456202
try fix compile issues
2024-09-19 13:56:19 +08:00
Concedo
e44ddf26ef
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .github/workflows/build.yml
# .github/workflows/server.yml
# CMakeLists.txt
# Makefile
# examples/embedding/embedding.cpp
# examples/imatrix/imatrix.cpp
# examples/llama-bench/llama-bench.cpp
# examples/llava/MobileVLM-README.md
# examples/parallel/parallel.cpp
# examples/perplexity/perplexity.cpp
# examples/quantize/CMakeLists.txt
# examples/server/README.md
# examples/speculative/speculative.cpp
# tests/test-backend-ops.cpp
2024-09-13 16:17:24 +08:00
Concedo
eee67281be
move kcpp params out
2024-09-10 16:30:12 +08:00
Concedo
12fd16bfd4
Merge commit ' df270ef745
' into concedo_experimental
...
# Conflicts:
# Makefile
# common/CMakeLists.txt
# common/common.h
# common/sampling.cpp
# common/sampling.h
# examples/infill/infill.cpp
# examples/llama-bench/llama-bench.cpp
# examples/quantize-stats/quantize-stats.cpp
# examples/server/server.cpp
# include/llama.h
# src/llama-sampling.cpp
# src/llama-sampling.h
# src/llama.cpp
# tests/test-grammar-integration.cpp
# tests/test-grammar-parser.cpp
# tests/test-json-schema-to-grammar.cpp
# tests/test-llama-grammar.cpp
# tests/test-sampling.cpp
2024-09-09 17:10:08 +08:00
Concedo
d220495dd4
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .devops/full-cuda.Dockerfile
# .devops/llama-cli-cuda.Dockerfile
# .devops/llama-server-cuda.Dockerfile
# .devops/llama-server-intel.Dockerfile
# .devops/llama-server-rocm.Dockerfile
# .devops/llama-server-vulkan.Dockerfile
# .devops/llama-server.Dockerfile
# .github/workflows/docker.yml
# docs/docker.md
# examples/llama-bench/llama-bench.cpp
# flake.lock
# ggml/include/ggml.h
# ggml/src/CMakeLists.txt
# scripts/sync-ggml.last
# src/llama.cpp
# tests/test-backend-ops.cpp
# tests/test-grad0.cpp
# tests/test-rope.cpp
2024-08-30 10:37:39 +08:00
Concedo
b2c1ff7a13
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .ecrc
# CMakePresets.json
# ci/run.sh
# docs/backend/SYCL.md
# ggml/src/CMakeLists.txt
# src/llama.cpp
# tests/test-backend-ops.cpp
# tests/test-sampling.cpp
2024-08-27 17:46:40 +08:00
Concedo
c61fa9155d
handle oversized images by downscaling
2024-08-26 13:58:18 +08:00
Concedo
7bc87e1f0f
added llava letterboxing feature
2024-08-25 23:15:38 +08:00
Concedo
d71b5477c5
update lite, cleanup, fix interrogate format
2024-08-18 00:48:53 +08:00
Concedo
e1f97f7fb5
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .devops/llama-server.Dockerfile
# README.md
# flake.lock
# ggml/src/ggml-vulkan.cpp
# ggml/src/vulkan-shaders/concat.comp
# ggml/src/vulkan-shaders/pad.comp
# ggml/src/vulkan-shaders/vulkan-shaders-gen.cpp
# scripts/sync-ggml-am.sh
# scripts/sync-ggml.last
# src/llama.cpp
# tests/test-backend-ops.cpp
2024-08-06 16:33:26 +08:00
Concedo
3a72410804
Added vulkan support for SD (+1 squashed commits)
...
Squashed commits:
[13f42f83] Added vulkan support for SD
2024-08-01 17:12:33 +08:00
Concedo
066e7ac540
minor fixes: colab gpu backend, lite bugs, package python file with embd
2024-07-15 17:36:03 +08:00
Concedo
11f0643fa4
fix pyinstallers
2024-06-27 15:19:44 +08:00
Concedo
2f7168779d
Merge branch 'concedo_experimental' of https://github.com/LostRuins/koboldcpp into concedo_experimental
2024-06-06 20:26:57 +08:00
Concedo
1ad56e9b6b
if quiet mode just show transcription event without text
2024-06-06 20:26:47 +08:00
Lexi
1c5e05e477
whisper: fix printf format string ( #894 )
...
This format string uses %d to print uint32_t and size_t{ype,}, which is
not guaranteed to work. Instead, use PRIu32 for uint32_t, and %zu for
size_t.
2024-06-06 19:50:59 +08:00
Concedo
813cf829b5
allow selecting multigpu on vulkan
2024-06-06 18:36:56 +08:00
Concedo
9c7340bfac
downgrade stb image resize
2024-06-02 18:02:22 +08:00
Concedo
b0a7d1aba6
fixed makefile (+1 squashed commits)
...
Squashed commits:
[ef6ddaf5] try fix makefile
2024-06-02 15:21:48 +08:00
Concedo
a65e0800ab
update docs, added gui for whisper
2024-06-01 02:01:49 +08:00
Concedo
961c789c91
wav file resampling
2024-05-30 13:41:58 +08:00
Concedo
f24aef8792
initial whisper integration
2024-05-29 23:13:11 +08:00
Concedo
4ed9ba7352
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .github/workflows/docker.yml
# CMakeLists.txt
# Makefile
# README.md
# flake.lock
# tests/test-backend-ops.cpp
2024-05-28 21:57:19 +08:00
Concedo
2dedea9a74
add to remaining pyinstallers
2024-05-24 16:21:26 +08:00
Concedo
7a5983399b
try to fix lora naming issues
2024-05-15 23:49:01 +08:00
Concedo
08993696c3
try apply lora on load
2024-05-15 22:53:23 +08:00
Concedo
e1e6833a7a
try to fix loras
2024-05-15 16:05:51 +08:00
Concedo
ebbb969526
update lite
2024-05-15 15:00:22 +08:00