Commit graph

493 commits

Author SHA1 Message Date
Concedo
f32a874966 resync and updated sdcpp for flux and sd3 support 2024-11-03 22:03:16 +08:00
Concedo
6da5a63852 fix for uploaded wav files being incomplete due to fragmentation when converting to b64 2024-10-20 17:47:19 +08:00
Concedo
e692a79aab Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/docker.yml
#	CMakeLists.txt
#	CONTRIBUTING.md
#	docs/android.md
#	docs/docker.md
#	examples/embedding/embedding.cpp
#	examples/imatrix/imatrix.cpp
#	examples/infill/infill.cpp
#	examples/llama-bench/llama-bench.cpp
#	examples/main/README.md
#	examples/parallel/parallel.cpp
#	examples/perplexity/perplexity.cpp
#	examples/quantize-stats/quantize-stats.cpp
#	examples/save-load-state/save-load-state.cpp
#	examples/server/README.md
#	examples/simple/CMakeLists.txt
#	examples/speculative/speculative.cpp
#	flake.lock
#	ggml/src/CMakeLists.txt
#	ggml/src/ggml-blas.cpp
#	pocs/vdot/q8dot.cpp
#	pocs/vdot/vdot.cpp
#	scripts/debug-test.sh
#	scripts/sync-ggml.last
#	src/llama.cpp
#	tests/test-backend-ops.cpp
#	tests/test-chat-template.cpp
#	tests/test-quantize-fns.cpp
#	tests/test-quantize-perf.cpp
#	tests/test-tokenizer-0.cpp
#	tests/test-tokenizer-1-bpe.cpp
#	tests/test-tokenizer-1-spm.cpp
2024-10-11 11:59:59 +08:00
bebopkim
7dac9982f9
Metal: remove ggml_backend_metal_log_set_callback due to backend logging mechanism unification d6fe7ab (#1144) 2024-10-06 14:54:33 +08:00
Concedo
2d57f80af9 Fix compilation on macos 2024-10-03 15:10:30 +08:00
Concedo
0b0e456202 try fix compile issues 2024-09-19 13:56:19 +08:00
Concedo
e44ddf26ef Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/build.yml
#	.github/workflows/server.yml
#	CMakeLists.txt
#	Makefile
#	examples/embedding/embedding.cpp
#	examples/imatrix/imatrix.cpp
#	examples/llama-bench/llama-bench.cpp
#	examples/llava/MobileVLM-README.md
#	examples/parallel/parallel.cpp
#	examples/perplexity/perplexity.cpp
#	examples/quantize/CMakeLists.txt
#	examples/server/README.md
#	examples/speculative/speculative.cpp
#	tests/test-backend-ops.cpp
2024-09-13 16:17:24 +08:00
Concedo
eee67281be move kcpp params out 2024-09-10 16:30:12 +08:00
Concedo
12fd16bfd4 Merge commit 'df270ef745' into concedo_experimental
# Conflicts:
#	Makefile
#	common/CMakeLists.txt
#	common/common.h
#	common/sampling.cpp
#	common/sampling.h
#	examples/infill/infill.cpp
#	examples/llama-bench/llama-bench.cpp
#	examples/quantize-stats/quantize-stats.cpp
#	examples/server/server.cpp
#	include/llama.h
#	src/llama-sampling.cpp
#	src/llama-sampling.h
#	src/llama.cpp
#	tests/test-grammar-integration.cpp
#	tests/test-grammar-parser.cpp
#	tests/test-json-schema-to-grammar.cpp
#	tests/test-llama-grammar.cpp
#	tests/test-sampling.cpp
2024-09-09 17:10:08 +08:00
Concedo
d220495dd4 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.devops/full-cuda.Dockerfile
#	.devops/llama-cli-cuda.Dockerfile
#	.devops/llama-server-cuda.Dockerfile
#	.devops/llama-server-intel.Dockerfile
#	.devops/llama-server-rocm.Dockerfile
#	.devops/llama-server-vulkan.Dockerfile
#	.devops/llama-server.Dockerfile
#	.github/workflows/docker.yml
#	docs/docker.md
#	examples/llama-bench/llama-bench.cpp
#	flake.lock
#	ggml/include/ggml.h
#	ggml/src/CMakeLists.txt
#	scripts/sync-ggml.last
#	src/llama.cpp
#	tests/test-backend-ops.cpp
#	tests/test-grad0.cpp
#	tests/test-rope.cpp
2024-08-30 10:37:39 +08:00
Concedo
b2c1ff7a13 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.ecrc
#	CMakePresets.json
#	ci/run.sh
#	docs/backend/SYCL.md
#	ggml/src/CMakeLists.txt
#	src/llama.cpp
#	tests/test-backend-ops.cpp
#	tests/test-sampling.cpp
2024-08-27 17:46:40 +08:00
Concedo
c61fa9155d handle oversized images by downscaling 2024-08-26 13:58:18 +08:00
Concedo
7bc87e1f0f added llava letterboxing feature 2024-08-25 23:15:38 +08:00
Concedo
d71b5477c5 update lite, cleanup, fix interrogate format 2024-08-18 00:48:53 +08:00
Concedo
e1f97f7fb5 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.devops/llama-server.Dockerfile
#	README.md
#	flake.lock
#	ggml/src/ggml-vulkan.cpp
#	ggml/src/vulkan-shaders/concat.comp
#	ggml/src/vulkan-shaders/pad.comp
#	ggml/src/vulkan-shaders/vulkan-shaders-gen.cpp
#	scripts/sync-ggml-am.sh
#	scripts/sync-ggml.last
#	src/llama.cpp
#	tests/test-backend-ops.cpp
2024-08-06 16:33:26 +08:00
Concedo
3a72410804 Added vulkan support for SD (+1 squashed commits)
Squashed commits:

[13f42f83] Added vulkan support for SD
2024-08-01 17:12:33 +08:00
Concedo
066e7ac540 minor fixes: colab gpu backend, lite bugs, package python file with embd 2024-07-15 17:36:03 +08:00
Concedo
11f0643fa4 fix pyinstallers 2024-06-27 15:19:44 +08:00
Concedo
2f7168779d Merge branch 'concedo_experimental' of https://github.com/LostRuins/koboldcpp into concedo_experimental 2024-06-06 20:26:57 +08:00
Concedo
1ad56e9b6b if quiet mode just show transcription event without text 2024-06-06 20:26:47 +08:00
Lexi
1c5e05e477
whisper: fix printf format string (#894)
This format string uses %d to print uint32_t and size_t{ype,}, which is
not guaranteed to work.  Instead, use PRIu32 for uint32_t, and %zu for
size_t.
2024-06-06 19:50:59 +08:00
Concedo
813cf829b5 allow selecting multigpu on vulkan 2024-06-06 18:36:56 +08:00
Concedo
9c7340bfac downgrade stb image resize 2024-06-02 18:02:22 +08:00
Concedo
b0a7d1aba6 fixed makefile (+1 squashed commits)
Squashed commits:

[ef6ddaf5] try fix makefile
2024-06-02 15:21:48 +08:00
Concedo
a65e0800ab update docs, added gui for whisper 2024-06-01 02:01:49 +08:00
Concedo
961c789c91 wav file resampling 2024-05-30 13:41:58 +08:00
Concedo
f24aef8792 initial whisper integration 2024-05-29 23:13:11 +08:00
Concedo
4ed9ba7352 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/docker.yml
#	CMakeLists.txt
#	Makefile
#	README.md
#	flake.lock
#	tests/test-backend-ops.cpp
2024-05-28 21:57:19 +08:00
Concedo
2dedea9a74 add to remaining pyinstallers 2024-05-24 16:21:26 +08:00
Concedo
7a5983399b try to fix lora naming issues 2024-05-15 23:49:01 +08:00
Concedo
08993696c3 try apply lora on load 2024-05-15 22:53:23 +08:00
Concedo
e1e6833a7a try to fix loras 2024-05-15 16:05:51 +08:00
Concedo
ebbb969526 update lite 2024-05-15 15:00:22 +08:00
Concedo
5ce2fdad24 taesd for sdxl, add lora loading done 2024-05-14 23:02:56 +08:00
Concedo
5d15f8f76a vae test 2024-05-14 19:17:01 +08:00
Concedo
4807b66907 wip sd 2024-05-13 23:23:16 +08:00
Concedo
3667cc0113 fixed stableui btn (+4 squashed commit)
Squashed commit:

[1d4714f1] update default amount to gen

[6eacba33] updated lite

[033589af] added first ver sdui

[16f66d57] updated lite
2024-05-06 00:55:16 +08:00
Concedo
a681cdd9ef Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	common/sampling.h
#	llama.h
#	tests/test-chat-template.cpp
2024-04-24 21:29:07 +08:00
Concedo
06e3a6f36e test workflow (+9 squashed commit)
Squashed commit:

[3d1fedab] test workflow

[c26d3a50] test workflow

[70e84f54] test workflow

[3383d040] workflow test

[2262b3c6] workflow test

[cd335d5a] workflow test

[bdbbfaeb] workflow test

[8e9fed4c] testing workflow

[e5b90d66] workflow test
2024-04-11 23:20:08 +08:00
Concedo
5c323a0661 fixed img2img for different sizes 2024-04-08 23:29:46 +08:00
Concedo
1aff35524d fixed compile issues for ci 2024-04-08 20:32:31 +08:00
Concedo
1ee5f355d4 try fix some compile issues (+1 squashed commits)
Squashed commits:

[e920e76b] try fix some compile issues
2024-04-08 20:01:46 +08:00
Concedo
125f84aa02 fixed compiler warnings 2024-04-08 16:40:55 +08:00
Concedo
a530afa1e4 Merge commit '280345968d' into concedo_experimental
# Conflicts:
#	.devops/full-cuda.Dockerfile
#	.devops/llama-cpp-cuda.srpm.spec
#	.devops/main-cuda.Dockerfile
#	.devops/nix/package.nix
#	.devops/server-cuda.Dockerfile
#	.github/workflows/build.yml
#	CMakeLists.txt
#	Makefile
#	README.md
#	ci/run.sh
#	docs/token_generation_performance_tips.md
#	flake.lock
#	llama.cpp
#	scripts/LlamaConfig.cmake.in
#	scripts/compare-commits.sh
#	scripts/server-llm.sh
#	tests/test-quantize-fns.cpp
2024-04-07 20:27:17 +08:00
Concedo
0061299cce fixed quant tools not compiling, updated docs 2024-04-06 23:11:05 +08:00
Concedo
79c8e87922 remove constraint for img dimension 2024-04-06 19:58:58 +08:00
Concedo
743687020d fixed img2img 2024-04-06 17:29:44 +08:00
Concedo
942fb4b413 fixed removed ref (+1 squashed commits)
Squashed commits:

[93f3c270] fixed removed ref (+1 squashed commits)

Squashed commits:

[df361250] remove some files
2024-03-19 19:33:56 +08:00
Concedo
7968bdebbb added more stats in perf 2024-03-16 16:53:48 +08:00
Concedo
88705cb89a improve quiet mode for SD 2024-03-12 20:50:39 +08:00