koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2025-09-13 18:39:48 +00:00

Author	SHA1	Message	Date
Concedo	e9d2332dd8	improved tool calls and whisper	2024-12-06 14:34:31 +08:00
Concedo	2ba5949054	updated sdcpp, also set euler as default sampler	2024-12-01 17:00:20 +08:00
Concedo	409e393d10	fixed critical bug in image model loader	2024-11-30 23:28:24 +08:00
Concedo	f75bbb945f	speculative decoding initial impl completed (+6 squashed commit) Squashed commit: [0a6306ca0] draft wip dont use (will be squashed) [a758a1c9c] wip dont use (will be squashed) [e1994d3ce] wip dont use [f59690d68] wip [77228147d] wip on spec decoding. dont use yet [2445bca54] wip adding speculative decoding (+1 squashed commits) Squashed commits: [50e341bb7] wip adding speculative decoding	2024-11-30 10:41:10 +08:00
Concedo	116879144c	better error messages	2024-11-23 18:55:01 +08:00
Concedo	70aee82552	attempts a backflip, but does he stick the landing?	2024-11-16 17:05:45 +08:00
Concedo	8a7d53d838	declutter sdcpp adapter removed useless string literals	2024-11-14 00:27:39 +08:00
Concedo	dd95f88c19	add e5m2 support for use in Kobo, also made a separate contribution PR https://github.com/leejet/stable-diffusion.cpp/pull/460	2024-11-13 11:35:49 +08:00
Concedo	3813f6c517	added new flag nofastforward allowing users to disable fast forwarding	2024-11-13 10:59:01 +08:00
bebopkim	bb6c8bc095	Correct compilation errors on macOS Metal (#1204 )	2024-11-08 11:42:31 +08:00
Concedo	262437f393	fallback flux loader	2024-11-07 15:55:43 +08:00
Concedo	ccbd630a42	allow custom t5, clipl and clipg	2024-11-06 19:05:48 +08:00
Concedo	3cfc4dc581	avoid euler a for flux (+4 squashed commit) Squashed commit: [5a4b72385] fix cuda build [5f969a645] add vulkan information [6849e7398] fixed flux [740e80419] update readme	2024-11-05 22:50:14 +08:00
Concedo	5b90eeaf17	fixed sd to work on larger images by adding tiling, also limit res for sd1.5	2024-11-04 23:26:15 +08:00
Concedo	847689e74c	fixed incorrect makefile flags	2024-11-04 20:39:10 +08:00
Concedo	bb13925f39	Merge branch 'upstream' into concedo_experimental # Conflicts: # CMakePresets.json # Makefile # Package.swift # ci/run.sh # common/CMakeLists.txt # examples/CMakeLists.txt # flake.lock # ggml/src/CMakeLists.txt # ggml/src/ggml-backend.cpp # ggml/src/ggml.c # pocs/vdot/q8dot.cpp # pocs/vdot/vdot.cpp # tests/test-backend-ops.cpp # tests/test-grad0.cpp # tests/test-quantize-fns.cpp # tests/test-quantize-perf.cpp # tests/test-rope.cpp	2024-11-04 16:54:53 +08:00
Concedo	5233e8ed1d	sd 3.5 medium	2024-11-03 23:27:06 +08:00
Concedo	f32a874966	resync and updated sdcpp for flux and sd3 support	2024-11-03 22:03:16 +08:00
Concedo	6da5a63852	fix for uploaded wav files being incomplete due to fragmentation when converting to b64	2024-10-20 17:47:19 +08:00
Concedo	e692a79aab	Merge branch 'upstream' into concedo_experimental # Conflicts: # .github/workflows/docker.yml # CMakeLists.txt # CONTRIBUTING.md # docs/android.md # docs/docker.md # examples/embedding/embedding.cpp # examples/imatrix/imatrix.cpp # examples/infill/infill.cpp # examples/llama-bench/llama-bench.cpp # examples/main/README.md # examples/parallel/parallel.cpp # examples/perplexity/perplexity.cpp # examples/quantize-stats/quantize-stats.cpp # examples/save-load-state/save-load-state.cpp # examples/server/README.md # examples/simple/CMakeLists.txt # examples/speculative/speculative.cpp # flake.lock # ggml/src/CMakeLists.txt # ggml/src/ggml-blas.cpp # pocs/vdot/q8dot.cpp # pocs/vdot/vdot.cpp # scripts/debug-test.sh # scripts/sync-ggml.last # src/llama.cpp # tests/test-backend-ops.cpp # tests/test-chat-template.cpp # tests/test-quantize-fns.cpp # tests/test-quantize-perf.cpp # tests/test-tokenizer-0.cpp # tests/test-tokenizer-1-bpe.cpp # tests/test-tokenizer-1-spm.cpp	2024-10-11 11:59:59 +08:00
bebopkim	7dac9982f9	Metal: remove ggml_backend_metal_log_set_callback due to backend logging mechanism unificification `d6fe7ab` (#1144 )	2024-10-06 14:54:33 +08:00
Concedo	2d57f80af9	Fix compilation on macos	2024-10-03 15:10:30 +08:00
Concedo	0b0e456202	try fix compile issues	2024-09-19 13:56:19 +08:00
Concedo	e44ddf26ef	Merge branch 'upstream' into concedo_experimental # Conflicts: # .github/workflows/build.yml # .github/workflows/server.yml # CMakeLists.txt # Makefile # examples/embedding/embedding.cpp # examples/imatrix/imatrix.cpp # examples/llama-bench/llama-bench.cpp # examples/llava/MobileVLM-README.md # examples/parallel/parallel.cpp # examples/perplexity/perplexity.cpp # examples/quantize/CMakeLists.txt # examples/server/README.md # examples/speculative/speculative.cpp # tests/test-backend-ops.cpp	2024-09-13 16:17:24 +08:00
Concedo	eee67281be	move kcpp params out	2024-09-10 16:30:12 +08:00
Concedo	12fd16bfd4	Merge commit '`df270ef745`' into concedo_experimental # Conflicts: # Makefile # common/CMakeLists.txt # common/common.h # common/sampling.cpp # common/sampling.h # examples/infill/infill.cpp # examples/llama-bench/llama-bench.cpp # examples/quantize-stats/quantize-stats.cpp # examples/server/server.cpp # include/llama.h # src/llama-sampling.cpp # src/llama-sampling.h # src/llama.cpp # tests/test-grammar-integration.cpp # tests/test-grammar-parser.cpp # tests/test-json-schema-to-grammar.cpp # tests/test-llama-grammar.cpp # tests/test-sampling.cpp	2024-09-09 17:10:08 +08:00
Concedo	d220495dd4	Merge branch 'upstream' into concedo_experimental # Conflicts: # .devops/full-cuda.Dockerfile # .devops/llama-cli-cuda.Dockerfile # .devops/llama-server-cuda.Dockerfile # .devops/llama-server-intel.Dockerfile # .devops/llama-server-rocm.Dockerfile # .devops/llama-server-vulkan.Dockerfile # .devops/llama-server.Dockerfile # .github/workflows/docker.yml # docs/docker.md # examples/llama-bench/llama-bench.cpp # flake.lock # ggml/include/ggml.h # ggml/src/CMakeLists.txt # scripts/sync-ggml.last # src/llama.cpp # tests/test-backend-ops.cpp # tests/test-grad0.cpp # tests/test-rope.cpp	2024-08-30 10:37:39 +08:00
Concedo	b2c1ff7a13	Merge branch 'upstream' into concedo_experimental # Conflicts: # .ecrc # CMakePresets.json # ci/run.sh # docs/backend/SYCL.md # ggml/src/CMakeLists.txt # src/llama.cpp # tests/test-backend-ops.cpp # tests/test-sampling.cpp	2024-08-27 17:46:40 +08:00
Concedo	c61fa9155d	handle oversized images by downscaling	2024-08-26 13:58:18 +08:00
Concedo	7bc87e1f0f	added llava letterboxing feature	2024-08-25 23:15:38 +08:00
Concedo	d71b5477c5	update lite, cleanup, fix interrogate format	2024-08-18 00:48:53 +08:00
Concedo	e1f97f7fb5	Merge branch 'upstream' into concedo_experimental # Conflicts: # .devops/llama-server.Dockerfile # README.md # flake.lock # ggml/src/ggml-vulkan.cpp # ggml/src/vulkan-shaders/concat.comp # ggml/src/vulkan-shaders/pad.comp # ggml/src/vulkan-shaders/vulkan-shaders-gen.cpp # scripts/sync-ggml-am.sh # scripts/sync-ggml.last # src/llama.cpp # tests/test-backend-ops.cpp	2024-08-06 16:33:26 +08:00
Concedo	3a72410804	Added vulkan support for SD (+1 squashed commits) Squashed commits: [13f42f83] Added vulkan support for SD	2024-08-01 17:12:33 +08:00
Concedo	066e7ac540	minor fixes: colab gpu backend, lite bugs, package python file with embd	2024-07-15 17:36:03 +08:00
Concedo	11f0643fa4	fix pyinstallers	2024-06-27 15:19:44 +08:00
Concedo	2f7168779d	Merge branch 'concedo_experimental' of https://github.com/LostRuins/koboldcpp into concedo_experimental	2024-06-06 20:26:57 +08:00
Concedo	1ad56e9b6b	if quiet mode just show transcription event without text	2024-06-06 20:26:47 +08:00
Lexi	1c5e05e477	whisper: fix printf format string (#894 ) This format string uses %d to print uint32_t and size_t{ype,}, which is not guaranteed to work. Instead, use PRIu32 for uint32_t, and %zu for size_t.	2024-06-06 19:50:59 +08:00
Concedo	813cf829b5	allow selecting multigpu on vulkan	2024-06-06 18:36:56 +08:00
Concedo	9c7340bfac	downgrade stb image resize	2024-06-02 18:02:22 +08:00
Concedo	b0a7d1aba6	fixed makefile (+1 squashed commits) Squashed commits: [ef6ddaf5] try fix makefile	2024-06-02 15:21:48 +08:00
Concedo	a65e0800ab	update docs, added gui for whisper	2024-06-01 02:01:49 +08:00
Concedo	961c789c91	wav file resampling	2024-05-30 13:41:58 +08:00
Concedo	f24aef8792	initial whisper integration	2024-05-29 23:13:11 +08:00
Concedo	4ed9ba7352	Merge branch 'upstream' into concedo_experimental # Conflicts: # .github/workflows/docker.yml # CMakeLists.txt # Makefile # README.md # flake.lock # tests/test-backend-ops.cpp	2024-05-28 21:57:19 +08:00
Concedo	2dedea9a74	add to remaining pyinstallers	2024-05-24 16:21:26 +08:00
Concedo	7a5983399b	try to fix lora naming issues	2024-05-15 23:49:01 +08:00
Concedo	08993696c3	try apply lora on load	2024-05-15 22:53:23 +08:00
Concedo	e1e6833a7a	try to fix loras	2024-05-15 16:05:51 +08:00
Concedo	ebbb969526	update lite	2024-05-15 15:00:22 +08:00

1 2 3 4 5 ...

310 commits