koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2026-05-11 04:51:25 +00:00

Author	SHA1	Message	Date
Concedo	8a18e094f5	added smartcaching implementation inspired from Pento95 (+2 squashed commit) Squashed commit: [fcc498688] wip basic smart caching test [b6e8b2577] wip basic smart caching test	2025-12-10 18:00:03 +08:00
Concedo	1aab32fe03	fixed safetensors loading for zimage	2025-12-09 18:09:47 +08:00
Wagner Bruna	801840d3bd	sd: sync to master-391-5865b5e (#1878)	2025-12-08 19:53:52 +08:00
Concedo	20363dc6e7	z image limit cfg scale to 3.0 max	2025-12-07 16:24:26 +08:00
Wagner Bruna	510508e7da	sd: sync to master-391-5865b5e (#1872)	2025-12-04 16:29:38 +08:00
Concedo	7316c24bf6	fixes to load flux2	2025-12-03 18:38:54 +08:00
Wagner Bruna	fbdcc74ae6	adapt kcpp changes for z-image	2025-12-01 19:15:38 -03:00
Wagner Bruna	fcb86f22df	sd: sync to master-387-e4c50f1	2025-12-01 19:15:38 -03:00
Wagner Bruna	e971eaefe3	fix: qwenvl.hpp was renamed to llm.hpp	2025-12-01 19:15:38 -03:00
Wagner Bruna	53407c866d	fix mistral vocab filename and qwen2vl tensor name	2025-12-01 17:31:02 -03:00
Wagner Bruna	07f53a4d84	fix qwen2vl tensor detection	2025-12-01 17:04:56 -03:00
Wagner Bruna	3db48a1536	sd: sync to master-383-20eb674	2025-12-01 17:04:56 -03:00
Concedo	438eae7105	mistral2 vocab for sdcpp	2025-12-01 22:32:58 +08:00
Ruben Garcia	06d39dff73	Fix warnings (#1864)	2025-11-29 20:18:38 +08:00
Concedo	9a7f749f7c	minor tweak for sd	2025-11-24 22:31:03 +08:00
Wagner Bruna	717d9c6401	sd: sync to master-377-2034588	2025-11-23 19:29:01 -03:00
Wagner Bruna	703bbf67d8	sd: sync to master-371-5498cc0	2025-11-23 19:29:01 -03:00
Wagner Bruna	8ef66e90c1	sd: sync to master-366-f532972	2025-11-23 19:29:01 -03:00
Wagner Bruna	3a7dd1a97f	sd: sync to master-358-347710f Also adapt Koboldcpp LoRA loading function, and add backend support for lora_apply_mode.	2025-11-23 19:28:54 -03:00
Wagner Bruna	3318b73c94	sd: sync to master-355-694f0d9	2025-11-23 19:28:34 -03:00
LostRuins Concedo	281542aa0d	add smoothing curve, not tested	2025-11-17 23:07:35 +08:00
LostRuins Concedo	85060da3ce	rename ggml_cumsum for tts	2025-11-16 17:54:58 +08:00
LostRuins Concedo	5751c30790	add vulkan for whisper	2025-11-13 15:37:58 +08:00
LostRuins Concedo	a5cc934f04	cherry pick patch in https://github.com/leejet/stable-diffusion.cpp/pull/957 and https://github.com/leejet/stable-diffusion.cpp/pull/935	2025-11-10 23:05:38 +08:00
Concedo	333e2bb30b	fix for qwen image crashing due to ref images being too big, trial and error shows it happens after 512x512	2025-11-02 01:31:01 +08:00
Concedo	7179e49aef	fix from https://github.com/leejet/stable-diffusion.cpp/pull/926	2025-11-01 23:38:37 +08:00
Concedo	0891b0752d	qwen3vl fixed (+2 squashed commit) Squashed commit: [89f65ed0c] wip fixing q3vl [6fa34cff2] wip fixing q3vl	2025-10-31 17:52:33 +08:00
Concedo	c2316353a1	allow usage of flux without some components	2025-10-30 22:32:20 +08:00
Wagner Bruna	96a70033ba	sd: sync to master-343-dd75fc0 (#1818)	2025-10-30 13:44:59 +08:00
Wagner Bruna	c652d08f02	sd: sync to master-340-9e28be6 (#1816)	2025-10-27 21:47:48 +08:00
Wagner Bruna	2cab657c60	sd: sync to master-336-917f7bf (#1810)	2025-10-25 21:19:35 +08:00
tsite	97867f1990	add alt umt5xxl tensor name (#1813)	2025-10-25 12:48:08 +08:00
Wagner Bruna	fef73919ea	sd: clean up changes against stable-diffusion.cpp 90ef5f8 (#1804) * sd: clean up changes against stable-diffusion.cpp 90ef5f8 Clean up the diff, and include a few missing changes, mainly from the upscaler and model weight type statistics. * added line clear again * remove excess spaces --------- Co-authored-by: LostRuins Concedo <39025047+LostRuins@users.noreply.github.com>	2025-10-23 22:00:33 +08:00
Concedo	86b94456de	Sync sd.cpp to 90ef5f8	2025-10-20 11:07:42 +08:00
Concedo	fab2ff0687	sync sd.cpp to e370258	2025-10-20 10:45:34 +08:00
Concedo	49535fdfed	rename define	2025-10-20 10:08:39 +08:00
Wagner Bruna	d7da1eb35c	invert KCPP_BAKE_SD_VOCAB logic, move define to sdtype_adapter.cpp (#1803) Using KCPP_BAKE_SD_VOCAB to turn off the change to not embed the vocabulary files makes testing new upstream merges harder, because we then need to set that macro on the sd.cpp original build. So, revert the tests, making the define turn the change on. Also, since model.cpp is always built by Koboldcpp as part of the sdtype_adapter.cpp, it's enough to set the macro on that file.	2025-10-20 10:07:37 +08:00
Concedo	d9e898afe0	reset scheduler if default otherwise it will persist the old one	2025-10-19 19:53:40 +08:00
Concedo	f6916ba864	updated sdui	2025-10-17 13:56:45 +08:00
Concedo	c18d7991c8	rename tts.cpp ggml_round to ggml_ttsround to avoid conflict	2025-10-16 14:24:44 +08:00
Concedo	5207b8d4be	more sd path fallbacks	2025-10-15 15:22:06 +08:00
Concedo	610ba18971	sdcpp precision fix	2025-10-15 11:08:35 +08:00
Concedo	54db35cd7a	fix t5 scale as well	2025-10-12 20:35:46 +08:00
Concedo	1a360b8458	sdcpp: optimize the handling of the FeedForward precision fix (+1 squashed commits) Squashed commits: [621ff6392] sdcpp: optimize the handling of the FeedForward precision fix (+1 squashed commits) Squashed commits: [05b16906c] sdcpp: optimize the handling of the FeedForward precision fix	2025-10-12 17:49:38 +08:00
Concedo	76d5fcbe49	fix the issue that occurs when using CUDA with k-quants weights	2025-10-12 16:18:03 +08:00
Concedo	65c2129f65	https://github.com/leejet/stable-diffusion.cpp/pull/877/commits/47c0f8e4bd6916442d04b0a4412554cf3a043e8d	2025-10-12 10:01:29 +08:00
Concedo	0cc0ea4cf9	reset prompt template idx	2025-10-11 12:30:07 +08:00
Concedo	5cea2fe944	don't enforce dims	2025-10-11 11:34:47 +08:00
Concedo	80f88eb703	wip qwen image edit. not working yet	2025-10-11 11:24:17 +08:00
Concedo	bc09f34f66	only accept qwen image pruned models matching 40 or 41 layers	2025-10-10 16:26:32 +08:00


493 commits