koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2025-09-10 17:14:36 +00:00

Author	SHA1	Message	Date
Concedo	fb3bcac368	handle memory separately for kcpp	2023-11-07 17:15:14 +08:00
Concedo	ea81eae189	cleanup, up ver (+1 squashed commits) Squashed commits: [1ea303d6] cleanup , up ver (+1 squashed commits) Squashed commits: [79f09b22] cleanup	2023-11-05 22:49:23 +08:00
YellowRoseCx	e2e5fe56a8	KCPP Fetches AMD ROCm Memory without a stick, CC_TURING Gets the Boot, koboldcpp_hipblas.dll Talks To The Hand, and hipBLAS Compiler Finds Its Independence! (#517) * AMD ROCm memory fetching and max mem setting * Update .gitignore with koboldcpp_hipblas.dll * Update CMakeLists.txt remove CC_TURING for AMD * separate hipBLAS compiler, update MMV_Y, move CXX/CC print separate hipBLAS compiler, update MMV_Y value, move the section that prints CXX and CC compiler name	2023-11-05 22:23:18 +08:00
Concedo	5e5be717c3	fix for removing inaccessible backends in gui	2023-11-05 10:12:12 +08:00
Concedo	1e7088a80b	autopick cublas in gui if possible, better layer picking logic	2023-11-05 01:35:27 +08:00
Concedo	135001abc4	try to make the tunnel more reliable	2023-11-04 09:18:19 +08:00
Concedo	36f43ae834	syntax correction	2023-11-04 00:03:45 +08:00
Concedo	373c20ad51	print error log if tunnel fails	2023-11-03 23:48:21 +08:00
Concedo	879061c5d5	noavx2 clblast selector	2023-11-02 23:13:16 +08:00
Concedo	b0c7b88eac	try fix cloudflare tunnel (+2 squashed commit) Squashed commit: [87d96bf2] update remote option [c30bc909] updated fixed colab (+1 squashed commits) Squashed commits: [97b77563] updated fixed colab (+2 squashed commit) Squashed commit: [d851b04c] replaced cloudflare manual dl with remotetunnel in colab [90ff1790] updated lite	2023-11-02 22:27:35 +08:00
Concedo	fca7a4c054	added noavx2 model for clblast (+1 squashed commits) Squashed commits: [291ecae6] added noavx2 mode for clblast (+1 squashed commits) Squashed commits: [562bc872] wip adding noavx2 cl	2023-11-02 15:22:34 +08:00
Concedo	82267e5e69	switched back to clinfo since it's possibly more cross platform and can get memory vals easily	2023-11-02 14:12:05 +08:00
Concedo	21588cefd4	tunnel code done (+1 squashed commits) Squashed commits: [b4bc7d20] wip integration of trycloudflare	2023-11-01 23:28:23 +08:00
Concedo	3b227fc704	automatic gpu layer detection	2023-11-01 20:55:26 +08:00
Concedo	b395dbf6f5	wip layer calculator	2023-11-01 20:04:10 +08:00
Concedo	ae2cd56de8	kobold integration of min_p sampler (+1 squashed commits) Squashed commits: [8ad2e349] kobold integration for min_p sampler	2023-11-01 19:08:45 +08:00
Concedo	df7e757d40	windows: added simpleclinfo, which helps determine clblast platform and device on windows	2023-11-01 18:10:35 +08:00
Concedo	f3690ba6d2	shifting enabled by default	2023-10-31 21:41:57 +08:00
Concedo	61c395833d	context shifting is still buggy	2023-10-30 16:25:01 +08:00
Concedo	7f5d1b2fc6	slider error	2023-10-30 00:02:38 +08:00
Concedo	7924592a83	context shift feature done	2023-10-29 18:21:39 +08:00
Concedo	09c74ea046	include content-length	2023-10-28 14:24:37 +08:00
Concedo	15f525c580	revamped smart context for llama models	2023-10-28 12:59:08 +08:00
Concedo	c2f675133d	support for abort without crash on disconnect	2023-10-27 15:27:17 +08:00
Concedo	aed05e5565	todo: troubleshoot sse with multiuser	2023-10-27 00:21:52 +08:00
Concedo	5db89b90b7	Merge branch 'master' into concedo_experimental # Conflicts: # .gitignore # CMakeLists.txt # Makefile # README.md # build.zig # ggml-opencl.cpp # tests/CMakeLists.txt # tests/test-double-float.cpp # tests/test-sampling.cpp	2023-10-25 23:58:15 +08:00
Concedo	98d1dba256	tighten timings	2023-10-25 20:44:20 +08:00
Concedo	cff75061fe	fixed some old models failing due to tokenizer changes, update lite (+1 squashed commits) Squashed commits: [9dee81ec] fixed some old models failing due to tokenizer changes, update lite tooltip (+3 squashed commit) Squashed commit: [5ab95a79] fixes [a561d5e2] fixed some old models failing due to tokenizer changes [95e65daf] lite updates	2023-10-22 11:04:59 +08:00
Concedo	6fa681b692	fixed a race condition with SSE streaming	2023-10-20 22:01:09 +08:00
Concedo	4382e51719	updated lite and default horde ctx amount	2023-10-19 22:49:59 +08:00
Concedo	6f8fe88f10	fix for lite (+5 squashed commit) Squashed commit: [f9ce9855] catch more exceptions [8cdaf149] tweaked horde worker timeouts, updated lite [619ebef4] fixed abort no response if failed [a54a66a2] fixed time overflow [9affdc3e] updated lite	2023-10-17 23:04:32 +08:00
Concedo	643902fbbb	fixed tensor split save and load	2023-10-13 10:07:22 +08:00
Concedo	7e2f714c9c	tensor split only for cuda	2023-10-12 17:01:52 +08:00
Alexander Abushady	11b8f97c1e	Tensor split UI (#471) * update .gitignore Remove .idea folder created by Jet Brains products. * Front end, and partial back-end Tensor Split pulled in, shows in console, then not respected on model load. * UI Tweak + Tensor Split Fix Made Tensor Flow input match similar boxes around it. Also, fixed Tensor Split to populate the correct argument. * Changed int to float for tensor split Accidentally set int, needed to be float when setting tensor split args	2023-10-12 16:50:21 +08:00
Concedo	8be043ee38	more horde optimizations	2023-10-12 16:20:52 +08:00
Concedo	8d1cd512e2	missed a flag	2023-10-12 15:00:51 +08:00
Concedo	c6fe820357	improve cors and header handling	2023-10-12 14:53:39 +08:00
Concedo	f604cffdce	multiuser racer bugfix	2023-10-12 13:39:12 +08:00
Concedo	a003e3c348	horde auto recovery	2023-10-12 00:57:32 +08:00
Concedo	d74eab0e63	actually for this round, do not include deprecated params. i dont want to have to deal with them (+2 squashed commit) Squashed commit: [df2691c2] show context limit [7c74f52a] prevent old scripts from crashing	2023-10-10 19:20:33 +08:00
YellowRoseCx	1b25b21655	Merge pull request #27 from one-lithe-rune/allow-sdk-dll-loading - Allow use of hip SDK (if installed) dlls on windows (#470) * If the rocm/hip sdk is installed on windows, then include the sdk as a potential location to load the hipBlas/rocBlas .dlls from. This allows running koboldcpp.py directly with python after building work on windows without having to build the .exe and run that or copy .dlls around. Co-authored-by: one-lithe-rune <skapusniak@lithe-runes.com>	2023-10-10 17:16:33 +08:00
Concedo	f288c6b5e3	Merge branch 'master' into concedo_experimental # Conflicts: # CMakeLists.txt # Makefile # build.zig # scripts/sync-ggml.sh	2023-10-10 00:09:46 +08:00
Matěj Štágl	96e9539f05	OpenAI compat API adapter (#466) * feat: oai-adapter * simplify optional adapter for instruct start and end tags --------- Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>	2023-10-09 23:24:48 +08:00
Concedo	4e5b6293ab	adjust streaming timings	2023-10-08 23:12:45 +08:00
Concedo	a2b8473354	force flush sse	2023-10-08 15:12:07 +08:00
Concedo	07a114de63	force debugmode to be indicated on horde, allow 64k context for gguf	2023-10-07 10:23:33 +08:00
Concedo	120695ddf7	add update link	2023-10-07 01:33:18 +08:00
Concedo	2a36c85558	abort has multiuser support via genkey too	2023-10-06 23:27:00 +08:00
Concedo	1d1232ffbc	show horde job count	2023-10-06 18:42:59 +08:00
Concedo	efd0567f10	Merge branch 'concedo' into concedo_experimental # Conflicts: # koboldcpp.py	2023-10-06 11:22:01 +08:00


318 commits