koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2025-09-10 17:14:36 +00:00

Author	SHA1	Message	Date
Concedo	c47bc28488	slight refactor for noscript ui	2023-12-08 18:35:45 +08:00
Concedo	ec21fa7712	Merge branch 'master' into concedo_experimental # Conflicts: # .github/workflows/build.yml # .gitignore # CMakeLists.txt # Makefile # Package.swift # README.md # ggml-cuda.cu # llama.cpp # llama.h # scripts/sync-ggml.sh # tests/CMakeLists.txt	2023-12-08 17:42:26 +08:00
Concedo	930cdfb1ce	updated lite, added patch that links to noscript mode	2023-12-08 16:53:30 +08:00
Concedo	c7511526a2	noscript mode is done	2023-12-07 00:52:25 +08:00
Concedo	12002d8ed6	very basic noscript mode	2023-12-06 17:51:08 +08:00
Concedo	b6f952fd8d	improved exit logic	2023-12-05 21:08:10 +08:00
Concedo	a5a5839f5c	handle accidentally selecting a kcpps file as model instead	2023-12-04 21:10:42 +08:00
Concedo	6570a2005b	token count includes ids	2023-12-03 15:44:53 +08:00
Concedo	c142c5634a	fixed segfault with clblast by reversing commit in issue https://github.com/ggerganov/llama.cpp/issues/4296	2023-12-03 00:56:00 +08:00
Concedo	a829a1ee56	fix for janitorai	2023-12-02 23:58:41 +08:00
Concedo	1c422f45cb	more printouts	2023-12-02 11:48:48 +08:00
Concedo	66ef4a20e2	refined multiuser mode	2023-11-29 14:29:45 +08:00
Concedo	b75152e3e9	added a proper quiet mode	2023-11-28 21:20:51 +08:00
Concedo	ba5c33319b	Allocate a small amount of extra context for GGUF to deal with KV fragmentation causing issues in some scenarios.	2023-11-28 20:55:14 +08:00
Concedo	d2ef458b02	show more info about available APIs	2023-11-28 17:17:47 +08:00
Concedo	0e5f16de53	reduce max ctx to fit instead of crashing	2023-11-27 19:08:54 +08:00
Concedo	2f51a6afd5	trigger quiet mode when selecting remotetunnel	2023-11-27 00:16:36 +08:00
Concedo	bffa78116d	explore quiet mode	2023-11-26 23:57:27 +08:00
Concedo	eb42c73953	revert auto rope scaling for already-ropetuned models - just use their values	2023-11-24 14:20:36 +08:00
Concedo	dc4078c039	fixed segfault with all non-gguf models	2023-11-20 22:31:56 +08:00
Concedo	22c56f9221	default to multiuser	2023-11-18 12:55:59 +08:00
Concedo	a3f708afce	added more fields to the openai compatible completions APIs	2023-11-16 00:58:08 +08:00
Concedo	8b919b5b57	allow customized rope to use model set values	2023-11-15 16:21:52 +08:00
Concedo	f4ee91abbb	improved estimation	2023-11-13 15:45:13 +08:00
Concedo	be92cfa125	added preloadstory	2023-11-10 13:05:22 +08:00
Concedo	7ef4ec3b16	added trim_stop flag	2023-11-09 16:55:44 +08:00
Concedo	afa466807d	nooby layer selector considers contextsize	2023-11-09 14:05:35 +08:00
Concedo	fb3bcac368	handle memory separately for kcpp	2023-11-07 17:15:14 +08:00
Concedo	ea81eae189	cleanup, up ver (+1 squashed commits) Squashed commits: [1ea303d6] cleanup , up ver (+1 squashed commits) Squashed commits: [79f09b22] cleanup	2023-11-05 22:49:23 +08:00
YellowRoseCx	e2e5fe56a8	KCPP Fetches AMD ROCm Memory without a stick, CC_TURING Gets the Boot, koboldcpp_hipblas.dll Talks To The Hand, and hipBLAS Compiler Finds Its Independence! (#517 ) * AMD ROCm memory fetching and max mem setting * Update .gitignore with koboldcpp_hipblas.dll * Update CMakeLists.txt remove CC_TURING for AMD * separate hipBLAS compiler, update MMV_Y, move CXX/CC print separate hipBLAS compiler, update MMV_Y value, move the section that prints CXX and CC compiler name	2023-11-05 22:23:18 +08:00
Concedo	5e5be717c3	fix for removing inaccessible backends in gui	2023-11-05 10:12:12 +08:00
Concedo	1e7088a80b	autopick cublas in gui if possible, better layer picking logic	2023-11-05 01:35:27 +08:00
Concedo	135001abc4	try to make the tunnel more reliable	2023-11-04 09:18:19 +08:00
Concedo	36f43ae834	syntax correction	2023-11-04 00:03:45 +08:00
Concedo	373c20ad51	print error log if tunnel fails	2023-11-03 23:48:21 +08:00
Concedo	879061c5d5	noavx2 clblast selector	2023-11-02 23:13:16 +08:00
Concedo	b0c7b88eac	try fix clouflare tunnel (+2 squashed commit) Squashed commit: [87d96bf2] update remote option [c30bc909] updated fixed colab (+1 squashed commits) Squashed commits: [97b77563] updated fixed colab (+2 squashed commit) Squashed commit: [d851b04c] replaced cloudflare manual dl with remotetunnel in colab [90ff1790] updated lite	2023-11-02 22:27:35 +08:00
Concedo	fca7a4c054	added noavx2 model for clblast (+1 squashed commits) Squashed commits: [291ecae6] added noavx2 mode for clblast (+1 squashed commits) Squashed commits: [562bc872] wip adding noavx2 cl	2023-11-02 15:22:34 +08:00
Concedo	82267e5e69	switched back to clinfo since it's possibly more cross platform and can get memory vals easily	2023-11-02 14:12:05 +08:00
Concedo	21588cefd4	tunnel code done (+1 squashed commits) Squashed commits: [b4bc7d20] wip integration of trycloudflare	2023-11-01 23:28:23 +08:00
Concedo	3b227fc704	automatic gpu layer detection	2023-11-01 20:55:26 +08:00
Concedo	b395dbf6f5	wip layer calculator	2023-11-01 20:04:10 +08:00
Concedo	ae2cd56de8	kobold integration of min_p sampler (+1 squashed commits) Squashed commits: [8ad2e349] kobold integration for min_p sampler	2023-11-01 19:08:45 +08:00
Concedo	df7e757d40	windows: added simpleclinfo, which helps determine clblast platform and device on windows	2023-11-01 18:10:35 +08:00
Concedo	f3690ba6d2	shifting enabled by default	2023-10-31 21:41:57 +08:00
Concedo	61c395833d	context shifting is still buggy	2023-10-30 16:25:01 +08:00
Concedo	7f5d1b2fc6	slider error	2023-10-30 00:02:38 +08:00
Concedo	7924592a83	context shift feature done	2023-10-29 18:21:39 +08:00
Concedo	09c74ea046	include content-length	2023-10-28 14:24:37 +08:00
Concedo	15f525c580	revamped smart context for llama models	2023-10-28 12:59:08 +08:00

... 13 14 15 16 17 ...

1045 commits