koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2025-09-11 09:34:37 +00:00

Author	SHA1	Message	Date
Concedo	a530afa1e4	Merge commit '280345968d' into concedo_experimental # Conflicts: # .devops/full-cuda.Dockerfile # .devops/llama-cpp-cuda.srpm.spec # .devops/main-cuda.Dockerfile # .devops/nix/package.nix # .devops/server-cuda.Dockerfile # .github/workflows/build.yml # CMakeLists.txt # Makefile # README.md # ci/run.sh # docs/token_generation_performance_tips.md # flake.lock # llama.cpp # scripts/LlamaConfig.cmake.in # scripts/compare-commits.sh # scripts/server-llm.sh # tests/test-quantize-fns.cpp	2024-04-07 20:27:17 +08:00
Concedo	7eccc5ffa6	change listen count, fix null	2024-02-16 16:01:24 +08:00
Concedo	5639c1a520	units (+2 squashed commit) Squashed commit: [166979d9] units coversion [038dd5d4] get rid of all warnings (+1 squashed commits) Squashed commits: [6efd1e1b] get rid of all warnings	2024-01-20 23:53:21 +08:00
Concedo	db14de5c32	fossilize ggml library ver 3, to support ggjtv3	2024-01-20 10:49:25 +08:00
Concedo	76a3ba42eb	Merge branch 'master' into concedo_experimental # Conflicts: # ggml.c # ggml.h # requirements.txt # tests/test-quantize-perf.cpp	2023-12-16 22:58:53 +08:00
Concedo	a6eb9b8010	Fix GPT2 not loading due to graph too small	2023-11-26 23:06:42 +08:00
Concedo	dc4078c039	fixed segfault with all non-gguf models	2023-11-20 22:31:56 +08:00
Concedo	5925c23d51	fix for RWKV	2023-09-01 17:02:11 +08:00
Concedo	b8372d4466	Merge branch 'master' into concedo_experimental # Conflicts: # .gitignore # README.md # tests/CMakeLists.txt	2023-08-24 15:21:24 +08:00
Concedo	523fc3be52	fixed rwkv, standardized new ctx usage	2023-07-10 20:05:53 +08:00
Concedo	2827920044	fix compile errors, rwkv not working	2023-07-10 18:23:25 +08:00
Concedo	ca9a11697c	possibly slower, but cannot use larger batches without modifying ggml library.	2023-07-04 00:35:02 +08:00
Concedo	ef3b8dc0d9	GPU accel for rwkv is slow, disable it	2023-07-02 00:41:46 +08:00
Concedo	e1a7042943	try out the new rwkv but it seems worse, may revert	2023-07-02 00:10:56 +08:00
Concedo	3ed3e7b7e2	reverted sequence mode for rwkv due to multiple issues with speed loss with bigger quantized models	2023-06-14 20:03:14 +08:00
Concedo	860fb026df	rwkv compile fix (+1 squashed commits) Squashed commits: [8b0ebb1] upgraded rwkv + added memory overheads + added state_out bufs	2023-06-12 23:04:40 +08:00
Concedo	97b39f875c	fixed fstat64 build error on mac	2023-05-29 15:50:07 +08:00
Concedo	55e0fbf024	wip integrating new rwkv	2023-05-27 22:45:28 +08:00

18 commits