koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2025-09-11 01:24:36 +00:00

Concedo ad638285de Merge branch 'master' into concedo_experimental (conflicts: Makefile, README.md, flake.lock, ggml-cuda.cu, llama.cpp, tests/test-backend-ops.cpp, tests/test-quantize-fns.cpp)	2024-02-28 13:41:35 +08:00
tools	Merge branch 'master' into concedo_experimental	2024-02-28 13:41:35 +08:00
ggml_v1.c	Merge branch 'master' into concedo_experimental	2023-10-31 20:44:04 +08:00
ggml_v1.h	Various enhancement and integration pygmalion.cpp	2023-04-03 00:04:43 +08:00
ggml_v2-cuda-legacy.cu	increase pool buffers	2024-01-20 11:52:39 +08:00
ggml_v2-cuda-legacy.h	Wip, CUDA porting malloc improvements, gpu accel for non-llama, backport old quants	2023-06-28 18:20:46 +08:00
ggml_v2-cuda.cu	Koboldcpp-ROCm Port (#399)	2023-08-28 17:05:06 +08:00
ggml_v2-cuda.h	cleanup indentation, fixing cublas build	2023-06-08 22:40:53 +08:00
ggml_v2-opencl-legacy.c	Merge remote-tracking branch 'occam/opencl-dev' into concedo_experimental	2023-05-22 16:16:48 +08:00
ggml_v2-opencl-legacy.h	still merging in process	2023-05-20 15:58:33 +08:00
ggml_v2-opencl.cpp	Reduce warnings. (#439)	2023-09-16 18:52:09 +08:00
ggml_v2-opencl.h	still merging in process	2023-05-20 15:58:33 +08:00
ggml_v2.c	Reduce warnings. (#439)	2023-09-16 18:52:09 +08:00
ggml_v2.h	remove old filever checks (+7 squashed commit)	2023-05-21 00:15:39 +08:00
ggml_v3-cuda.cu	try fix cuda slowdown	2024-02-05 16:34:15 +08:00
ggml_v3-cuda.h	fossilize ggml library ver 3, to support ggjtv3	2024-01-20 10:49:25 +08:00
ggml_v3-opencl.cpp	fossilize ggml library ver 3, to support ggjtv3	2024-01-20 10:49:25 +08:00
ggml_v3-opencl.h	fossilize ggml library ver 3, to support ggjtv3	2024-01-20 10:49:25 +08:00
ggml_v3.c	Merge branch 'master' into concedo_experimental	2024-01-20 23:27:42 +08:00
ggml_v3.h	try to use allocator for cuda ggml v3	2024-01-20 12:53:31 +08:00
gpt2_v1.cpp	fixed other arch	2023-05-24 00:20:43 +08:00
gpt2_v2.cpp	Reduce warnings. (#439)	2023-09-16 18:52:09 +08:00
gpt2_v3.cpp	fossilize ggml library ver 3, to support ggjtv3	2024-01-20 10:49:25 +08:00
gptj_v1.cpp	remove old filever checks (+7 squashed commit)	2023-05-21 00:15:39 +08:00
gptj_v2.cpp	Reduce warnings. (#439)	2023-09-16 18:52:09 +08:00
gptj_v3.cpp	fossilize ggml library ver 3, to support ggjtv3	2024-01-20 10:49:25 +08:00
llama-util.h	units (+2 squashed commit)	2024-01-20 23:53:21 +08:00
llama_v2-util.h	units (+2 squashed commit)	2024-01-20 23:53:21 +08:00
llama_v2.cpp	Koboldcpp-ROCm Port (#399)	2023-08-28 17:05:06 +08:00
llama_v2.h	still merging in process	2023-05-20 15:58:33 +08:00
llama_v3.cpp	refactor identifiers	2024-01-20 17:26:11 +08:00
llama_v3.h	fossilize ggml library ver 3, to support ggjtv3	2024-01-20 10:49:25 +08:00
mpt_v3.cpp	fossilize ggml library ver 3, to support ggjtv3	2024-01-20 10:49:25 +08:00
neox_v2.cpp	fixed other arch	2023-05-24 00:20:43 +08:00
neox_v3.cpp	fossilize ggml library ver 3, to support ggjtv3	2024-01-20 10:49:25 +08:00
otherarch.h	try to use allocator for cuda ggml v3	2024-01-20 12:53:31 +08:00
rwkv_v2.cpp	Reduce warnings. (#439)	2023-09-16 18:52:09 +08:00
rwkv_v2.h	wip integrating new rwkv	2023-05-27 22:45:28 +08:00
rwkv_v3.cpp	change listen count, fix null	2024-02-16 16:01:24 +08:00
rwkv_v3.h	fix compile errors, rwkv not working	2023-07-10 18:23:25 +08:00
rwkv_vocab.cpp	integrated world tokenizer for RWKV	2023-06-13 20:06:19 +08:00
utils.cpp	b64 decoder	2024-02-11 20:35:34 +08:00
utils.h	b64 decoder	2024-02-11 20:35:34 +08:00