koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2025-09-12 09:59:41 +00:00

Concedo 86469d15c4 fix for yr-rocm, large gpu scratch	2023-06-30 12:40:08 +08:00
tools	integrated world tokenizer for RWKV	2023-06-13 20:06:19 +08:00
ggml_v1.c	just testing cublas	2023-05-15 20:01:22 +08:00
ggml_v1.h	Various enhancement and integration pygmalion.cpp	2023-04-03 00:04:43 +08:00
ggml_v2-cuda-legacy.cu	fix compile errors	2023-06-29 17:54:12 +08:00
ggml_v2-cuda-legacy.h	Wip, CUDA porting malloc improvements, gpu accel for non-llama, backport old quants	2023-06-28 18:20:46 +08:00
ggml_v2-cuda.cu	cleanup indentation, fixing cublas build	2023-06-08 22:40:53 +08:00
ggml_v2-cuda.h	cleanup indentation, fixing cublas build	2023-06-08 22:40:53 +08:00
ggml_v2-opencl-legacy.c	Merge remote-tracking branch 'occam/opencl-dev' into concedo_experimental	2023-05-22 16:16:48 +08:00
ggml_v2-opencl-legacy.h	still merging in process	2023-05-20 15:58:33 +08:00
ggml_v2-opencl.cpp	back to 32 block size, not better	2023-06-01 00:14:22 +08:00
ggml_v2-opencl.h	still merging in process	2023-05-20 15:58:33 +08:00
ggml_v2.c	Wip, CUDA porting malloc improvements, gpu accel for non-llama, backport old quants	2023-06-28 18:20:46 +08:00
ggml_v2.h	remove old filever checks (+7 squashed commit)	2023-05-21 00:15:39 +08:00
gpt2_v1.cpp	fixed other arch	2023-05-24 00:20:43 +08:00
gpt2_v2.cpp	cleaning up some old junk	2023-06-04 11:05:46 +08:00
gpt2_v3.cpp	fix for yr-rocm, large gpu scratch	2023-06-30 12:40:08 +08:00
gptj_v1.cpp	remove old filever checks (+7 squashed commit)	2023-05-21 00:15:39 +08:00
gptj_v2.cpp	cleaning up some old junk	2023-06-04 11:05:46 +08:00
gptj_v3.cpp	fix for yr-rocm, large gpu scratch	2023-06-30 12:40:08 +08:00
llama_v2-util.h	cleanup indentation, fixing cublas build	2023-06-08 22:40:53 +08:00
llama_v2.cpp	fix for yr-rocm, large gpu scratch	2023-06-30 12:40:08 +08:00
llama_v2.h	still merging in process	2023-05-20 15:58:33 +08:00
mpt_v3.cpp	fix for yr-rocm, large gpu scratch	2023-06-30 12:40:08 +08:00
neox_v2.cpp	fixed other arch	2023-05-24 00:20:43 +08:00
neox_v3.cpp	fix for yr-rocm, large gpu scratch	2023-06-30 12:40:08 +08:00
otherarch.h	Implemented basic GPU offloading for MPT, GPT-2, GPT-J and GPT-NeoX	2023-06-22 00:43:25 +08:00
rwkv_v2.cpp	wip integrating new rwkv	2023-05-27 22:45:28 +08:00
rwkv_v2.h	wip integrating new rwkv	2023-05-27 22:45:28 +08:00
rwkv_v3.cpp	reverted sequence mode for rwkv due to multiple issues with speed loss with bigger quantized models	2023-06-14 20:03:14 +08:00
rwkv_v3.h	rwkv compile fix (+1 squashed commits)	2023-06-12 23:04:40 +08:00
rwkv_vocab.cpp	integrated world tokenizer for RWKV	2023-06-13 20:06:19 +08:00
utils.cpp	wstring convert for mpt	2023-06-24 11:43:42 +08:00
utils.h	wstring convert for mpt	2023-06-24 11:43:42 +08:00