Commit graph

15 commits

Author SHA1 Message Date
Concedo
b13a768813 added softprompt endpoint 2023-03-25 10:12:47 +08:00
Concedo
e791827973 added a GUI for selection of models if none was passed in through command line. 2023-03-24 22:03:57 +08:00
Concedo
c6c60332a4 Optimizations 2023-03-24 21:33:53 +08:00
Concedo
706e19e9b4 added ability to fast forward in time through partially duplicated prompts 2023-03-24 18:50:16 +08:00
Concedo
5f142df76e dynamic max context size defaulting to 1024, also implemented the basic API as a fallback 2023-03-22 15:56:47 +08:00
Henk
19178fa28e 2048 context all core 2023-03-21 21:49:47 +01:00
Concedo
7e6f4ed5e2 use a portable exit function 2023-03-21 22:00:27 +08:00
Concedo
a1625c4be1 changed file embedding technique 2023-03-21 21:16:06 +08:00
Concedo
ff1c0c4898 added embedded copy of kobold lite 2023-03-21 20:41:19 +08:00
Concedo
8d39365af6 update license, added backwards compatibility with both ggml model formats, fixed context length issues. 2023-03-20 23:43:35 +08:00
Concedo
dda69d4034 bugfixes 2023-03-20 13:37:51 +08:00
Concedo
29054a2bee explicit buffer allocation from python 2023-03-20 01:18:34 +08:00
Concedo
356c1b87ba bugfixes and support for persistent states 2023-03-20 00:59:45 +08:00
Concedo
f952b7c613 Removed junk, fixed some bugs and support dynamic number of sharded files
Merge remote-tracking branch 'origin/master' into concedo

# Conflicts:
#	README.md
2023-03-19 11:13:00 +08:00
Concedo
2c8f870f53 Created a python bindings for llama.cpp and emulated a simple Kobold HTTP API Endpoint 2023-03-19 00:07:11 +08:00