Concedo
|
085a9f90a7
|
still refactoring
|
2023-04-01 11:56:34 +08:00 |
|
Concedo
|
9ab6e87b58
|
Merge branch 'master' into concedo
# Conflicts:
# CMakeLists.txt
|
2023-04-01 09:05:45 +08:00 |
|
Concedo
|
801b178f2a
|
still refactoring, but need a checkpoint to prepare build for 1.0.7
|
2023-04-01 08:55:14 +08:00 |
|
Concedo
|
6b86f5ea22
|
halfway refactoring, wip adding other model types
|
2023-04-01 01:13:05 +08:00 |
|
Concedo
|
559a1967f7
|
Backwards compatibility formats all done
Merge branch 'master' into concedo
# Conflicts:
# CMakeLists.txt
# README.md
# llama.cpp
|
2023-03-31 19:01:33 +08:00 |
|
Concedo
|
79f9743347
|
improved console info, fixed utf encoding bugs
|
2023-03-31 15:38:38 +08:00 |
|
Concedo
|
664b277c27
|
integrated libopenblas for greatly accelerated prompt processing. Windows binaries are included - feel free to build your own or to build for other platforms, but that is beyond the scope of this repo. Will fall back to non-blas if libopenblas is removed.
|
2023-03-30 00:43:52 +08:00 |
|
Concedo
|
57474944d6
|
Merge branch 'master' into concedo
# Conflicts:
# .github/workflows/build.yml
# CMakeLists.txt
# Makefile
# README.md
|
2023-03-26 14:52:08 +08:00 |
|
Concedo
|
3c78124aac
|
Merge branch 'master' into concedo
# Conflicts:
# README.md
|
2023-03-25 11:20:04 +08:00 |
|
Concedo
|
119392f6f2
|
defaulting to f32 kv, and 4 threads seem to produce better results
|
2023-03-25 11:11:40 +08:00 |
|
Concedo
|
c6c60332a4
|
Optimizations
|
2023-03-24 21:33:53 +08:00 |
|
Concedo
|
706e19e9b4
|
added ability to fast forward in time through partially duplicated prompts
|
2023-03-24 18:50:16 +08:00 |
|
Concedo
|
86c7457e24
|
Merge branch 'master' into concedo
# Conflicts:
# .github/workflows/build.yml
# CMakeLists.txt
# Makefile
# README.md
# main.cpp
|
2023-03-22 22:31:45 +08:00 |
|
Concedo
|
5f142df76e
|
dynamic max context size defaulting to 1024, also implemented the basic API as a fallback
|
2023-03-22 15:56:47 +08:00 |
|
Concedo
|
8d39365af6
|
update license, added backwards compatibility with both ggml model formats, fixed context length issues.
|
2023-03-20 23:43:35 +08:00 |
|
Concedo
|
a2c10e0d2f
|
Merge branch 'master' into concedo
# Conflicts:
# .devops/full.Dockerfile
# README.md
# main.cpp
|
2023-03-20 20:58:27 +08:00 |
|
Concedo
|
29054a2bee
|
explicit buffer allocation from python
|
2023-03-20 01:18:34 +08:00 |
|
Concedo
|
356c1b87ba
|
bugfixes and support for persistent states
|
2023-03-20 00:59:45 +08:00 |
|
Concedo
|
f952b7c613
|
Removed junk, fixed some bugs and support dynamic number of sharded files
Merge remote-tracking branch 'origin/master' into concedo
# Conflicts:
# README.md
|
2023-03-19 11:13:00 +08:00 |
|
Concedo
|
2c8f870f53
|
Created a python bindings for llama.cpp and emulated a simple Kobold HTTP API Endpoint
|
2023-03-19 00:07:11 +08:00 |
|