Concedo
3ed3e7b7e2
reverted sequence mode for rwkv due to multiple issues with speed loss with bigger quantized models
2023-06-14 20:03:14 +08:00
Concedo
871009dfab
integrated world tokenizer for RWKV
2023-06-13 20:06:19 +08:00
Concedo
860fb026df
rwkv compile fix (+1 squashed commits)
...
Squashed commits:
[8b0ebb1] upgraded rwkv + added memory overheads + added state_out bufs
2023-06-12 23:04:40 +08:00
Concedo
c44b9c3ecf
added the llama_v2 cuda back (+2 squashed commit)
...
Squashed commit:
[1c97fd4] Revert "fix for cublas"
This reverts commit 994be9a4db
.
[fce03c3] Revert "fix for cublas"
This reverts commit 33528f5b1d
.
2023-06-11 23:23:24 +08:00
Concedo
a6a0fa338a
cleanup indentation, fixing cublas build
2023-06-08 22:40:53 +08:00
Concedo
c046db5197
lite bugfixes, buffer size changes, fixed a topk bug.
2023-06-06 22:38:25 +08:00
Concedo
9270056269
fixed compile error in cmake VS
2023-06-05 11:48:04 +08:00
Concedo
9aa2d8535b
hide gpu input box when dropdown not selected, minor memory fix for neox and gptj
2023-06-04 21:47:17 +08:00
Concedo
20803c221e
cleaning up some old junk
2023-06-04 11:05:46 +08:00
Concedo
b62279cb39
buf size for starcoder still not good
2023-06-04 00:41:08 +08:00
Concedo
c1b293d31a
fixed MPT ooms
2023-06-03 18:37:13 +08:00
Concedo
6f82e17b7a
added MPT support
2023-06-03 16:14:08 +08:00
Concedo
234270bd83
back to 32 block size, not better
2023-06-01 00:14:22 +08:00
Concedo
446e42a8c6
change dmmv block size
2023-05-31 21:40:12 +08:00
Concedo
6b3373cb81
revert bad fix
2023-05-29 22:06:12 +08:00
Concedo
ef16d09a51
fix for older gcc, updated lite
2023-05-29 18:54:15 +08:00
Concedo
97b39f875c
fixed fstat64 build error on mac
2023-05-29 15:50:07 +08:00
Concedo
55e0fbf024
wip integrating new rwkv
2023-05-27 22:45:28 +08:00
Concedo
6d7749c98f
no difference
2023-05-27 12:42:19 +08:00
Concedo
bd4fe936f5
cleanup sampling code
2023-05-27 11:58:39 +08:00
Concedo
bf482d1786
revert klite newline bug, trying to add win7 support
2023-05-24 22:21:01 +08:00
Concedo
844f92688a
subpattern fix
2023-05-24 16:48:39 +08:00
Concedo
abb9ad789c
fixed other arch
2023-05-24 00:20:43 +08:00
Concedo
981d5ba866
Merge remote-tracking branch 'occam/opencl-dev' into concedo_experimental
...
# Conflicts:
# .github/workflows/build.yml
# CMakeLists.txt
# Makefile
# README.md
# ggml-opencl.cpp
# llama.cpp
# otherarch/ggml_v2-opencl-legacy.c
2023-05-22 16:16:48 +08:00
Concedo
587308a202
fixed some build errors on linux, changed icon resolution, added more error printing
2023-05-22 12:18:42 +08:00
Concedo
fea84c3cf5
fix for stupid msvc compiler
2023-05-21 22:41:33 +08:00
Concedo
60e0c67874
fix compile errors on cuda
2023-05-21 21:13:17 +08:00
Concedo
33528f5b1d
fix for cublas
2023-05-21 21:03:36 +08:00
Concedo
994be9a4db
fix for cublas
2023-05-21 21:02:21 +08:00
Concedo
24127ebf98
updated lite, fixed some encoding issues
2023-05-21 17:29:00 +08:00
Concedo
75e4548821
missed out gpt2
2023-05-21 01:44:47 +08:00
Concedo
c048bcfec4
remove old filever checks (+7 squashed commit)
...
Squashed commit:
[b72627a] new format not working
[e568870] old ver works
[7053b77] compile errors fixed, fixing linkers
[4ae8889] add new ver
[ff82dfd] file format checks
[25b8aa8] refactoring type names
[931063b] still merging
2023-05-21 00:15:39 +08:00
Concedo
a0cfed1e30
still merging in process
2023-05-20 15:58:33 +08:00
Concedo
a8958f6b76
merging, do not use
2023-05-20 15:12:31 +08:00
Concedo
4e86a07e57
wip cleanup before big merge
2023-05-20 12:48:28 +08:00
Concedo
2c6ac06936
gpu offload not working for other arch. debug in future.
2023-05-17 17:13:01 +08:00
Concedo
57230b5196
upgrade all other formats
2023-05-17 16:28:20 +08:00
Concedo
00da2a5f4e
neox is updated
2023-05-17 14:56:54 +08:00
Concedo
90fe9096b4
clean and refactoring pass before supporting newer models for different arch
2023-05-17 11:23:29 +08:00
Concedo
94ef3e81cf
inc allocation
2023-05-16 23:32:35 +08:00
Concedo
72836d4eac
fixing more compile issues
2023-05-15 20:10:54 +08:00
Concedo
6504150fac
just testing cublas
2023-05-15 20:01:22 +08:00
Concedo
c81dd58e76
Merge commit ' f954edda93
' into archive_lib
...
# Conflicts:
# ggml.c
2023-05-14 18:34:56 +08:00
Concedo
b692e4d2a4
wip
2023-05-14 17:21:07 +08:00
Concedo
e05455f852
fixed wrong sized struct from legacy q8_1, fixed opencl varsize arrays
2023-05-13 23:56:08 +08:00
Concedo
05cf5f7d6e
partially working, but the blas matmul is broken
2023-05-13 11:35:38 +08:00
Concedo
b335f73a60
BACKWARDS COMPAT QUANT SHIM is ready, but upstream model converter is BORKED. BORK BORK.
2023-05-13 01:30:11 +08:00
Concedo
e47f7ade05
updated kobold lite, patch oom errors
2023-05-09 19:16:45 +08:00
Concedo
2f2eff6e13
the dark gods have been sated, and redpajama is integrated... but at what cost?
2023-05-08 20:58:00 +08:00
Concedo
105f818d45
integrated new version of rwkv from upstream
2023-05-03 23:26:39 +08:00