Concedo
|
90fe9096b4
|
clean and refactoring pass before supporting newer models for different arch
|
2023-05-17 11:23:29 +08:00 |
|
Concedo
|
e05455f852
|
fixed wrong sized struct from legacy q8_1, fixed opencl varsize arrays
|
2023-05-13 23:56:08 +08:00 |
|
Concedo
|
b335f73a60
|
BACKWARDS COMPAT QUANT SHIM is ready, but upstream model converter is BORKED. BORK BORK.
|
2023-05-13 01:30:11 +08:00 |
|
Concedo
|
3de34ee492
|
Merge branch 'master' into concedo_experimental
# Conflicts:
# CMakeLists.txt
# Makefile
# ggml-opencl.c
|
2023-05-01 12:03:46 +08:00 |
|
Concedo
|
032a171867
|
integrated q5 formats
|
2023-04-28 12:58:39 +08:00 |
|
Concedo
|
59fb174678
|
fixed compile errors, made mmap automatic when lora is selected, added updated quantizers and quantization handling for gpt neox gpt 2 and gptj
|
2023-04-24 23:20:06 +08:00 |
|