Commit graph

11 commits

Author SHA1 Message Date
0cc4m
b73c437e83 Fix convert_row_f16 kernel issue 2023-05-18 08:05:19 +02:00
0cc4m
0df55da4ca Deduplicate dequant kernels 2023-05-18 07:35:40 +02:00
0cc4m
67dbd356b6 Remove redundant constant values 2023-05-17 19:20:46 +02:00
0cc4m
de10afa80f Fix tensor load to device
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
2023-05-16 18:49:49 +02:00
0cc4m
b3ff66d87f Fix error in convert f16 to f32 kernel call 2023-05-16 13:05:33 +02:00
0cc4m
342d346c13 Generate dequant_mul_mat kernels from simple templates 2023-05-16 07:42:01 +02:00
0cc4m
5a74dc1536 Add remaining dequant_mul_mat functions 2023-05-14 22:19:54 +02:00
0cc4m
883e587a04 Fix dequant_mul_mat kernel 2023-05-14 21:26:28 +02:00
0cc4m
8795403de3 Fix bugs in dequant_mul_mat code 2023-05-14 21:14:05 +02:00
0cc4m
c77966524a Refactor OpenCL code to work more like the CUDA code, add missing functions 2023-05-14 17:01:46 +02:00
0cc4m
82bc517b9a Move back to C++ for OpenCL 2023-05-14 17:00:37 +02:00
Renamed from ggml-opencl.c (Browse further)