Concedo
f81404e33c
updated class py, added imatrix
2024-01-28 22:37:11 +08:00
Kawrakow
15bceec2d7
imatrix : keep intermediate imatrix results ( #5077 )
...
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2024-01-22 14:18:43 +02:00
Kawrakow
726c0fa9a2
Slightly faster imatrix ( #5050 )
...
* imatrix: speedup by avoiding unnecessary allocations and copies
* imatrix: add --no-ppl option to skip PPL calculations altogether
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2024-01-21 08:01:20 +02:00
Georgi Gerganov
a5cacb22b2
imatrix : add README.md
2024-01-19 15:24:47 +02:00
Georgi Gerganov
2d5419d08a
imatrix : fix assert for src0 non-cont check
2024-01-18 21:45:51 +02:00
Georgi Gerganov
ba69bbc84c
imatrix : offload to GPU support ( #4957 )
...
* backend : add eval callback
ggml-ci
* backend : group nodes in a single compute when user don't need them
* backend : clean-up the implementation
ggml-ci
* simple : do not perform tensor data copy if not needed
* simple : fix
* imatrix : offload to GPU support
* imatrix : fix ggml_mul_mat_id hanlding
ggml-ci
* ci : add imatrix test
ggml-ci
* ci : rearrange output
ggml-ci
2024-01-17 18:46:30 +02:00
Kawrakow
326b418b59
Importance Matrix calculation ( #4861 )
...
* imatrix: 1st version
* imatrix: WIP
* Cleanup
* Update examples/imatrix/imatrix.cpp
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2024-01-12 06:59:57 +01:00