koboldcpp/examples/gguf-split
Concedo 9a25d77cc1 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/build.yml
#	.github/workflows/docker.yml
#	Makefile
#	README-sycl.md
#	README.md
#	ci/run.sh
#	ggml-cuda.cu
#	ggml.c
#	grammars/README.md
#	scripts/get-wikitext-2.sh
#	scripts/hf.sh
#	scripts/sync-ggml.last
#	tests/test-backend-ops.cpp
#	tests/test-grammar-integration.cpp
#	tests/test-json-schema-to-grammar.cpp
2024-04-14 21:18:39 +08:00
..
CMakeLists.txt gguf-split: split and merge gguf per batch of tensors (#6135) 2024-03-19 12:05:44 +01:00
gguf-split.cpp Merge branch 'upstream' into concedo_experimental 2024-04-14 21:18:39 +08:00
README.md Fix --split-max-size (#6655) 2024-04-14 13:12:59 +02:00
tests.sh Fix --split-max-size (#6655) 2024-04-14 13:12:59 +02:00

GGUF split Example

CLI to split / merge GGUF files.

Command line options:

  • --split: split GGUF to multiple GGUF, default operation.
  • --split-max-size: max size per split in M or G, f.ex. 500M or 2G.
  • --split-max-tensors: maximum tensors in each split: default(128)
  • --merge: merge multiple GGUF to a single GGUF.