kvcache-ai-ktransformers/doc/en/Docker.md

848 B
Raw Blame History

Docker

Prerequisites

  • Docker must be installed and running on your system.
  • Create a folder to store big models & intermediate files (ex. /mnt/models)

Images

There are Docker images available for our project

Uploading

Building docker image locally

  • Download Dockerfile in there

  • finish, execute

    docker build  -t approachingai/ktransformers:v0.1.1 .
    

Usage

Assuming you have the nvidia-container-toolkit that you can use the GPU in a Docker container.

docker run --gpus all -v /path/to/models:/models -p 10002:10002 approachingai/ktransformers:v0.1.1 --port 10002 --gguf_path /models/path/to/gguf_path --model_path /models/path/to/model_path --web True

More operators you can see in the readme