eynollah/docs/docker.md

44 lines
1.5 KiB
Markdown
Raw Normal View History

2025-10-20 22:16:56 +02:00
## Inference with Docker
docker pull ghcr.io/qurator-spk/eynollah:latest
### 1. ocrd resource manager
2025-10-17 19:19:26 +02:00
(just once, to get the models and install them into a named volume for later re-use)
vol_models=ocrd-resources:/usr/local/share/ocrd-resources
docker run --rm -v $vol_models ocrd/eynollah ocrd resmgr download ocrd-eynollah-segment default
Now, each time you want to use Eynollah, pass the same resources volume again.
Also, bind-mount some data directory, e.g. current working directory $PWD (/data is default working directory in the container).
2025-10-20 22:16:56 +02:00
2025-10-17 19:19:26 +02:00
Either use standalone CLI (2) or OCR-D CLI (3):
2025-10-20 22:16:56 +02:00
### 2. standalone CLI
(follow self-help, cf. readme)
2025-10-17 19:19:26 +02:00
docker run --rm -v $vol_models -v $PWD:/data ocrd/eynollah eynollah binarization --help
docker run --rm -v $vol_models -v $PWD:/data ocrd/eynollah eynollah layout --help
docker run --rm -v $vol_models -v $PWD:/data ocrd/eynollah eynollah ocr --help
2025-10-20 22:16:56 +02:00
### 3. OCR-D CLI
(follow self-help, cf. readme and https://ocr-d.de/en/spec/cli)
2025-10-17 19:19:26 +02:00
docker run --rm -v $vol_models -v $PWD:/data ocrd/eynollah ocrd-eynollah-segment -h
docker run --rm -v $vol_models -v $PWD:/data ocrd/eynollah ocrd-sbb-binarize -h
Alternatively, just "log in" to the container once and use the commands there:
2025-10-20 22:16:56 +02:00
docker run --rm -v $vol_models -v $PWD:/data -it ocrd/eynollah bash
## Training with Docker
2025-10-20 23:19:10 +02:00
Build the Docker training image
2025-10-20 22:16:56 +02:00
cd train
docker build -t model-training .
2025-10-20 23:19:10 +02:00
Run the Docker training image
2025-10-20 22:16:56 +02:00
cd train
docker run --gpus all -v $PWD:/entry_point_dir model-training