You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
1.2 KiB
1.2 KiB
Binarization
Binarization for document images
Examples
Introduction
This tool performs document image binarization using trained models. The method is based on Calvo-Zaragoza and Gallego, 2018.
Installation
Clone the repository, enter it and run
pip install .
Models
Pre-trained models can be downloaded from here:
https://qurator-data.de/sbb_binarization/
Usage
sbb_binarize \
-m <path to directory containing model files> \
<input image> \
<output image>
Example
sbb_binarize -m /path/to/models/ myimage.tif myimage-bin.tif
To use the OCR-D interface:
ocrd-sbb-binarize --overwrite -I INPUT_FILE_GRP -O OCR-D-IMG-BIN -P model "/var/lib/sbb_binarization"