mirror of
https://github.com/qurator-spk/sbb_textline_detection.git
synced 2025-06-30 22:20:00 +02:00
No description
ocrd_sbb_textline_detection used the output XML by main.py as is, and – by doing this – threw away any input data from the input PAGE, including the critical pc:AlternativeImage and the less important pc:MetadataItem. Fix this by merging the segmentation results into a file created from the input file. Also add a pc:MetadataItem processingStep about the segmentation operation. |
||
---|---|---|
qurator | ||
.gitkeep | ||
Dockerfile | ||
ocrd-tool.json | ||
README.md | ||
requirements.txt | ||
setup.py |
Textline-Recognition
Installation:
Setup virtual environment:
virtualenv --python=python3.6 venv
Activate virtual environment:
source venv/bin/activate
Upgrade pip:
pip install -U pip
Install package together with its dependencies in development mode:
pip install -e ./
Perform document structure and textline analysis on a scanned document image and save the result as PAGE XML.
Usage
text_line_recognition --help