mirror of
https://github.com/qurator-spk/sbb_textline_detection.git
synced 2025-10-30 00:44:17 +01:00
No description
ocrd_sbb_textline_detection used the output XML by main.py as is, and – by doing this – threw away any input data from the input PAGE, including the critical pc:AlternativeImage and the less important pc:MetadataItem. Fix this by merging the segmentation results into a file created from the input file. Also add a pc:MetadataItem processingStep about the segmentation operation. |
||
|---|---|---|
| qurator | ||
| .gitkeep | ||
| Dockerfile | ||
| ocrd-tool.json | ||
| README.md | ||
| requirements.txt | ||
| setup.py | ||
Textline-Recognition
Installation:
Setup virtual environment:
virtualenv --python=python3.6 venv
Activate virtual environment:
source venv/bin/activate
Upgrade pip:
pip install -U pip
Install package together with its dependencies in development mode:
pip install -e ./
Perform document structure and textline analysis on a scanned document image and save the result as PAGE XML.
Usage
text_line_recognition --help