mirror of https://github.com/qurator-spk/sbb_textline_detection.git synced 2026-07-08 12:29:10 +02:00

Gerber, Mike 4aed06a325 ✨ sbb_textline_detection: Preserve input PAGE info by merging segmentation results ocrd_sbb_textline_detection used the output XML by main.py as is, and – by doing this – threw away any input data from the input PAGE, including the critical pc:AlternativeImage and the less important pc:MetadataItem. Fix this by merging the segmentation results into a file created from the input file. Also add a pc:MetadataItem processingStep about the segmentation operation.		2019-11-19 15:08:53 +01:00
qurator	✨ sbb_textline_detection: Preserve input PAGE info by merging segmentation results	2019-11-19 15:08:53 +01:00
.gitkeep	🧹 sbb_textline_docker: Rename to sbb_textline_detector	2019-10-10 16:13:07 +02:00
Dockerfile	🧹 sbb_textline_detector: Use same structure as the other projects	2019-10-10 16:24:28 +02:00
ocrd-tool.json	✨ sbb_textline_detector: Add a OCR-D interface	2019-10-10 17:54:42 +02:00
README.md	🧹 sbb_textline_docker: Rename to sbb_textline_detector	2019-10-10 16:13:07 +02:00
requirements.txt	✨ sbb_textline_detection: Preserve input PAGE info by merging segmentation results	2019-11-19 15:08:53 +01:00
setup.py	🐛 sbb_textline_detector: Install *.json	2019-10-11 16:18:10 +02:00

Textline-Recognition

Installation:

Set up a virtual environment:

virtualenv --python=python3.6 venv

Activate the virtual environment:

source venv/bin/activate

Upgrade pip:

pip install -U pip

Install the package together with its dependencies in development mode:

pip install -e ./

Perform document structure and textline analysis on a scanned document image and save the result as PAGE XML. To list the available options, run:

text_line_recognition --help
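
After installation, it can be useful to confirm from a script that the command is actually reachable, for example when the virtual environment was not activated. The following is a minimal sketch, not part of this repository: the helper name `show_help` is hypothetical, and it assumes only that `pip install -e ./` placed a `text_line_recognition` executable on the `PATH` of the active environment.

```python
import shutil
import subprocess


def show_help(command="text_line_recognition"):
    """Return the --help text of `command`, or None if it is not on PATH.

    Hypothetical helper for illustration; it only runs `<command> --help`
    and makes no assumption about which options the tool accepts.
    """
    path = shutil.which(command)
    if path is None:
        # Command not found: the virtual environment is probably not
        # activated, or the package has not been installed yet.
        return None
    # stdout/stderr pipes and universal_newlines keep this compatible
    # with Python 3.6, the version used in the installation steps.
    result = subprocess.run(
        [path, "--help"],
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        universal_newlines=True,
    )
    # Some tools print help to stderr; fall back to it if stdout is empty.
    return result.stdout or result.stderr


if __name__ == "__main__":
    text = show_help()
    if text is None:
        print("text_line_recognition not found; activate venv and install first")
    else:
        print(text)
```

A return value of `None` points at the environment (activation or installation) rather than at the tool itself, which separates the two failure cases before any image is processed.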