No description
Find a file
Gerber, Mike 4f28cd905a 🧹 sbb_textline_detector: Do not create empty/space-only TextEquivs
ocrd_tesserocr or ocrd_cis complain about already existing text if
empty/space-only TextEquivs elements exist after segmentation. Also, it
does not make sense to create them in a segmentation step.

Fix by removing the code generating the elements.
2019-10-25 18:08:31 +02:00
qurator 🧹 sbb_textline_detector: Do not create empty/space-only TextEquivs 2019-10-25 18:08:31 +02:00
.gitkeep 🧹 sbb_textline_docker: Rename to sbb_textline_detector 2019-10-10 16:13:07 +02:00
Dockerfile 🧹 sbb_textline_detector: Use same structure as the other projects 2019-10-10 16:24:28 +02:00
ocrd-tool.json sbb_textline_detector: Add a OCR-D interface 2019-10-10 17:54:42 +02:00
README.md 🧹 sbb_textline_docker: Rename to sbb_textline_detector 2019-10-10 16:13:07 +02:00
requirements.txt add missing requirement 2019-10-19 11:15:59 +02:00
setup.py 🐛 sbb_textline_detector: Install *.json 2019-10-11 16:18:10 +02:00

Textline-Recognition


Installation:

Setup virtual environment:

virtualenv --python=python3.6 venv

Activate virtual environment:

source venv/bin/activate

Upgrade pip:

pip install -U pip

Install package together with its dependencies in development mode:

pip install -e ./

Perform document structure and textline analysis on a scanned document image and save the result as PAGE XML.

Usage

text_line_recognition --help