Gerber, Mike 4f28cd905a 🧹 sbb_textline_detector: Do not create empty/space-only TextEquivs
ocrd_tesserocr or ocrd_cis complain about already existing text if
empty/space-only TextEquivs elements exist after segmentation. Also, it
does not make sense to create them in a segmentation step.

Fix by removing the code generating the elements.
5 years ago
qurator 🧹 sbb_textline_detector: Do not create empty/space-only TextEquivs 5 years ago
.gitkeep 🧹 sbb_textline_docker: Rename to sbb_textline_detector 5 years ago
Dockerfile 🧹 sbb_textline_detector: Use same structure as the other projects 5 years ago
README.md 🧹 sbb_textline_docker: Rename to sbb_textline_detector 5 years ago
ocrd-tool.json sbb_textline_detector: Add a OCR-D interface 5 years ago
requirements.txt add missing requirement 5 years ago
setup.py 🐛 sbb_textline_detector: Install *.json 5 years ago

README.md

Textline-Recognition


Installation:

Setup virtual environment:

virtualenv --python=python3.6 venv

Activate virtual environment:

source venv/bin/activate

Upgrade pip:

pip install -U pip

Install package together with its dependencies in development mode:

pip install -e ./

Perform document structure and textline analysis on a scanned document image and save the result as PAGE XML.

Usage

text_line_recognition --help
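
Reading the result

The tool saves its result as PAGE XML, so the detected text lines can be inspected with a few lines of Python. The following is a minimal sketch, not part of this package: the function names text_line_polygons and local_name and the embedded SAMPLE document are made up for illustration, and it assumes the output follows the usual PAGE layout in which each TextLine element has a Coords child with a points attribute. Element names are matched without their namespace, because the PAGE schema version written by the tool is not stated here.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def text_line_polygons(page_xml):
    """Return a list of (line id, [(x, y), ...]) for every TextLine found.

    Assumes the common PAGE layout: TextLine/Coords with points="x,y x,y ...".
    """
    root = ET.fromstring(page_xml)
    polygons = []
    for elem in root.iter():
        if local_name(elem.tag) != "TextLine":
            continue
        for child in elem:
            if local_name(child.tag) == "Coords":
                points = [
                    tuple(int(v) for v in pair.split(","))
                    for pair in child.get("points", "").split()
                ]
                polygons.append((elem.get("id"), points))
                break  # only the line's own outline, not nested elements
    return polygons


# Hypothetical, hand-written PAGE fragment used only to exercise the function.
SAMPLE = """
<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <Page imageFilename="example.png" imageWidth="300" imageHeight="200">
    <TextRegion id="r1">
      <Coords points="5,5 250,5 250,100 5,100"/>
      <TextLine id="l1">
        <Coords points="10,10 200,10 200,40 10,40"/>
      </TextLine>
      <TextLine id="l2">
        <Coords points="10,50 200,50 200,80 10,80"/>
      </TextLine>
    </TextRegion>
  </Page>
</PcGts>
"""

if __name__ == "__main__":
    for line_id, points in text_line_polygons(SAMPLE):
        print(line_id, points)
```

To use it on a real result, read the file produced by the tool and pass its contents to text_line_polygons instead of SAMPLE.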