dinglehopper

dinglehopper is an OCR evaluation tool that reads ALTO, PAGE and plain text files.


Goals

  • Useful
    • As a UI tool
    • For an automated evaluation
    • As a library
  • Unicode support

Installation

It's best to use pip, e.g.:

sudo pip install .

Usage

dinglehopper some-document.gt.page.xml some-document.ocr.alto.xml

This generates report.html and report.json.
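
For automated evaluation, the JSON report can be picked up from a script. The following is a minimal sketch, assuming the `dinglehopper` command is on the PATH and writes report.json into the current working directory as described above. The helper names `run_dinglehopper` and `load_report` are hypothetical, and since the report's keys are not documented here, the sketch does not assume any particular key.

```python
import json
import subprocess
from pathlib import Path


def run_dinglehopper(gt_path, ocr_path):
    """Run the dinglehopper CLI on a ground truth/OCR pair (hypothetical helper).

    Assumption: the command writes report.html and report.json
    into the current working directory.
    """
    # check=True raises CalledProcessError if the tool exits with an error
    subprocess.run(["dinglehopper", str(gt_path), str(ocr_path)], check=True)
    return Path("report.json")


def load_report(report_path):
    """Load a JSON report into a dict without assuming any particular keys."""
    with open(report_path, encoding="utf-8") as f:
        return json.load(f)
```

Calling `load_report(run_dinglehopper("some-document.gt.page.xml", "some-document.ocr.alto.xml"))` would then return the report as a dictionary, whose top-level keys can be inspected with `sorted(report)`.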

As an OCR-D processor:

ocrd-dinglehopper -m mets.xml -I OCR-D-GT-PAGE,OCR-D-OCR-TESS -O OCR-D-OCR-TESS-EVAL

This generates HTML and JSON reports in the OCR-D-OCR-TESS-EVAL filegroup.

Screenshot: dinglehopper displaying metrics and character differences
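
Metrics based on character differences are commonly built on edit distance. As an illustration only, and not dinglehopper's actual implementation, a character error rate can be sketched as the Levenshtein distance between ground truth and OCR text divided by the ground truth length. The function names are hypothetical, and the sketch assumes NFC normalization and counts code points rather than grapheme clusters.

```python
import unicodedata


def levenshtein(a, b):
    """Edit distance between two sequences, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        current = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def character_error_rate(gt, ocr):
    """Edit distance divided by ground truth length (illustrative only)."""
    # Assumption: normalize to NFC so composed and decomposed forms compare equal
    gt = unicodedata.normalize("NFC", gt)
    ocr = unicodedata.normalize("NFC", ocr)
    if not gt:
        return 0.0 if not ocr else float("inf")
    return levenshtein(gt, ocr) / len(gt)
```

With these definitions, `character_error_rate("abcd", "abed")` is 0.25, because one of the four ground truth characters was substituted.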

Testing

Use pytest to run the tests in the tests directory:

virtualenv -p /usr/bin/python3 venv
. venv/bin/activate
pip install -r requirements.txt
pip install pytest
pytest