mirror of https://github.com/qurator-spk/dinglehopper.git synced 2025-06-18 16:19:59 +02:00
Gerber, Mike f94e8b9b1c Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector"
This reverts commit a3c1eee8f31349edcfb1e36920763bcecceb1129, reversing
changes made to dc76213ffc1fbabc2c45f0e52ced55449bdf2e83.
2019-12-09 12:44:05 +01:00

dinglehopper

dinglehopper is an OCR evaluation tool that reads ALTO, PAGE, and plain text files.

Goals

  • Useful
    • As a UI tool
    • For an automated evaluation
    • As a library
  • Unicode support

Installation

It's best to use pip, e.g.:

sudo pip install .

Usage

dinglehopper some-document.gt.page.xml some-document.ocr.alto.xml

This generates report.html and report.json.
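For automated evaluation, the command above can be driven from a script. The following is a minimal sketch, not part of dinglehopper itself: the helper names `run_dinglehopper` and `load_report` are hypothetical, and it assumes the `dinglehopper` executable is on the PATH and writes `report.json` into the current working directory. It makes no assumption about the keys inside the JSON report.

```python
import json
import subprocess
from pathlib import Path


def run_dinglehopper(gt_file, ocr_file, workdir=".", executable="dinglehopper"):
    """Run the CLI on a ground truth / OCR pair and return the report path.

    Assumption: the tool writes report.json into the working directory.
    """
    workdir = Path(workdir)
    subprocess.run(
        # Resolve the inputs first, because the command runs inside workdir.
        [executable, str(Path(gt_file).resolve()), str(Path(ocr_file).resolve())],
        cwd=workdir,
        check=True,  # raise CalledProcessError if the tool exits non-zero
    )
    return workdir / "report.json"


def load_report(report_path):
    """Load the JSON report as plain Python data."""
    with open(report_path, encoding="utf-8") as f:
        return json.load(f)
```

A call such as `load_report(run_dinglehopper("some-document.gt.page.xml", "some-document.ocr.alto.xml"))` would then return the report contents for further processing.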

As an OCR-D processor:

ocrd-dinglehopper -m mets.xml -I OCR-D-GT-PAGE,OCR-D-OCR-TESS -O OCR-D-OCR-TESS-EVAL

This generates HTML and JSON reports in the OCR-D-OCR-TESS-EVAL filegroup.

[Screenshot: dinglehopper displaying metrics and character differences]
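To give a rough idea of what a character-level metric involves, here is a sketch of a character error rate based on plain Levenshtein distance. It is an illustration only and not dinglehopper's implementation: the function names are hypothetical, the NFC normalization step is an assumption, and the tool's own Unicode handling may differ.

```python
import unicodedata


def levenshtein(a, b):
    """Edit distance between two sequences (each edit costs 1)."""
    previous = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        current = [i]
        for j, y in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,             # delete x
                current[j - 1] + 1,          # insert y
                previous[j - 1] + (x != y),  # substitute, or match at no cost
            ))
        previous = current
    return previous[-1]


def character_error_rate(reference, hypothesis):
    """Edit distance divided by the length of the reference text."""
    # Assumption: normalize to NFC so composed and decomposed forms compare equal.
    reference = unicodedata.normalize("NFC", reference)
    hypothesis = unicodedata.normalize("NFC", hypothesis)
    if not reference:
        return 0.0 if not hypothesis else float("inf")
    return levenshtein(reference, hypothesis) / len(reference)
```

Here `reference` is the ground truth text and `hypothesis` is the OCR output; one wrong character in a four-character reference yields 0.25.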

Testing

Use pytest to run the tests in the tests directory:

virtualenv -p /usr/bin/python3 venv
. venv/bin/activate
pip install -r requirements.txt
pip install pytest
pytest