You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Go to file
JKamlah c36396302d FIX problem with creation of hash, instead of merging the strings each string get's an own hash. Adding os.path.join. 6 years ago
.screenshots 📝 dinglehopper: Add screenshot 6 years ago
qurator FIX problem with creation of hash, instead of merging the strings each string get's an own hash. Adding os.path.join. 6 years ago
.travis.yml dinglehopper: Add Travis configuration 6 years ago
LICENSE 📝 dinglehopper: Add a LICENSE file 6 years ago
README.md Fix typo in README 6 years ago
ocrd-tool.json symlink ocrd-tool.json to repo root 6 years ago
pytest.ini 🧹 dinglehopper: Move pytest.ini 6 years ago
requirements.txt dinglehopper: Add OCR-D interface 6 years ago
setup.py 📝 dinglehopper: Update author info 6 years ago

README.md

dinglehopper

dinglehopper is an OCR evaluation tool and reads ALTO, PAGE and text files.

Build Status

Goals

  • Useful
    • As a UI tool
    • For an automated evaluation
    • As a library
  • Unicode support

Usage

dinglehopper some-document.gt.page.xml some-document.ocr.alto.xml

This generates report.html and report.json.

As a OCR-D processor:

ocrd-dinglehopper -m mets.xml -I OCR-D-GT-PAGE,OCR-D-OCR-TESS -O OCR-D-OCR-TESS-EVAL

This generates HTML and JSON reports in the OCR-D-OCR-TESS-EVAL filegroup.

dinglehopper displaying metrics and character differences