You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Go to file
Mike Gerber a1007fad15
dinglehopper: Merge pull request #1 from cneud/cneud-xml-links
add links to supported XML formats
5 years ago
.screenshots 📝 dinglehopper: Add screenshot 5 years ago
qurator dinglehopper: Add OCR-D interface 5 years ago
.travis.yml dinglehopper: Add Travis configuration 5 years ago
README.md dinglehopper: Merge pull request #1 from cneud/cneud-xml-links 5 years ago
pytest.ini 🧹 dinglehopper: Move pytest.ini 5 years ago
requirements.txt dinglehopper: Add OCR-D interface 5 years ago
setup.py dinglehopper: Add OCR-D interface 5 years ago

README.md

dinglehopper

dinglehopper is an OCR evaluation tool and reads ALTO, PAGE and text files.

Travis CI badge

Goals

  • Useful
    • As an UI tool
    • For an automated evaluation
    • As a library
  • Unicode support

Usage

As a OCR-D processor:

ocrd-dinglehopper -m mets.xml -I OCR-D-GT-PAGE,OCR-D-OCR-TESS -O OCR-D-OCR-TESS-EVAL

This generates HTML and JSON reports in the OCR-D-OCR-TESS-EVAL filegroup.

dinglehopper displaying metrics and character differences