dinglehopper/qurator/dinglehopper
Benjamin Rosemann a68fc269d9 Fix the extraction of text from Page with TableRegion
Dinglehopper did not consider `OrderedGroupIndex` in the `ReadingOrder`
element when extracting text regions. As a consequence a `TableRegion`
was not considered for text extraction.
4 years ago
notebooks Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector" 5 years ago
templates 🧹 dinglehopper: Disable metrics in JSON report, too 4 years ago
tests Fix the extraction of text from Page with TableRegion 4 years ago
__init__.py 🧹 dinglehopper: Move all normalization code to extracted_text.py 4 years ago
align.py 🎨 dinglehopper: Reformat using black 4 years ago
character_error_rate.py 🎨 dinglehopper: Reformat using black 4 years ago
cli.py 🎨 dinglehopper: Reformat using black 4 years ago
cli_extract.py 🎨 dinglehopper: Reformat using black 4 years ago
config.py dinglehopper: Show a progressbar on --progress 4 years ago
edit_distance.py 🎨 dinglehopper: Reformat using black 4 years ago
extracted_text.py 🎨 dinglehopper: Reformat using black 4 years ago
ocr_files.py Fix the extraction of text from Page with TableRegion 4 years ago
ocrd-tool.json dinglehopper: Add OCR-D parameter to choose TextEquiv level 4 years ago
ocrd_cli.py ocrd cli: use core-provided zip_input_files method 4 years ago
word_error_rate.py 🎨 dinglehopper: Reformat using black 4 years ago