You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
dinglehopper/qurator/dinglehopper
Gerber, Mike 8b4ee20a40 Add a new CLI tool dinglehopper-extract to just give the extracted text 4 years ago
..
.idea Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector" 5 years ago
notebooks Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector" 5 years ago
templates 🧹 dinglehopper: Disable metrics in JSON report, too 4 years ago
tests dinglehopper: Implement the basic text extraction from PAGE TextLines 4 years ago
.gitignore Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector" 5 years ago
__init__.py 🧹 dinglehopper: Move all normalization code to extracted_text.py 4 years ago
align.py 🎨 dinglehopper: Make PyCharm happier with the type hinting, newlines etc. 5 years ago
character_error_rate.py 🧹 dinglehopper: Sanitize imports 4 years ago
cli.py dinglehopper: Add CLI option to choose TextEquiv level 4 years ago
cli_extract.py Add a new CLI tool dinglehopper-extract to just give the extracted text 4 years ago
config.py dinglehopper: Show a progressbar on --progress 4 years ago
edit_distance.py dinglehopper: Show a progressbar on --progress 4 years ago
extracted_text.py dinglehopper: Give segment ids from the extracted textequiv_level 4 years ago
ocr_files.py dinglehopper: Add OCR-D parameter to choose TextEquiv level 4 years ago
ocrd-tool.json dinglehopper: Add OCR-D parameter to choose TextEquiv level 4 years ago
ocrd_cli.py dinglehopper: Add OCR-D parameter to choose TextEquiv level 4 years ago
word_error_rate.py ➡️ dinglehopper: Move ExtractedText to its own file 4 years ago