You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
Gerber, Mike e3a1d1e941 📝 README: Include an example script 5 years ago
ocrd_repair_inconsistencies Check reading direction/textline order rather than assuming 5 years ago
.gitignore 🎉 Initial commit 5 years ago
README.md 📝 README: Include an example script 5 years ago
requirements.txt 🎉 Initial commit 5 years ago
setup.py 🎉 Initial commit 5 years ago

README.md

ocrd_repair_inconsistencies

Automatically fix order inconsistencies in regions, lines and words. Elements are only fixed if reordering their children top-to-bottom/left-to-right fixes the appropriately concatenated text of the children to match the parent's text.

We wrote this as a one-shot script to fix some files. Use with caution.

Example usage

For example, use this fix script:

#!/bin/sh
set -e

tmp_fg=FIXED_$RANDOM

ocrd_repair_inconsistencies -I OCR-D-GT-PAGE -O $tmp_fg

for f in $tmp_fg/*; do
  g="OCR-D-GT-PAGE/OCR-D-GT-PAGE_${f#$tmp_fg/$tmp_fg_}"
  cp $f $g
done

ocrd workspace remove-group -rf $tmp_fg
rmdir $tmp_fg