Commit Graph

189 Commits (9e80926eb298173d87f442b8a46895cc84f811d7)
 

Author SHA1 Message Date
Gerber, Mike 7025d960b4 Use ocrd_olena for binarization 5 years ago
Gerber, Mike 63c364207c 💩 Add a funny workaround to get git-annex to give us our files 5 years ago
Gerber, Mike 33e25641f2 ⬆ Update sbb_textline_detector 5 years ago
Gerber, Mike 2b67f5feb4 ⬆ Update sbb_textline_detector 5 years ago
Gerber, Mike 3687d6d7b4 🧹 Do not remove line confidences anymore 5 years ago
Gerber, Mike 6454d20998 Use sbb_textline_detector to segment lines 5 years ago
Gerber, Mike 735e9599d7 🐛 ocrd-bugs: Most/All workspaces in bag files don't validate 5 years ago
Gerber, Mike 0f8f1d814b 🐛 Mkdir robustly 5 years ago
Gerber, Mike bdab016e2c Use GT4HistOCR_2000000 model from qurator-data for Tesseract 5 years ago
Gerber, Mike 57ff3fc19b ⬆ Update data 5 years ago
Gerber, Mike ff2cc50aed ⬆ Update dinglehopper (substitutions) 5 years ago
Gerber, Mike 0c5ed94892 ⬆ Update dinglehopper (to fix NFC trouble + substitutions) 5 years ago
Gerber, Mike 1dde641d5a ⬆ Update dinglehopper (to fix text alignment) 5 years ago
Gerber, Mike 47dd5d3b62 🎨 Move XML schemata to a better path 5 years ago
Gerber, Mike 02457155aa ⬆ Update dinglehopper (to fix reading order) 5 years ago
Gerber, Mike af2034400a 🎨 Add extra newlines to separate steps 5 years ago
Gerber, Mike 1863439d92 💩 Remove extra Pillow dependency workarounds 5 years ago
Gerber, Mike 81b7e5458c 💩 Install Pillow 5.4.1 because pip does not have a dependency resolver 5 years ago
Gerber, Mike 224762e1bb 🐛 Let ocrd_calamari handle the weird setuptools depencency 5 years ago
Gerber, Mike a272237bd8 ⬆ Update ocrd dependency 5 years ago
Gerber, Mike 2b9cab1a1a ⬆ Update ocrd_calamari 5 years ago
Gerber, Mike e5cd5b937e Run pip3 list for easier checking 5 years ago
Gerber, Mike bd24624bd7 ⬆ Do not downgrade to PAGE 2018 anymore 5 years ago
Gerber, Mike 0b2b66a0b4 🔧 Allow setting LOG_LEVEL 5 years ago
Gerber, Mike d6c38b5b9f 🧹 Do not install extra tesserocr 5 years ago
Gerber, Mike f19bba45b8 💩 Remove mysterious TEMP directory for now 5 years ago
Gerber, Mike 68902f923d 📜 Downgrading to PAGE 2018 is not the last step anymore 5 years ago
Gerber, Mike 6c0d7e0aee 💩 Do not fix PAGE image references for now 5 years ago
Gerber, Mike e3bf65b502 ⬆ Update dinglehopper 5 years ago
Gerber, Mike 87968cd297 🧹 README: Move TODO to my usual TODO list 5 years ago
Gerber, Mike debecf71b9 💩 Install the right Pillow version manually... 5 years ago
Gerber, Mike a3d6befb0d 🏗 Build Tesseract from source 5 years ago
Gerber, Mike d903e3634c 📝 README: Clarify workspace TODO 5 years ago
Gerber, Mike 8a04602044 📝 README: Not using podman anymore 5 years ago
Gerber, Mike 0782dbde32 ⬆ Update dinglehopper 5 years ago
Gerber, Mike 343a3fbf82 🔧 Evaluate both Tesseract and Calamari results 5 years ago
Gerber, Mike 0bc06c2fad Run Calamari OCR 5 years ago
Gerber, Mike 001e62f54a 🔧 Use docker, not podman 5 years ago
Gerber, Mike daed87566e 🚑 Don't install typegroups classifier for now 5 years ago
Gerber, Mike d8f3438ac5 🚑 Don't check pixel density 5 years ago
Gerber, Mike b169f35bb1 🔧 Build container with cache again 5 years ago
Gerber, Mike 85ff80d548 Use dinglehopper's new OCR-D interface 5 years ago
Gerber, Mike d5aa273b44 🚧 Use ocr-eval aka dinglehopper 5 years ago
Gerber, Mike be5750f4e1 As a last step, downgrade to PAGE 2018 to support PAGE Viewer 5 years ago
Gerber, Mike cf2b4de2a0 🧹 Validate again after fixing image references 5 years ago
Gerber, Mike 21e00932be 🐛 Use a valid filegrp USE for fontident 5 years ago
Gerber, Mike ade39a278c 🎨 Align file groups 5 years ago
Gerber, Mike 3fee2d4fe6 📌 Use my ocrd_typegroups_classifier fix for passing down the page id 5 years ago
Gerber, Mike 44772f1923 🚧 Work around problems with ocrd-tesserocr producing TextEquiv/@conf 5 years ago
Gerber, Mike 8b67866aac Validate PAGE XML after OCR 5 years ago