Commit Graph

87 Commits (4cdfdeb5fe25232c80af8e49794a678d4a44adaf)

Author SHA1 Message Date
Gerber, Mike 58282c9e95 Include glyph output 5 years ago
Gerber, Mike 11a30892c5 🔍 Only do pip3 list when LOG_LEVEL >= DEBUG 5 years ago
Gerber, Mike 9f111ca362 🧹 Do not validate OCR results twice 5 years ago
Gerber, Mike 8ca25f3c56 🎨 Expose OCR textequiv_level as a environment variable 5 years ago
Gerber, Mike 979c7044a8 Make OCR-D-IMG-BIN output group explicit 5 years ago
Gerber, Mike 28bb482ceb Produce word results 5 years ago
Gerber, Mike 6ae85063c5 📝 Document do_validate() options better 5 years ago
Gerber, Mike 2cf68f149d ♻ Extract a main() function for the main stuff 5 years ago
Gerber, Mike be0a0c353a 📝 Document the two remaining un-documented functions 5 years ago
Gerber, Mike 848dd143fd 🎨 Use long command lines again 5 years ago
Gerber, Mike 6b83d5ae1e 🧹 Update/move some XXXs/TODOs 5 years ago
Gerber, Mike 5a55598d0c 🧹 Remove image reference fixing remnants - jpageviewer now has --resolve-dir 5 years ago
Gerber, Mike 44979e7fa2 🧹 do_linesegmentation_sbb: It's now clear that sbb segmentation works with RGB images 5 years ago
Gerber, Mike 460b6c34d1 ✏ Fix typo in $ocrd_olena_binarize_parameters 5 years ago
Gerber, Mike 71d54c6978 🔧 Set up logging level using /etc/ocrd_logging.py instead of "-l" 5 years ago
Gerber, Mike 1a538dce1a 🧹 Remove superfluous mets.xml options 5 years ago
Gerber, Mike c192bfdbfe 🧹 Remove workaround for TEMP/ directory bug 5 years ago
Gerber, Mike d7a2aac44b ♻ Remove file groups using "ocrd workspace remove-group" 5 years ago
Gerber, Mike c8039db686 🎨 Put validate options into a variable 5 years ago
Gerber, Mike 5ece7f1b0a 🧹 Remove remnants of ocrd-ocropy-segment 5 years ago
Gerber, Mike 135489eaeb 🧹 Remove page_downgrade_to_2018 5 years ago
Gerber, Mike 423d9c2ed6 🚧 do_validate: Skip dimension checking 5 years ago
Gerber, Mike 948e9074df ⬆ Update to ocrd_calamari 0.0.4 5 years ago
Gerber, Mike 1ef850992c 🎨 Use same style of specifying parameters for all processors 5 years ago
Gerber, Mike b468d688f2 🧹 Remove font identification for now 5 years ago
Gerber, Mike 07555e8270 🎨 Use new OCR-D JSON string parameters 5 years ago
Gerber, Mike 9c31d604e9 ⬆ Update ocrd-sbb-textline-detector command 5 years ago
Gerber, Mike fd56731464 🚧 Do not check PAGE coordinates for now 5 years ago
Gerber, Mike 87a2bce93c ⬆ Update calamari-models URL + path 5 years ago
Gerber, Mike d166077a55 Update to sbb_textline_detector with the fixed AlternativeImage support (= merged PAGE results) 5 years ago
Gerber, Mike de47a3e5b1 🔥 Remove now unused page_fix_image_references() 5 years ago
Gerber, Mike 1af18c629e 🧹 Validate imagefilename again 5 years ago
Gerber, Mike de49aa715b ⬆ Update to OCR-D 1.0.0 5 years ago
Gerber, Mike 7025d960b4 Use ocrd_olena for binarization 5 years ago
Gerber, Mike 3687d6d7b4 🧹 Do not remove line confidences anymore 5 years ago
Gerber, Mike 6454d20998 Use sbb_textline_detector to segment lines 5 years ago
Gerber, Mike bdab016e2c Use GT4HistOCR_2000000 model from qurator-data for Tesseract 5 years ago
Gerber, Mike 47dd5d3b62 🎨 Move XML schemata to a better path 5 years ago
Gerber, Mike af2034400a 🎨 Add extra newlines to separate steps 5 years ago
Gerber, Mike 1863439d92 💩 Remove extra Pillow dependency workarounds 5 years ago
Gerber, Mike e5cd5b937e Run pip3 list for easier checking 5 years ago
Gerber, Mike bd24624bd7 ⬆ Do not downgrade to PAGE 2018 anymore 5 years ago
Gerber, Mike 0b2b66a0b4 🔧 Allow setting LOG_LEVEL 5 years ago
Gerber, Mike f19bba45b8 💩 Remove mysterious TEMP directory for now 5 years ago
Gerber, Mike 68902f923d 📜 Downgrading to PAGE 2018 is not the last step anymore 5 years ago
Gerber, Mike 6c0d7e0aee 💩 Do not fix PAGE image references for now 5 years ago
Gerber, Mike 343a3fbf82 🔧 Evaluate both Tesseract and Calamari results 5 years ago
Gerber, Mike 0bc06c2fad Run Calamari OCR 5 years ago
Gerber, Mike daed87566e 🚑 Don't install typegroups classifier for now 5 years ago
Gerber, Mike d8f3438ac5 🚑 Don't check pixel density 5 years ago