From 612d44b0741cdb7b2f1402e6a222783a16960a6a Mon Sep 17 00:00:00 2001 From: "Gerber, Mike" Date: Fri, 22 May 2020 13:49:34 +0200 Subject: [PATCH] =?UTF-8?q?=F0=9F=9A=A7=20zdb2ocr:=20Add=20TODOs=20from=20?= =?UTF-8?q?notes.md?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- zdb2ocr | 8 ++++++++ 1 file changed, 8 insertions(+) diff --git a/zdb2ocr b/zdb2ocr index 1091cd4..d779242 100755 --- a/zdb2ocr +++ b/zdb2ocr @@ -21,3 +21,11 @@ ocrd workspace validate mets.xml | grep -v "Won't download remote image" $self_dir/run-docker-hub -I MAX --skip-validation + + +# * TODO: Error on invocation +# * TODO: Check out options to get better image resolutions +# * TODO: Are input images already grayscale? Further binarization makes them +# worse than before +# * TODO: Does this loose the image URLs for the MAX filegroup? +# * TODO: Lots of text problems with ocrd_calamari "not the same as Calamari"