Commit Graph

552 Commits (aabcafe46f599989bf18922aaa06c360622b3223)
 

Author SHA1 Message Date
Gerber, Mike d3b6974316 💩 I hate pip to much 4 years ago
Gerber, Mike 037f64518f ⬆️ Update dinglehopper 4 years ago
Gerber, Mike c2946395e3 🐛 Fix downloading repacked tessdata_best 4 years ago
Gerber, Mike daa8095a25 ⬆️ Update ocrd_calamari to fix pcGtsId (also ocrd for good measure) 4 years ago
Gerber, Mike bf7c6abcbf ⬆️ Update sbb_textline_detector to fix pcGtsId issue 4 years ago
Gerber, Mike 73a125d893 ⬆️ Update ocrd_tesserocr to fix pcGtsId issue 4 years ago
Gerber, Mike 0b1da9a5db 🧹 Update Calamari model path 4 years ago
Gerber, Mike c5536d3722 💩 Increase pip default timeout 4 years ago
Gerber, Mike b75d17e42c Merge branch 'master' of github.com:mikegerber/my_ocrd_workflow 4 years ago
Gerber, Mike 72693071e0 ⚙️ Get tessdata_best from git-annex/our mirror to make downloads more robust 4 years ago
Gerber, Mike 6ae31395a3 ⚙️ Configure retries for apt-get/curl to make downloads more robust 4 years ago
Gerber, Mike b0b1c021a8 🧹 Update Calamari model path 4 years ago
Gerber, Mike 31c36eb534 ⬆️ Update qurator_data_lib.sh to allow not unpacking a downloaded file 4 years ago
Gerber, Mike 3f511bc8e3 🗒️ README: Break jpageviewer line 4 years ago
Gerber, Mike d1a2bfe669 🐛 Deal with ocrd_olena >= 1.2.0 using one output file group only 4 years ago
Gerber, Mike 7111d28f9b ⬆️ ocrd_olena → 1.2.0 4 years ago
Gerber, Mike 1a308a5522 🧹 Use OCR-D's -P, remove now redundant validation and remove now unnecessary functions 4 years ago
Gerber, Mike efd955c04f 🧹 Modernize my_ocrd_workflow and use OCR-D's new --overwrite 4 years ago
Gerber, Mike 3af3c6dd00 ⬆️ Update qurator_data_lib.sh 4 years ago
Gerber, Mike 0f8b2d82d5 🧹 Travis: Comment out transfer.sh output 4 years ago
Gerber, Mike 032f58e4b8 Merge branch 'master' of https://github.com/mikegerber/my_ocrd_workflow 4 years ago
Gerber, Mike 131e862762 💩 Travis: Skip validation until https://github.com/OCR-D/ocrd_olena/issues/60 is fixed 4 years ago
Gerber, Mike 7fe2ce84b5 🐛 Update sbb_textline_detector to fix Keras/TF issue 4 years ago
Gerber, Mike ef3a8a69e0 ⬆️ Update ocrd_olena 4 years ago
Gerber, Mike f7b43bbefa ppn2ocr: Support TIFF in the BEST group 4 years ago
Gerber, Mike 4e37a52899 Merge branch 'master' of github.com:mikegerber/my_ocrd_workflow 4 years ago
Gerber, Mike bb703152db 🐛 ppn2ocr: Verify oai.sbb.berlin's certificate again
Now that oai.sbb.berlin's certificate chain is fixed, remove the
workaround again.

Fixes GH#15.
4 years ago
Gerber, Mike c5ae23d2ef Validate before even starting, to find data problems 4 years ago
Gerber, Mike f7b0b4121d ⁉️ Check dependencies using pipdeptree to triage Travis build fail 4 years ago
Gerber, Mike c334b1e7ac 🧹 Move check-FULLTEXT-Page-dimensions-vs-BEST-dimensions.py code to mono-repo/experiments 4 years ago
Gerber, Mike af4557fb33 Merge branch 'master' of https://github.com/mikegerber/my_ocrd_workflow 4 years ago
Gerber, Mike 0aa541fa18 📓 README: Reference howto/*proxy*.md instead of duplicating the proxy settings 4 years ago
Gerber, Mike 3f4ec30349 🧹 .gitignore __pychache__/*.pyc 4 years ago
Gerber, Mike f98a1ec2c8 🐛 run: XXX Work around podman vs docker uid behaviour 4 years ago
Gerber, Mike 746fb768da 🚧 Add a script that checks FULLTEXT dimensions against BEST dimensions 4 years ago
Gerber, Mike d2c316285c 🧹 ppn2ocr: Remove obsolete show_help() 4 years ago
Gerber, Mike f5b2eed8a6 🐛 ppn2ocr: Work around oai.sbb.berlin certificate problem
oai.sbb.berlin does not have a valid certificate:

% curl https://oai.sbb.berlin
curl: (60) SSL certificate problem: unable to get local issuer certificate
More details here: https://curl.haxx.se/docs/sslcerts.html

curl failed to verify the legitimacy of the server and therefore could not
establish a secure connection to it. To learn more about this situation and
how to fix it, please visit the web page mentioned above.

Work around this by setting verify=False.
4 years ago
Gerber, Mike 448bf9e256 🐛 ppn2ocr: Remove LOCAL file group too 4 years ago
Gerber, Mike 4e19e2a655 💄 ppn2ocr: Add a proper CLI interface 4 years ago
Gerber, Mike 70eb73e4c7 🧹 ppn2ocr: (Re)Move TODOs 4 years ago
Gerber, Mike 6c74672916 🚧 ppn2ocr: Update README to use the correct path to requirements-ppn2ocr.txt 4 years ago
Gerber, Mike 9b7437601f 🚧 ppn2ocr: Update README 4 years ago
Gerber, Mike 3e0b7436a5 🚧 ppn2ocr: Add requirements-ppn2ocr.txt 4 years ago
Gerber, Mike 05dbffeb7a 🚧 ppn2ocr: Do not call workflow for now 4 years ago
Gerber, Mike 10f5198fa6 🚧 ppn2ocr: s/contain/encapsulate 4 years ago
Gerber, Mike f893b339c5 🚧 ppn2ocr: Properly remove the PRESENTATION file group 4 years ago
Gerber, Mike 014e70fe35 🚧 ppn2ocr: Actually run the workflow 4 years ago
Gerber, Mike 74cb361723 🚧 ppn2ocr: Extract a function to contain the IIIF hack 4 years ago
Gerber, Mike c7c8934e89 🚧 ppn2ocr: Convert to Python + fumble in IIIF URLs 4 years ago
Gerber, Mike 7c5cbc7244 📝 ppn2ocr: Add to README, including proxy configuration 5 years ago