Commit Graph

525 Commits (9d2ebd9cecae851e49c5da60e8547a23a2f0087f)
 

Author SHA1 Message Date
Gerber, Mike c5ae23d2ef Validate before even starting, to find data problems
Gerber, Mike f7b0b4121d ⁉️ Check dependencies using pipdeptree to triage Travis build fail
Gerber, Mike c334b1e7ac 🧹 Move check-FULLTEXT-Page-dimensions-vs-BEST-dimensions.py code to mono-repo/experiments
Gerber, Mike af4557fb33 Merge branch 'master' of https://github.com/mikegerber/my_ocrd_workflow
Gerber, Mike 0aa541fa18 📓 README: Reference howto/*proxy*.md instead of duplicating the proxy settings
Gerber, Mike 3f4ec30349 🧹 .gitignore __pychache__/*.pyc
Gerber, Mike f98a1ec2c8 🐛 run: XXX Work around podman vs docker uid behaviour
Gerber, Mike 746fb768da 🚧 Add a script that checks FULLTEXT dimensions against BEST dimensions
Gerber, Mike d2c316285c 🧹 ppn2ocr: Remove obsolete show_help()
Gerber, Mike f5b2eed8a6 🐛 ppn2ocr: Work around oai.sbb.berlin certificate problem
oai.sbb.berlin does not have a valid certificate:

% curl https://oai.sbb.berlin
curl: (60) SSL certificate problem: unable to get local issuer certificate
More details here: https://curl.haxx.se/docs/sslcerts.html

curl failed to verify the legitimacy of the server and therefore could not
establish a secure connection to it. To learn more about this situation and
how to fix it, please visit the web page mentioned above.

Work around this by setting verify=False.
Gerber, Mike 448bf9e256 🐛 ppn2ocr: Remove LOCAL file group too
Gerber, Mike 4e19e2a655 💄 ppn2ocr: Add a proper CLI interface
Gerber, Mike 70eb73e4c7 🧹 ppn2ocr: (Re)Move TODOs
Gerber, Mike 6c74672916 🚧 ppn2ocr: Update README to use the correct path to requirements-ppn2ocr.txt
Gerber, Mike 9b7437601f 🚧 ppn2ocr: Update README
Gerber, Mike 3e0b7436a5 🚧 ppn2ocr: Add requirements-ppn2ocr.txt
Gerber, Mike 05dbffeb7a 🚧 ppn2ocr: Do not call workflow for now
Gerber, Mike 10f5198fa6 🚧 ppn2ocr: s/contain/encapsulate
Gerber, Mike f893b339c5 🚧 ppn2ocr: Properly remove the PRESENTATION file group
Gerber, Mike 014e70fe35 🚧 ppn2ocr: Actually run the workflow
Gerber, Mike 74cb361723 🚧 ppn2ocr: Extract a function to contain the IIIF hack
Gerber, Mike c7c8934e89 🚧 ppn2ocr: Convert to Python + fumble in IIIF URLs
Gerber, Mike 7c5cbc7244 📝 ppn2ocr: Add to README, including proxy configuration
Gerber, Mike 1585247482 ppn2ocr: Make PPN a command line parameter
Gerber, Mike 2a4b204fbe 🎨 ppn2ocr: Extract a function to make a workspace
Gerber, Mike 18d4ab0ba1 ppn2ocr: Use a better example document
Gerber, Mike 8024064697 🐛 ppn2ocr: Fix file:/ links to use file:///, and remove unavaiblable LOCAL file group
Gerber, Mike 612d44b074 🚧 zdb2ocr: Add TODOs from notes.md
Gerber, Mike 9303f4b4df 🚧 zdb2ocr: Produce OCR of ZEFYS newspapers (WIP)
Gerber, Mike 3b60b26c53 🐛 ppn2ocr: Do not set no_proxy here
Gerber, Mike 5675047047 🧹 ppn2ocr: We already use run-docker-hub
Gerber, Mike 770af0a205 🚧 WIP: Add script ppn2ocr to run a document by giving PPN
Gerber, Mike 4b8399fc52 📝 README-DEV: Fix git push instructions
Gerber, Mike a5b4e06a09 Allow skipping validation
Gerber, Mike f0da2c95ba ⬆ Update ocrd_olena
Gerber, Mike 78f632a523 Support --input-file-grp/-I command line parameter
Gerber, Mike 9e80926eb2 📝 Remove redundant/wrongly placed comment about SELinux vs run
Gerber, Mike 5da25f0aeb 📝 README: Use the image from Docker Hub
Gerber, Mike 830479d8c2 📝 README-DEV.md
Gerber, Mike 6ccfe08511 👷 Travis: Hopefully fix deploy on tags
Gerber, Mike a5897ead97 🎨 Re-use ./run for ./run-docker-hub
Gerber, Mike 0d61c258a6 Support running a stable version from Docker Hub
Gerber, Mike 6537e8284e 🧹 Remove work around for fixed issue
Gerber, Mike 453961b5b2 👷 Travis: Deploy on master + on tags
Gerber, Mike 4cdfdeb5fe 🐛 Travis: Do not use special characters when checking results (work around)
Gerber, Mike abf33508b7 🐛 Travis: Do not use special characters when checking results (work around)
Gerber, Mike 88c29cef68 🐛 Add tessdata_best Tesseract models again
Gerber, Mike 5e8cd47798 🧹 Clean up after first apt-get run
Gerber, Mike 4d9a833bef 🐛 Fix olena install
Gerber, Mike f65505d51a Install Tesseract from a PPA