Commit graph

326 commits

Author SHA1 Message Date
bb703152db 🐛 ppn2ocr: Verify oai.sbb.berlin's certificate again
Now that oai.sbb.berlin's certificate chain is fixed, remove the
workaround again.

Fixes GH#15.
2020-06-23 15:15:21 +02:00
c5ae23d2ef Validate before even starting, to find data problems 2020-06-19 19:27:32 +02:00
f7b0b4121d ⁉️ Check dependencies using pipdeptree to triage Travis build fail 2020-06-19 17:47:35 +02:00
c334b1e7ac 🧹 Move check-FULLTEXT-Page-dimensions-vs-BEST-dimensions.py code to mono-repo/experiments 2020-06-19 16:01:07 +02:00
af4557fb33 Merge branch 'master' of https://github.com/mikegerber/my_ocrd_workflow 2020-06-18 15:46:46 +02:00
0aa541fa18 📓 README: Reference howto/*proxy*.md instead of duplicating the proxy settings 2020-06-18 14:46:01 +02:00
3f4ec30349 🧹 .gitignore __pychache__/*.pyc 2020-06-18 10:51:43 +02:00
f98a1ec2c8 🐛 run: XXX Work around podman vs docker uid behaviour 2020-06-18 10:50:24 +02:00
746fb768da 🚧 Add a script that checks FULLTEXT dimensions against BEST dimensions 2020-06-18 10:49:31 +02:00
d2c316285c 🧹 ppn2ocr: Remove obsolete show_help() 2020-06-17 16:44:17 +02:00
f5b2eed8a6 🐛 ppn2ocr: Work around oai.sbb.berlin certificate problem
oai.sbb.berlin does not have a valid certificate:

% curl https://oai.sbb.berlin
curl: (60) SSL certificate problem: unable to get local issuer certificate
More details here: https://curl.haxx.se/docs/sslcerts.html

curl failed to verify the legitimacy of the server and therefore could not
establish a secure connection to it. To learn more about this situation and
how to fix it, please visit the web page mentioned above.

Work around this by setting verify=False.
2020-06-09 11:19:25 +02:00
448bf9e256 🐛 ppn2ocr: Remove LOCAL file group too 2020-06-04 19:55:00 +02:00
4e19e2a655 💄 ppn2ocr: Add a proper CLI interface 2020-06-03 15:53:45 +02:00
70eb73e4c7 🧹 ppn2ocr: (Re)Move TODOs 2020-06-03 15:34:00 +02:00
6c74672916 🚧 ppn2ocr: Update README to use the correct path to requirements-ppn2ocr.txt 2020-06-03 11:18:36 +02:00
9b7437601f 🚧 ppn2ocr: Update README 2020-06-03 11:17:18 +02:00
3e0b7436a5 🚧 ppn2ocr: Add requirements-ppn2ocr.txt 2020-06-03 11:14:44 +02:00
05dbffeb7a 🚧 ppn2ocr: Do not call workflow for now 2020-06-03 10:12:36 +02:00
10f5198fa6 🚧 ppn2ocr: s/contain/encapsulate 2020-06-03 10:11:23 +02:00
f893b339c5 🚧 ppn2ocr: Properly remove the PRESENTATION file group 2020-06-03 10:10:54 +02:00
014e70fe35 🚧 ppn2ocr: Actually run the workflow 2020-06-02 19:25:31 +02:00
74cb361723 🚧 ppn2ocr: Extract a function to contain the IIIF hack 2020-06-02 19:18:06 +02:00
c7c8934e89 🚧 ppn2ocr: Convert to Python + fumble in IIIF URLs 2020-06-02 19:06:31 +02:00
7c5cbc7244 📝 ppn2ocr: Add to README, including proxy configuration 2020-05-22 17:23:49 +02:00
1585247482 ppn2ocr: Make PPN a command line parameter 2020-05-22 17:15:50 +02:00
2a4b204fbe 🎨 ppn2ocr: Extract a function to make a workspace 2020-05-22 16:53:20 +02:00
18d4ab0ba1 ppn2ocr: Use a better example document 2020-05-22 16:45:19 +02:00
8024064697 🐛 ppn2ocr: Fix file:/ links to use file:///, and remove unavaiblable LOCAL file group 2020-05-22 16:09:00 +02:00
612d44b074 🚧 zdb2ocr: Add TODOs from notes.md 2020-05-22 13:49:34 +02:00
9303f4b4df 🚧 zdb2ocr: Produce OCR of ZEFYS newspapers (WIP) 2020-05-22 13:43:11 +02:00
3b60b26c53 🐛 ppn2ocr: Do not set no_proxy here 2020-05-18 21:03:06 +02:00
5675047047 🧹 ppn2ocr: We already use run-docker-hub 2020-05-14 16:25:34 +02:00
770af0a205 🚧 WIP: Add script ppn2ocr to run a document by giving PPN 2020-03-09 18:27:29 +01:00
4b8399fc52 📝 README-DEV: Fix git push instructions 2020-03-09 18:26:40 +01:00
a5b4e06a09 Allow skipping validation 2020-03-09 16:50:30 +01:00
f0da2c95ba ⬆ Update ocrd_olena 2020-03-09 16:49:43 +01:00
78f632a523 Support --input-file-grp/-I command line parameter 2020-03-09 12:26:38 +01:00
9e80926eb2 📝 Remove redundant/wrongly placed comment about SELinux vs run 2020-03-03 10:27:32 +01:00
5da25f0aeb 📝 README: Use the image from Docker Hub 2020-03-02 18:52:34 +01:00
830479d8c2 📝 README-DEV.md 2020-03-02 18:07:52 +01:00
6ccfe08511 👷 Travis: Hopefully fix deploy on tags 2020-03-02 17:16:14 +01:00
a5897ead97 🎨 Re-use ./run for ./run-docker-hub 2020-03-02 17:09:51 +01:00
0d61c258a6 Support running a stable version from Docker Hub 2020-03-02 16:57:24 +01:00
6537e8284e 🧹 Remove work around for fixed issue qurator-spk/bb_textline_detection#27 2020-03-02 15:18:24 +01:00
453961b5b2 👷 Travis: Deploy on master + on tags 2020-03-02 14:54:12 +01:00
4cdfdeb5fe 🐛 Travis: Do not use special characters when checking results (work around) 2020-03-02 14:15:08 +01:00
abf33508b7 🐛 Travis: Do not use special characters when checking results (work around) 2020-03-02 13:45:54 +01:00
88c29cef68 🐛 Add tessdata_best Tesseract models again 2020-03-02 12:47:46 +01:00
5e8cd47798 🧹 Clean up after first apt-get run 2020-03-02 12:07:52 +01:00
4d9a833bef 🐛 Fix olena install 2020-03-02 12:06:50 +01:00