Gerber, Mike
3f4ec30349
🧹 .gitignore __pychache__/*.pyc
4 years ago
Gerber, Mike
f98a1ec2c8
🐛 run: XXX Work around podman vs docker uid behaviour
4 years ago
Gerber, Mike
746fb768da
🚧 Add a script that checks FULLTEXT dimensions against BEST dimensions
4 years ago
Gerber, Mike
d2c316285c
🧹 ppn2ocr: Remove obsolete show_help()
4 years ago
Gerber, Mike
f5b2eed8a6
🐛 ppn2ocr: Work around oai.sbb.berlin certificate problem
...
oai.sbb.berlin does not have a valid certificate:
% curl https://oai.sbb.berlin
curl: (60) SSL certificate problem: unable to get local issuer certificate
More details here: https://curl.haxx.se/docs/sslcerts.html
curl failed to verify the legitimacy of the server and therefore could not
establish a secure connection to it. To learn more about this situation and
how to fix it, please visit the web page mentioned above.
Work around this by setting verify=False.
4 years ago
Gerber, Mike
4e19e2a655
💄 ppn2ocr: Add a proper CLI interface
4 years ago
Gerber, Mike
70eb73e4c7
🧹 ppn2ocr: (Re)Move TODOs
4 years ago
Gerber, Mike
6c74672916
🚧 ppn2ocr: Update README to use the correct path to requirements-ppn2ocr.txt
4 years ago
Gerber, Mike
9b7437601f
🚧 ppn2ocr: Update README
4 years ago
Gerber, Mike
3e0b7436a5
🚧 ppn2ocr: Add requirements-ppn2ocr.txt
4 years ago
Gerber, Mike
05dbffeb7a
🚧 ppn2ocr: Do not call workflow for now
4 years ago
Gerber, Mike
10f5198fa6
🚧 ppn2ocr: s/contain/encapsulate
4 years ago
Gerber, Mike
f893b339c5
🚧 ppn2ocr: Properly remove the PRESENTATION file group
4 years ago
Gerber, Mike
014e70fe35
🚧 ppn2ocr: Actually run the workflow
4 years ago
Gerber, Mike
74cb361723
🚧 ppn2ocr: Extract a function to contain the IIIF hack
4 years ago
Gerber, Mike
c7c8934e89
🚧 ppn2ocr: Convert to Python + fumble in IIIF URLs
4 years ago
Gerber, Mike
7c5cbc7244
📝 ppn2ocr: Add to README, including proxy configuration
5 years ago
Gerber, Mike
1585247482
✨ ppn2ocr: Make PPN a command line parameter
5 years ago
Gerber, Mike
2a4b204fbe
🎨 ppn2ocr: Extract a function to make a workspace
5 years ago
Gerber, Mike
18d4ab0ba1
✨ ppn2ocr: Use a better example document
5 years ago
Gerber, Mike
8024064697
🐛 ppn2ocr: Fix file:/ links to use file:///, and remove unavaiblable LOCAL file group
5 years ago
Gerber, Mike
612d44b074
🚧 zdb2ocr: Add TODOs from notes.md
5 years ago
Gerber, Mike
9303f4b4df
🚧 zdb2ocr: Produce OCR of ZEFYS newspapers (WIP)
5 years ago
Gerber, Mike
3b60b26c53
🐛 ppn2ocr: Do not set no_proxy here
5 years ago
Gerber, Mike
5675047047
🧹 ppn2ocr: We already use run-docker-hub
5 years ago
Gerber, Mike
770af0a205
🚧 WIP: Add script ppn2ocr to run a document by giving PPN
5 years ago
Gerber, Mike
4b8399fc52
📝 README-DEV: Fix git push instructions
5 years ago
Gerber, Mike
a5b4e06a09
✨ Allow skipping validation
5 years ago
Gerber, Mike
f0da2c95ba
⬆ Update ocrd_olena
5 years ago
Gerber, Mike
78f632a523
✨ Support --input-file-grp/-I command line parameter
5 years ago
Gerber, Mike
9e80926eb2
📝 Remove redundant/wrongly placed comment about SELinux vs run
5 years ago
Gerber, Mike
5da25f0aeb
📝 README: Use the image from Docker Hub
5 years ago
Gerber, Mike
830479d8c2
📝 README-DEV.md
5 years ago
Gerber, Mike
6ccfe08511
👷 Travis: Hopefully fix deploy on tags
5 years ago
Gerber, Mike
a5897ead97
🎨 Re-use ./run for ./run-docker-hub
5 years ago
Gerber, Mike
0d61c258a6
✨ Support running a stable version from Docker Hub
5 years ago
Gerber, Mike
6537e8284e
🧹 Remove work around for fixed issue qurator-spk/bb_textline_detection#27
5 years ago
Gerber, Mike
453961b5b2
👷 Travis: Deploy on master + on tags
5 years ago
Gerber, Mike
4cdfdeb5fe
🐛 Travis: Do not use special characters when checking results (work around)
5 years ago
Gerber, Mike
abf33508b7
🐛 Travis: Do not use special characters when checking results (work around)
5 years ago
Gerber, Mike
88c29cef68
🐛 Add tessdata_best Tesseract models again
5 years ago
Gerber, Mike
5e8cd47798
🧹 Clean up after first apt-get run
5 years ago
Gerber, Mike
4d9a833bef
🐛 Fix olena install
5 years ago
Gerber, Mike
f65505d51a
⚡ Install Tesseract from a PPA
5 years ago
Gerber, Mike
4cb4f6f2bf
⬆ Update qurator_data_lib.sh to use a silent curl instead of wget
5 years ago
Gerber, Mike
7ecca0e92a
⬆ Update Tesseract to 4.1.1
5 years ago
Gerber, Mike
9f29e53e63
✅ Travis: Cache Docker builds from previous image
5 years ago
Gerber, Mike
d8463e2ea7
✅ Travis: Try a multi-stage build
5 years ago
Gerber, Mike
cc7504ea33
🔍 Try uploading our XML to check it
5 years ago
Gerber, Mike
20310d454a
🔍 Try uploading our XML to check it
5 years ago