Gerber, Mike
21c1f310b1
🚧 Travis: Fix build stage
4 years ago
Gerber, Mike
73ffa01d12
🎨 Rename boxed-* to my_ocrd_workflow-*
4 years ago
Gerber, Mike
a820d72526
🧹 s/base/core
4 years ago
Gerber, Mike
9680dd8299
⚙️ Install pip via get-pip.py
4 years ago
Gerber, Mike
92391747a7
🧹 Remove obsolete xsd/ directory
4 years ago
Gerber, Mike
02eae7b6fa
✨ Move processors into their own Docker container
4 years ago
Gerber, Mike
894cbeee32
Merge branch 'test/pip-2020-resolver'
4 years ago
Gerber, Mike
5806674fa0
🎉 Checkout pip 20.2's new 2020-resolver feature
4 years ago
Gerber, Mike
d3b6974316
💩 I hate pip too much
4 years ago
Gerber, Mike
037f64518f
⬆️ Update dinglehopper
4 years ago
Gerber, Mike
c2946395e3
🐛 Fix downloading repacked tessdata_best
4 years ago
Gerber, Mike
daa8095a25
⬆️ Update ocrd_calamari to fix pcGtsId (also ocrd for good measure)
4 years ago
Gerber, Mike
bf7c6abcbf
⬆️ Update sbb_textline_detector to fix pcGtsId issue
4 years ago
Gerber, Mike
73a125d893
⬆️ Update ocrd_tesserocr to fix pcGtsId issue
4 years ago
Gerber, Mike
0b1da9a5db
🧹 Update Calamari model path
4 years ago
Gerber, Mike
c5536d3722
💩 Increase pip default timeout
4 years ago
Gerber, Mike
b75d17e42c
Merge branch 'master' of github.com:mikegerber/my_ocrd_workflow
4 years ago
Gerber, Mike
72693071e0
⚙️ Get tessdata_best from git-annex/our mirror to make downloads more robust
4 years ago
Gerber, Mike
6ae31395a3
⚙️ Configure retries for apt-get/curl to make downloads more robust
4 years ago
Gerber, Mike
b0b1c021a8
🧹 Update Calamari model path
4 years ago
Gerber, Mike
31c36eb534
⬆️ Update qurator_data_lib.sh to allow not unpacking a downloaded file
4 years ago
Gerber, Mike
3f511bc8e3
🗒️ README: Break jpageviewer line
4 years ago
Gerber, Mike
d1a2bfe669
🐛 Deal with ocrd_olena >= 1.2.0 using one output file group only
4 years ago
Gerber, Mike
7111d28f9b
⬆️ ocrd_olena → 1.2.0
4 years ago
Gerber, Mike
1a308a5522
🧹 Use OCR-D's -P, remove now redundant validation and remove now unnecessary functions
4 years ago
Gerber, Mike
efd955c04f
🧹 Modernize my_ocrd_workflow and use OCR-D's new --overwrite
4 years ago
Gerber, Mike
3af3c6dd00
⬆️ Update qurator_data_lib.sh
4 years ago
Gerber, Mike
0f8b2d82d5
🧹 Travis: Comment out transfer.sh output
4 years ago
Gerber, Mike
032f58e4b8
Merge branch 'master' of https://github.com/mikegerber/my_ocrd_workflow
4 years ago
Gerber, Mike
131e862762
💩 Travis: Skip validation until https://github.com/OCR-D/ocrd_olena/issues/60 is fixed
4 years ago
Gerber, Mike
7fe2ce84b5
🐛 Update sbb_textline_detector to fix Keras/TF issue
4 years ago
Gerber, Mike
ef3a8a69e0
⬆️ Update ocrd_olena
4 years ago
Gerber, Mike
f7b43bbefa
✨ ppn2ocr: Support TIFF in the BEST group
5 years ago
Gerber, Mike
4e37a52899
Merge branch 'master' of github.com:mikegerber/my_ocrd_workflow
5 years ago
Gerber, Mike
bb703152db
🐛 ppn2ocr: Verify oai.sbb.berlin's certificate again
Now that oai.sbb.berlin's certificate chain is fixed, remove the
workaround again.
Fixes GH#15.
5 years ago
Gerber, Mike
c5ae23d2ef
✨ Validate before even starting, to find data problems
5 years ago
Gerber, Mike
f7b0b4121d
⁉️ Check dependencies using pipdeptree to triage Travis build fail
5 years ago
Gerber, Mike
c334b1e7ac
🧹 Move check-FULLTEXT-Page-dimensions-vs-BEST-dimensions.py code to mono-repo/experiments
5 years ago
Gerber, Mike
af4557fb33
Merge branch 'master' of https://github.com/mikegerber/my_ocrd_workflow
5 years ago
Gerber, Mike
0aa541fa18
📓 README: Reference howto/*proxy*.md instead of duplicating the proxy settings
5 years ago
Gerber, Mike
3f4ec30349
🧹 .gitignore __pycache__/*.pyc
5 years ago
Gerber, Mike
f98a1ec2c8
🐛 run: XXX Work around podman vs docker uid behaviour
5 years ago
Gerber, Mike
746fb768da
🚧 Add a script that checks FULLTEXT dimensions against BEST dimensions
5 years ago
Gerber, Mike
d2c316285c
🧹 ppn2ocr: Remove obsolete show_help()
5 years ago
Gerber, Mike
f5b2eed8a6
🐛 ppn2ocr: Work around oai.sbb.berlin certificate problem
oai.sbb.berlin does not have a valid certificate:
    % curl https://oai.sbb.berlin
    curl: (60) SSL certificate problem: unable to get local issuer certificate
    More details here: https://curl.haxx.se/docs/sslcerts.html
    curl failed to verify the legitimacy of the server and therefore could not
    establish a secure connection to it. To learn more about this situation and
    how to fix it, please visit the web page mentioned above.
Work around this by setting verify=False.
5 years ago
Gerber, Mike
448bf9e256
🐛 ppn2ocr: Remove LOCAL file group too
5 years ago
Gerber, Mike
4e19e2a655
💄 ppn2ocr: Add a proper CLI interface
5 years ago
Gerber, Mike
70eb73e4c7
🧹 ppn2ocr: (Re)Move TODOs
5 years ago
Gerber, Mike
6c74672916
🚧 ppn2ocr: Update README to use the correct path to requirements-ppn2ocr.txt
5 years ago
Gerber, Mike
9b7437601f
🚧 ppn2ocr: Update README
5 years ago