Commit Graph

355 Commits (56b364154bccf797a22f1a111b58302a6b1eb3dc)
 

Author SHA1 Message Date
Gerber, Mike 74cb361723 🚧 ppn2ocr: Extract a function to contain the IIIF hack
Gerber, Mike c7c8934e89 🚧 ppn2ocr: Convert to Python + fumble in IIIF URLs
Gerber, Mike 7c5cbc7244 📝 ppn2ocr: Add to README, including proxy configuration
Gerber, Mike 1585247482 ppn2ocr: Make PPN a command line parameter
Gerber, Mike 2a4b204fbe 🎨 ppn2ocr: Extract a function to make a workspace
Gerber, Mike 18d4ab0ba1 ppn2ocr: Use a better example document
Gerber, Mike 8024064697 🐛 ppn2ocr: Fix file:/ links to use file:///, and remove unavaiblable LOCAL file group
Gerber, Mike 612d44b074 🚧 zdb2ocr: Add TODOs from notes.md
Gerber, Mike 9303f4b4df 🚧 zdb2ocr: Produce OCR of ZEFYS newspapers (WIP)
Gerber, Mike 3b60b26c53 🐛 ppn2ocr: Do not set no_proxy here
Gerber, Mike 5675047047 🧹 ppn2ocr: We already use run-docker-hub
Gerber, Mike 770af0a205 🚧 WIP: Add script ppn2ocr to run a document by giving PPN
Gerber, Mike 4b8399fc52 📝 README-DEV: Fix git push instructions
Gerber, Mike a5b4e06a09 Allow skipping validation
Gerber, Mike f0da2c95ba ⬆ Update ocrd_olena
Gerber, Mike 78f632a523 Support --input-file-grp/-I command line parameter
Gerber, Mike 9e80926eb2 📝 Remove redundant/wrongly placed comment about SELinux vs run
Gerber, Mike 5da25f0aeb 📝 README: Use the image from Docker Hub
Gerber, Mike 830479d8c2 📝 README-DEV.md
Gerber, Mike 6ccfe08511 👷 Travis: Hopefully fix deploy on tags
Gerber, Mike a5897ead97 🎨 Re-use ./run for ./run-docker-hub
Gerber, Mike 0d61c258a6 Support running a stable version from Docker Hub
Gerber, Mike 6537e8284e 🧹 Remove work around for fixed issue
Gerber, Mike 453961b5b2 👷 Travis: Deploy on master + on tags
Gerber, Mike 4cdfdeb5fe 🐛 Travis: Do not use special characters when checking results (work around)
Gerber, Mike abf33508b7 🐛 Travis: Do not use special characters when checking results (work around)
Gerber, Mike 88c29cef68 🐛 Add tessdata_best Tesseract models again
Gerber, Mike 5e8cd47798 🧹 Clean up after first apt-get run
Gerber, Mike 4d9a833bef 🐛 Fix olena install
Gerber, Mike f65505d51a Install Tesseract from a PPA
Gerber, Mike 4cb4f6f2bf ⬆ Update qurator_data_lib.sh to use a silent curl instead of wget
Gerber, Mike 7ecca0e92a ⬆ Update Tesseract to 4.1.1
Gerber, Mike 9f29e53e63 Travis: Cache Docker builds from previous image
Gerber, Mike d8463e2ea7 Travis: Try a multi-stage build
Gerber, Mike cc7504ea33 🔍 Try uploading our XML to check it
Gerber, Mike 20310d454a 🔍 Try uploading our XML to check it
Gerber, Mike 90188c37cb Download tessdata_best from qurator-data.de mirror
Gerber, Mike 007d26df87 🚧 Install olena via preliminary Ubuntu package
Mike Gerber 8462112863
📝 README: **test environment**
Gerber, Mike 8177aab29f 📝 README: Mention historical prints
Gerber, Mike dcbdefc16e 📝 README: Describe what this does and why
Gerber, Mike abc207d655 📝 README: Include example workspace + reference PAGE Viewer and dinglehopper
Fixes GH-7.
Gerber, Mike 8cd842419d ⬆ Update ocrd_tesserocr to fix glyph bug ()
Gerber, Mike 2f10596d28 ⬆ Update qurator_data_lib.sh (Fixes GH-5)
Gerber, Mike c92b10b984 🚧 qurator_data_lib.sh: Do not hardcode data/
Gerber, Mike 58282c9e95 Include glyph output
Gerber, Mike 11a30892c5 🔍 Only do pip3 list when LOG_LEVEL >= DEBUG
Gerber, Mike 9f111ca362 🧹 Do not validate OCR results twice
Gerber, Mike 8ca25f3c56 🎨 Expose OCR textequiv_level as a environment variable
Gerber, Mike 0cb0e35e8e ⬆ Update ocrd_calamari to 0.0.5