Commit graph

230 commits

Author SHA1 Message Date
abf33508b7 🐛 Travis: Do not use special characters when checking results (work around) 2020-03-02 13:45:54 +01:00
88c29cef68 🐛 Add tessdata_best Tesseract models again 2020-03-02 12:47:46 +01:00
5e8cd47798 🧹 Clean up after first apt-get run 2020-03-02 12:07:52 +01:00
4d9a833bef 🐛 Fix olena install 2020-03-02 12:06:50 +01:00
f65505d51a Install Tesseract from a PPA 2020-03-02 12:05:55 +01:00
4cb4f6f2bf ⬆ Update qurator_data_lib.sh to use a silent curl instead of wget 2020-02-27 12:31:09 +01:00
7ecca0e92a ⬆ Update Tesseract to 4.1.1 2020-02-26 17:43:27 +01:00
9f29e53e63 Travis: Cache Docker builds from previous image 2020-02-26 13:18:27 +01:00
d8463e2ea7 Travis: Try a multi-stage build 2020-02-26 12:50:42 +01:00
cc7504ea33 🔍 Try uploading our XML to check it 2020-02-26 11:59:56 +01:00
20310d454a 🔍 Try uploading our XML to check it 2020-02-25 19:29:22 +01:00
90188c37cb Download tessdata_best from qurator-data.de mirror 2020-02-25 18:33:59 +01:00
007d26df87 🚧 Install olena via preliminary Ubuntu package 2020-02-25 18:33:16 +01:00
8462112863
📝 README: **test environment** 2020-02-21 16:59:35 +01:00
8177aab29f 📝 README: Mention historical prints 2020-02-21 16:57:22 +01:00
dcbdefc16e 📝 README: Describe what this does and why 2020-02-21 16:54:10 +01:00
abc207d655 📝 README: Include example workspace + reference PAGE Viewer and dinglehopper
Fixes GH-7.
2020-02-21 13:22:06 +01:00
8cd842419d ⬆ Update ocrd_tesserocr to fix glyph bug (OCR-D/ocrd_tesserocr#112) 2020-02-17 18:52:44 +01:00
2f10596d28 ⬆ Update qurator_data_lib.sh (Fixes GH-5) 2020-02-13 19:02:57 +01:00
c92b10b984 🚧 qurator_data_lib.sh: Do not hardcode data/ 2020-02-13 18:19:49 +01:00
58282c9e95 Include glyph output 2020-02-13 16:13:24 +01:00
11a30892c5 🔍 Only do pip3 list when LOG_LEVEL >= DEBUG 2020-02-13 15:02:15 +01:00
9f111ca362 🧹 Do not validate OCR results twice 2020-02-12 18:08:11 +01:00
8ca25f3c56 🎨 Expose OCR textequiv_level as a environment variable 2020-02-12 14:48:22 +01:00
0cb0e35e8e ⬆ Update ocrd_calamari to 0.0.5 2020-02-12 14:28:04 +01:00
868ac5774c 🎨 Improve structure and documentation of run 2020-02-11 15:51:56 +01:00
979c7044a8 Make OCR-D-IMG-BIN output group explicit 2020-02-11 14:26:53 +01:00
2b93fe3400 👷 qurator_data_lib.sh: Check that we are running bash 2020-02-11 14:15:32 +01:00
28bb482ceb Produce word results 2020-02-10 19:26:04 +01:00
6ae85063c5 📝 Document do_validate() options better 2020-02-10 19:25:08 +01:00
1252d8ccc3 🎨 Nudge build+download towards the standard qurator_data_lib.sh 2020-02-10 19:23:17 +01:00
61bb4f99f6 Travis: Add status badge 2020-02-10 18:01:25 +01:00
8a92556afd Travis: Check OCR results 2020-02-10 17:54:52 +01:00
d64713f5b6 Merge branch 'master' of https://github.com/mikegerber/my_ocrd_workflow 2020-02-10 17:05:02 +01:00
bb92f8b1e6 🧹 Remove half-ass GPU support to fix Travis build
Follow up in https://github.com/mikegerber/my_ocrd_workflow/issues/6.
2020-02-10 17:04:12 +01:00
c8a64e5d57 🐛 Fix textline_detection model download 2020-02-10 15:46:55 +01:00
a097532847 🐛 select+do is apparently a bash feature, so make ./build a bash script 2020-02-10 15:12:27 +01:00
db8a6f6a0b Travis: Avoid trying to checkout private data/ submodule 2020-02-10 15:04:41 +01:00
7d17b9b2d4 Add initial Travis configuration 2020-02-10 15:00:56 +01:00
788aedcb9b ⬆ Update sbb_textline_detector (just README changes) 2020-02-10 14:40:17 +01:00
934814c03c 🐳 Docker: Comment installing requirements 2020-02-10 14:39:21 +01:00
6d5305d07b 🧹 Docker: Move textline_detection model copy to the other OCR model copies 2020-02-10 14:37:23 +01:00
041cee707e 🧹 Remove unused vendor/sbb_tetline_detector tar (Fixes GH-3) 2020-02-10 13:47:34 +01:00
0889d1a5e3 🧹 Update/move some XXXs/TODOs 2020-02-07 19:42:54 +01:00
b08dc66f9f 🐳 Docker: Consistently use trailing slash when copying a file to a directory 2020-02-07 19:37:31 +01:00
2cf68f149d ♻ Extract a main() function for the main stuff 2020-02-07 18:57:46 +01:00
be0a0c353a 📝 Document the two remaining un-documented functions 2020-02-07 18:47:16 +01:00
848dd143fd 🎨 Use long command lines again 2020-02-07 18:46:33 +01:00
6b83d5ae1e 🧹 Update/move some XXXs/TODOs 2020-02-07 18:01:26 +01:00
98aee51801 🔧 Set up logging level using /etc/ocrd_logging.py instead of "-l" 2020-02-07 17:51:53 +01:00