Commit Graph

199 Commits (11615be6b28bcf443332518f76e7cb303d69de81)

Author SHA1 Message Date
Gerber, Mike 7da45a0ec1 Set pcGtsId
Newest OCR-D validation checks PAGE-XML pcGtsId against METS file/@ID.
Set the pcGtsId here correctly.

Fixes #40.
4 years ago
Gerber, Mike 046e3e8ee3 🚧 Tests: Add some TODOs re data + namespace version changes 4 years ago
Gerber, Mike 0a9dbd0c25 🧹 Do not install numpy, let the TF dependency do it 4 years ago
Gerber, Mike d9afb05cf3 🐛 Use TensorFlow >= 2.3.0rc2 to fix retracing warnings 4 years ago
Gerber, Mike 4eb4f97f6b Merge branch 'feat/update-calamari1' of https://github.com/OCR-D/ocrd_calamari into feat/update-calamari1 4 years ago
Gerber, Mike 93190fae3b Recognize more than one line at a time (Fixes gh#20) 4 years ago
Gerber, Mike 7584d0135c ⬆️ Update model download for Calamari 1.0 4 years ago
Gerber, Mike 9ea50e25d1 ⬆️ Update to Calamari 1.0.x 4 years ago
Gerber, Mike 027fcd7d75 🐛 Fix test file path 4 years ago
Gerber, Mike 8ab57e44dc ⬆️ Update model download for Calamari 1.0 4 years ago
Gerber, Mike 7dff7784c5 ⬆️ Update to Calamari 1.0.x 4 years ago
Mike Gerber c6ced9b3e9 Merge pull request #39 from OCR-D/dont-install-test
setup.py: exclude "test", not "tests", from installation
5 years ago
kba e03ff4064b setup.py: exclude "test", not "tests", from installation 5 years ago
Mike Gerber fb538845d8 📄 Update license (Fixes #35)
Set copyright owner name. Also, going along the lines of "update the year when substantial revision of the work happened", set the copyright years. The latter may not be necessary, because "life of author + 70 years" or something.
5 years ago
Gerber, Mike 123ee61a8b v0.0.6 5 years ago
Gerber, Mike 69df78bce1 🐛 setup.py: Fix GitHub url by s/kba/OCR-D 5 years ago
Gerber, Mike 62e5e0c295 🐛 ocrd-tool.json: Fix GitHub url by s/kba/OCR-D 5 years ago
Gerber, Mike 0334a35870 🐛 Sort predictions in exactly the same way, also when building the text 5 years ago
Gerber, Mike 0c9e1f13c7 🐛 Sort predictions in exactly the same way to make sure we are correctly removing spaces 5 years ago
Gerber, Mike d2c843aa3f 📦 v0.0.5 5 years ago
Gerber, Mike cd8f6a5fcb 🐛 Use line id for debug message 5 years ago
Gerber, Mike 5b6d8b3f41 🐛 Build line text on our own
Calamari does whitespace post-processing on prediction.sentence, while
it does not do the same on prediction.positions. Do it on our own to
have consistency.

Fixes GH-37.
5 years ago
Gerber, Mike 30f7e1b246 🐳 Docker: Run pip3 check for good measure 5 years ago
Gerber, Mike 303172b279 📝 Document make targets 5 years ago
Gerber, Mike a2d1d76dbd 🐳 Docker: Do not use the make target to install calamari-ocr, stick to pip 5 years ago
Gerber, Mike 41f5c8a8fa 🐳 Docker: Upgrade pip to silence warning and fix potential other problems 5 years ago
Gerber, Mike 7c18b1d391 🐳 Docker: Use ocrd/core:master instead of outdated :edge 5 years ago
Gerber, Mike 1fda419f25 🐳 Fix Docker build 5 years ago
Gerber, Mike 71096493ac 📝 README-DEV: Improve info about releasing 5 years ago
Gerber, Mike b26194179c 📝 README-DEV: Improve markdown 5 years ago
Gerber, Mike cf7a788854 📝 README-DEV: Mention cleaning up the dict/ directory 5 years ago
Gerber, Mike 4508e3ec47 📦 v0.0.4 5 years ago
Gerber, Mike 73beab1770 📝 README: Add a missing `cd` 5 years ago
Gerber, Mike 3416a155ec 📝 README: Provide a complete example using real data and other processors
See #33.
5 years ago
Gerber, Mike f2001a79f1 Merge branch 'master' of https://github.com/OCR-D/ocrd_calamari 5 years ago
Gerber, Mike 3e426b2a0a 📝 README: Use gt4histocr-calamari from the Makefile in the example
See #33.
5 years ago
Mike Gerber 46fe34400f 📝 README: Link to the correct ocrd-tool.json 5 years ago
Mike Gerber 0c7cd69526 📝 README: Update intro that we're mostly on par with Calamari's functionality 5 years ago
Gerber, Mike b802b4deaf Allow configuring a cut off confidence value for glyph alternatives 5 years ago
Gerber, Mike e39a2bce01 📝 Fix example parameters JSON 5 years ago
Gerber, Mike ef3fb44fb5 Allow controlling of output hierarchy level, e.g. only line, not words+glyphs 5 years ago
Gerber, Mike 0f0bae18ba Remove GT text to not accidentally check it instead of OCR text 5 years ago
Gerber, Mike 82fe0333f1 Test word segmentation (Fixes #30) 5 years ago
Gerber, Mike 9010250911 ♻ test: Move binarization into the workspace fixture 5 years ago
Gerber, Mike 6f4736f8e4 Do word segmentation as expected by OCR-D PAGE specs 5 years ago
Gerber, Mike 0f9c94e7dc 🐛 Start with TextEquiv index=1 to adhere to OCR-D PAGE conventions
https://ocr-d.github.io/page#multiple-textequivs
5 years ago
Gerber, Mike 909632493b 🚧 Add future TODOs 5 years ago
Gerber, Mike 3149e1d9e0 📝 unwanted() 5 years ago
Gerber, Mike 91cca1e1b8 📝 Document why we are using Unicode text segmentation to produce word results 5 years ago
Gerber, Mike 0a572df0ba 📝 README: Add information about the new glyph and word segmentation 5 years ago