Gerber, Mike
7da45a0ec1
Set pcGtsId
...
Newest OCR-D validation checks PAGE-XML pcGtsId against METS file/@ID.
Set the pcGtsId here correctly.
Fixes #40 .
4 years ago
Gerber, Mike
046e3e8ee3
🚧 Tests: Add some TODOs re data + namespace version changes
4 years ago
Gerber, Mike
0a9dbd0c25
🧹 Do not install numpy, let the TF dependency do it
4 years ago
Gerber, Mike
d9afb05cf3
🐛 Use TensorFlow >= 2.3.0rc2 to fix retracing warnings
4 years ago
Gerber, Mike
4eb4f97f6b
Merge branch 'feat/update-calamari1' of https://github.com/OCR-D/ocrd_calamari into feat/update-calamari1
4 years ago
Gerber, Mike
93190fae3b
⚡ Recognize more than one line at a time (Fixes gh#20)
4 years ago
Gerber, Mike
7584d0135c
⬆️ Update model download for Calamari 1.0
4 years ago
Gerber, Mike
9ea50e25d1
⬆️ Update to Calamari 1.0.x
4 years ago
Gerber, Mike
027fcd7d75
🐛 Fix test file path
4 years ago
Gerber, Mike
8ab57e44dc
⬆️ Update model download for Calamari 1.0
4 years ago
Gerber, Mike
7dff7784c5
⬆️ Update to Calamari 1.0.x
4 years ago
Mike Gerber
c6ced9b3e9
Merge pull request #39 from OCR-D/dont-install-test
...
setup.py: exclude "test", not "tests", from installation
5 years ago
kba
e03ff4064b
setup.py: exclude "test", not "tests", from installation
5 years ago
Mike Gerber
fb538845d8
📄 Update license ( Fixes #35 )
...
Set copyright owner name. Also, going along the lines of "update the year when substantial revision of the work happenend", set the copyright years. The latter may be not be necessary, because "life of author + 70 years" or something.
5 years ago
Gerber, Mike
123ee61a8b
v0.0.6
5 years ago
Gerber, Mike
69df78bce1
🐛 setup.py: Fix GitHub url by s/kba/OCR-D
5 years ago
Gerber, Mike
62e5e0c295
🐛 ocrd-tool.json: Fix GitHub url by s/kba/OCR-D
5 years ago
Gerber, Mike
0334a35870
🐛 Sort predictions in exactly the same way, also when building the text
5 years ago
Gerber, Mike
0c9e1f13c7
🐛 Sort predictions in exactly the same way to make sure we are correctly removing spaces
5 years ago
Gerber, Mike
d2c843aa3f
📦 v0.0.5
5 years ago
Gerber, Mike
cd8f6a5fcb
🐛 Use line id for debug message
5 years ago
Gerber, Mike
5b6d8b3f41
🐛 Build line text on our own
...
Calamari does whitespace post-processing on prediction.sentence, while
it does not do the same on prediction.positions. Do it on our own to
have consistency.
Fixes GH-37.
5 years ago
Gerber, Mike
30f7e1b246
🐳 Docker: Run pip3 check for good measure
5 years ago
Gerber, Mike
303172b279
📝 Document make targets
5 years ago
Gerber, Mike
a2d1d76dbd
🐳 Docker: Do not use the make target to install calamari-ocr, stick to pip
5 years ago
Gerber, Mike
41f5c8a8fa
🐳 Docker: Upgrade pip to silence warning and fix potential other problems
5 years ago
Gerber, Mike
7c18b1d391
🐳 Docker: Use ocrd/core:master instead of outdated :edge
5 years ago
Gerber, Mike
1fda419f25
🐳 Fix Docker build
5 years ago
Gerber, Mike
71096493ac
📝 README-DEV: Improve info about releasing
5 years ago
Gerber, Mike
b26194179c
📝 README-DEV: Improve markdown
5 years ago
Gerber, Mike
cf7a788854
📝 README-DEV: Mention cleaning up the dict/ directory
5 years ago
Gerber, Mike
4508e3ec47
📦 v0.0.4
5 years ago
Gerber, Mike
73beab1770
📝 README: Add a missing `cd`
5 years ago
Gerber, Mike
3416a155ec
📝 README: Provide a complete example using real data and other processors
...
See #33 .
5 years ago
Gerber, Mike
f2001a79f1
Merge branch 'master' of https://github.com/OCR-D/ocrd_calamari
5 years ago
Gerber, Mike
3e426b2a0a
📝 README: Use gt4histocr-calamari from the Makefile in the example
...
See #33 .
5 years ago
Mike Gerber
46fe34400f
📝 README: Link to the correct ocrd-tool.json
5 years ago
Mike Gerber
0c7cd69526
📝 README: Update intro that we're mostly on par with Calamari's functionality
5 years ago
Gerber, Mike
b802b4deaf
✨ Allow configuring a cut off confidence value for glyph alternatives
5 years ago
Gerber, Mike
e39a2bce01
📝 Fix example parameters JSON
5 years ago
Gerber, Mike
ef3fb44fb5
✨ Allow controlling of output hierarchy level, e.g. only line, not words+glyphs
5 years ago
Gerber, Mike
0f0bae18ba
✅ Remove GT text to not accidently check it instead of OCR text
5 years ago
Gerber, Mike
82fe0333f1
✅ Test word segmentation ( Fixes #30 )
5 years ago
Gerber, Mike
9010250911
♻ test: Move binarization into the workspace fixture
5 years ago
Gerber, Mike
6f4736f8e4
✨ Do word segmentation as expected by OCR-D PAGE specs
5 years ago
Gerber, Mike
0f9c94e7dc
🐛 Start with TextEquiv index=1 to adhere to OCR-D PAGE conventions
...
https://ocr-d.github.io/page#multiple-textequivs
5 years ago
Gerber, Mike
909632493b
🚧 Add future TODOs
5 years ago
Gerber, Mike
3149e1d9e0
📝 unwanted()
5 years ago
Gerber, Mike
91cca1e1b8
📝 Document why we are using Unicode text segmentation to produce word results
5 years ago
Gerber, Mike
0a572df0ba
📝 README: Add information about the new glyph and word segmentation
5 years ago