|
87968cd297
|
🧹 README: Move TODO to my usual TODO list
|
2019-09-24 13:44:15 +02:00 |
|
|
debecf71b9
|
💩 Install the right Pillow version manually...
|
2019-09-23 15:04:28 +02:00 |
|
|
a3d6befb0d
|
🏗 Build Tesseract from source
|
2019-09-23 15:02:59 +02:00 |
|
|
d903e3634c
|
📝 README: Clarify workspace TODO
|
2019-09-23 15:01:27 +02:00 |
|
|
8a04602044
|
📝 README: Not using podman anymore
|
2019-09-23 15:01:03 +02:00 |
|
|
0782dbde32
|
⬆ Update dinglehopper
|
2019-08-22 15:37:48 +02:00 |
|
|
343a3fbf82
|
🔧 Evaluate both Tesseract and Calamari results
|
2019-08-21 13:07:27 +02:00 |
|
|
0bc06c2fad
|
✨ Run Calamari OCR
|
2019-08-21 11:54:01 +02:00 |
|
|
001e62f54a
|
🔧 Use docker, not podman
|
2019-08-20 12:25:12 +02:00 |
|
|
daed87566e
|
🚑 Don't install typegroups classifier for now
|
2019-08-16 18:23:15 +02:00 |
|
|
d8f3438ac5
|
🚑 Don't check pixel density
|
2019-08-16 18:21:59 +02:00 |
|
|
b169f35bb1
|
🔧 Build container with cache again
|
2019-08-16 18:21:12 +02:00 |
|
|
85ff80d548
|
✨ Use dinglehopper's new OCR-D interface
|
2019-08-16 14:04:41 +02:00 |
|
|
d5aa273b44
|
🚧 Use ocr-eval aka dinglehopper
|
2019-08-13 18:13:49 +02:00 |
|
|
be5750f4e1
|
✨ As a last step, downgrade to PAGE 2018 to support PAGE Viewer
|
2019-08-05 18:46:36 +02:00 |
|
|
cf2b4de2a0
|
🧹 Validate again after fixing image references
|
2019-08-05 17:46:20 +02:00 |
|
|
21e00932be
|
🐛 Use a valid filegrp USE for fontident
|
2019-08-05 17:38:24 +02:00 |
|
|
ade39a278c
|
🎨 Align file groups
|
2019-08-05 17:08:58 +02:00 |
|
|
3fee2d4fe6
|
📌 Use my ocrd_typegroups_classifier fix for passing down the page id
|
2019-08-05 17:00:54 +02:00 |
|
|
44772f1923
|
🚧 Work around problems with ocrd-tesserocr producing TextEquiv/@conf
|
2019-08-05 15:40:39 +02:00 |
|
|
8b67866aac
|
✨ Validate PAGE XML after OCR
|
2019-08-05 15:31:24 +02:00 |
|
|
0d7fd21446
|
✨ Validate workspace after each step
|
2019-08-05 15:27:38 +02:00 |
|
|
d37db86da1
|
📌 Use my ocrd_kraken fix for passing down the page id
|
2019-08-05 13:19:44 +02:00 |
|
|
4addde2e19
|
Use PAGE 2019
|
2019-08-02 11:59:16 +02:00 |
|
|
de841746e3
|
Use PAGE 2019
|
2019-08-02 11:58:56 +02:00 |
|
|
ff0570e151
|
Use frk for now
|
2019-08-02 11:58:46 +02:00 |
|
|
b4f5d44ac8
|
🐛 ocrd-bugs: bug-ocropy-segment-littering.sh
|
2019-08-01 15:13:04 +02:00 |
|
|
cc81afa1a5
|
🧹 No need to clean up after tesserocr
|
2019-07-03 13:46:49 +02:00 |
|
|
89a2893e4e
|
❌ I do not care for the multiple mets:agents elements
|
2019-07-03 12:35:15 +02:00 |
|
|
0e63fa1756
|
⁉ PyTessApi seems to use both engine modes
|
2019-07-03 12:30:52 +02:00 |
|
|
e3a1afbc93
|
📝 Document the functions
|
2019-07-03 12:22:55 +02:00 |
|
|
2204aee104
|
🐋 Docker: Simplify requirements install
|
2019-07-02 17:31:42 +02:00 |
|
|
ddda6e48bc
|
🐛 Add my collection of OCR-D bug reproducers
|
2019-06-26 16:26:13 +02:00 |
|
|
cfa7d10747
|
📜 Add README.md
|
2019-06-25 17:54:16 +02:00 |
|
|
0ea6b02fff
|
⬆ Update to ocrd >= 1.0.0b10
|
2019-06-25 17:20:36 +02:00 |
|
|
d49c0bd2d1
|
XXX Do not run privileged, use udica instead
|
2019-06-25 13:25:51 +02:00 |
|
|
964aef1393
|
🐛 Use my version of ocrd_models until fix is merged
|
2019-06-25 13:24:49 +02:00 |
|
|
51a2ccc224
|
🧹 Remove container after run
|
2019-06-24 17:36:06 +02:00 |
|
|
f3e37dd16c
|
Do not hardcode path to typegroups model binary
|
2019-06-24 17:31:25 +02:00 |
|
|
3f366339ad
|
Add container setup
|
2019-06-24 16:36:19 +02:00 |
|
|
8d66469621
|
Binarize images before segmenting
|
2019-06-24 12:34:08 +02:00 |
|
|
5e1ece4877
|
Use ocrd-tesserocr-segment-*
|
2019-06-24 12:13:49 +02:00 |
|
|
e30f03699c
|
TODO Binarization
|
2019-06-24 12:12:12 +02:00 |
|
|
0d5b5b1b17
|
XXX does ocrd_tesserocr use the LSTM engine?
|
2019-06-24 12:09:35 +02:00 |
|
|
16f2f16dbe
|
XXX <error>INCONSISTENCY in TextRegion ID 'dummy'
|
2019-06-21 12:13:20 +02:00 |
|
|
89abc507e0
|
XXX ocrd-ocropy-segment throws an exception for buerger_gedichte_1778.ocrd
|
2019-06-21 12:10:55 +02:00 |
|
|
ad3a7c2b95
|
XXX remove_filegrp link to OCR-D issue
|
2019-06-21 12:10:19 +02:00 |
|
|
f94230c587
|
Set log level to DEBUG again
|
2019-06-21 12:09:44 +02:00 |
|
|
2b2c39d6d4
|
Add a global LOG_LEVEL option
|
2019-06-19 17:48:38 +02:00 |
|
|
fbc3b8ca4f
|
Fix image references
|
2019-06-19 17:20:05 +02:00 |
|