-
224762e1bb
🐛 Let ocrd_calamari handle the weird setuptools depencency
Gerber, Mike
2019-09-27 16:31:30 +0200
-
a272237bd8
⬆ Update ocrd dependency
Gerber, Mike
2019-09-27 15:42:11 +0200
-
2b9cab1a1a
⬆ Update ocrd_calamari
Gerber, Mike
2019-09-27 15:34:38 +0200
-
e5cd5b937e
✨ Run pip3 list for easier checking
Gerber, Mike
2019-09-27 13:16:14 +0200
-
bd24624bd7
⬆ Do not downgrade to PAGE 2018 anymore
Gerber, Mike
2019-09-27 13:02:46 +0200
-
0b2b66a0b4
🔧 Allow setting LOG_LEVEL
Gerber, Mike
2019-09-27 12:09:37 +0200
-
d6c38b5b9f
🧹 Do not install extra tesserocr
Gerber, Mike
2019-09-26 18:12:15 +0200
-
f19bba45b8
💩 Remove mysterious TEMP directory for now
Gerber, Mike
2019-09-26 16:55:54 +0200
-
68902f923d
📜 Downgrading to PAGE 2018 is not the last step anymore
Gerber, Mike
2019-09-26 16:55:02 +0200
-
6c0d7e0aee
💩 Do not fix PAGE image references for now
Gerber, Mike
2019-09-26 16:46:12 +0200
-
e3bf65b502
⬆ Update dinglehopper
Gerber, Mike
2019-09-26 15:24:52 +0200
-
87968cd297
🧹 README: Move TODO to my usual TODO list
Gerber, Mike
2019-09-24 13:44:15 +0200
-
debecf71b9
💩 Install the right Pillow version manually...
Gerber, Mike
2019-09-23 15:04:28 +0200
-
a3d6befb0d
🏗 Build Tesseract from source
Gerber, Mike
2019-09-23 15:02:59 +0200
-
d903e3634c
📝 README: Clarify workspace TODO
Gerber, Mike
2019-09-23 15:01:27 +0200
-
8a04602044
📝 README: Not using podman anymore
Gerber, Mike
2019-09-23 15:01:03 +0200
-
0782dbde32
⬆ Update dinglehopper
Gerber, Mike
2019-08-22 15:37:48 +0200
-
343a3fbf82
🔧 Evaluate both Tesseract and Calamari results
Gerber, Mike
2019-08-21 13:07:27 +0200
-
0bc06c2fad
✨ Run Calamari OCR
Gerber, Mike
2019-08-21 11:54:01 +0200
-
001e62f54a
🔧 Use docker, not podman
Gerber, Mike
2019-08-20 12:25:12 +0200
-
daed87566e
🚑 Don't install typegroups classifier for now
Gerber, Mike
2019-08-16 18:23:15 +0200
-
d8f3438ac5
🚑 Don't check pixel density
Gerber, Mike
2019-08-16 18:21:59 +0200
-
b169f35bb1
🔧 Build container with cache again
Gerber, Mike
2019-08-16 18:21:12 +0200
-
85ff80d548
✨ Use dinglehopper's new OCR-D interface
Gerber, Mike
2019-08-16 14:04:36 +0200
-
d5aa273b44
🚧 Use ocr-eval aka dinglehopper
Gerber, Mike
2019-08-13 18:13:49 +0200
-
be5750f4e1
✨ As a last step, downgrade to PAGE 2018 to support PAGE Viewer
Gerber, Mike
2019-08-05 18:46:36 +0200
-
cf2b4de2a0
🧹 Validate again after fixing image references
Gerber, Mike
2019-08-05 17:46:20 +0200
-
21e00932be
🐛 Use a valid filegrp USE for fontident
Gerber, Mike
2019-08-05 17:38:24 +0200
-
ade39a278c
🎨 Align file groups
Gerber, Mike
2019-08-05 17:08:58 +0200
-
3fee2d4fe6
📌 Use my ocrd_typegroups_classifier fix for passing down the page id
Gerber, Mike
2019-08-05 17:00:54 +0200
-
44772f1923
🚧 Work around problems with ocrd-tesserocr producing TextEquiv/@conf
Gerber, Mike
2019-08-05 15:40:39 +0200
-
8b67866aac
✨ Validate PAGE XML after OCR
Gerber, Mike
2019-08-05 15:31:24 +0200
-
0d7fd21446
✨ Validate workspace after each step
Gerber, Mike
2019-08-05 15:27:38 +0200
-
d37db86da1
📌 Use my ocrd_kraken fix for passing down the page id
Gerber, Mike
2019-08-05 13:19:44 +0200
-
4addde2e19
Use PAGE 2019
Gerber, Mike
2019-08-02 11:59:16 +0200
-
de841746e3
Use PAGE 2019
Gerber, Mike
2019-08-02 11:58:56 +0200
-
ff0570e151
Use frk for now
Gerber, Mike
2019-08-02 11:58:46 +0200
-
b4f5d44ac8
🐛 ocrd-bugs: bug-ocropy-segment-littering.sh
Gerber, Mike
2019-08-01 15:13:04 +0200
-
cc81afa1a5
🧹 No need to clean up after tesserocr
Gerber, Mike
2019-07-03 13:46:49 +0200
-
89a2893e4e
❌ I do not care for the multiple mets:agents elements
Gerber, Mike
2019-07-03 12:35:15 +0200
-
0e63fa1756
⁉ PyTessApi seems to use both engine modes
Gerber, Mike
2019-07-03 12:30:52 +0200
-
e3a1afbc93
📝 Document the functions
Gerber, Mike
2019-07-03 12:22:55 +0200
-
2204aee104
🐋 Docker: Simplify requirements install
Gerber, Mike
2019-07-02 17:31:42 +0200
-
ddda6e48bc
🐛 Add my collection of OCR-D bug reproducers
Gerber, Mike
2019-06-26 16:26:13 +0200
-
cfa7d10747
📜 Add README.md
Gerber, Mike
2019-06-25 17:54:16 +0200
-
0ea6b02fff
⬆ Update to ocrd >= 1.0.0b10
Gerber, Mike
2019-06-25 17:20:36 +0200
-
d49c0bd2d1
XXX Do not run privileged, use udica instead
Gerber, Mike
2019-06-25 13:25:51 +0200
-
964aef1393
🐛 Use my version of ocrd_models until fix is merged
Gerber, Mike
2019-06-25 13:24:49 +0200
-
51a2ccc224
🧹 Remove container after run
Gerber, Mike
2019-06-24 17:36:06 +0200
-
f3e37dd16c
Do not hardcode path to typegroups model binary
Gerber, Mike
2019-06-24 17:31:25 +0200
-
3f366339ad
Add container setup
Gerber, Mike
2019-06-24 16:36:19 +0200
-
8d66469621
Binarize images before segmenting
Gerber, Mike
2019-06-24 12:34:08 +0200
-
5e1ece4877
Use ocrd-tesserocr-segment-*
Gerber, Mike
2019-06-24 12:13:49 +0200
-
e30f03699c
TODO Binarization
Gerber, Mike
2019-06-24 12:12:12 +0200
-
0d5b5b1b17
XXX does ocrd_tesserocr use the LSTM engine?
Gerber, Mike
2019-06-24 12:09:35 +0200
-
16f2f16dbe
XXX <error>INCONSISTENCY in TextRegion ID 'dummy'
Gerber, Mike
2019-06-21 12:13:20 +0200
-
89abc507e0
XXX ocrd-ocropy-segment throws an exception for buerger_gedichte_1778.ocrd
Gerber, Mike
2019-06-21 12:10:55 +0200
-
ad3a7c2b95
XXX remove_filegrp link to OCR-D issue
Gerber, Mike
2019-06-21 12:10:19 +0200
-
f94230c587
Set log level to DEBUG again
Gerber, Mike
2019-06-21 12:09:44 +0200
-
2b2c39d6d4
Add a global LOG_LEVEL option
Gerber, Mike
2019-06-19 17:48:38 +0200
-
fbc3b8ca4f
Fix image references
Gerber, Mike
2019-06-19 17:20:05 +0200
-
b6c490e18b
Add a PAGE fix XML step
Gerber, Mike
2019-06-19 15:03:16 +0200
-
d98ce2d2d4
Add a PAGE validation step
Gerber, Mike
2019-06-19 14:55:50 +0200
-
10c4068a99
XXX Global -l DEBUG
Gerber, Mike
2019-06-19 13:26:28 +0200
-
f8f44e990d
Clean up after ocrd-ocropy-segment's mess
Gerber, Mike
2019-06-19 13:26:11 +0200
-
243ddea674
Use ocrd-ocropy-segment instead of non-functional ocrd-tesserocr-segment-line
Gerber, Mike
2019-06-19 13:24:25 +0200
-
9bd3853c78
Add OCR step
Gerber, Mike
2019-06-19 13:02:54 +0200
-
a64b9cf5c8
XXX Multiple calls create multiple identical mets:agent elements
Gerber, Mike
2019-06-19 12:52:10 +0200
-
c207859bcd
Refactor: Extract functions for the steps
Gerber, Mike
2019-06-19 12:51:52 +0200
-
a2d547b857
Reformat to use shorter lines
Gerber, Mike
2019-06-19 12:39:42 +0200
-
b5f9dcb7f3
Initial commit
Gerber, Mike
2019-06-19 12:22:41 +0200