Gerber, Mike
|
c23930c8df
|
✨ Add an example for ocrd-cis-ocropy-segment
|
4 years ago |
Gerber, Mike
|
0841af5491
|
🚧 Prepare supporting ocrd-sbb-binarize
ocrd-sbb-binarize seems to work but its input does not work with
ocrd-sbb-textline-detector:
https://github.com/qurator-spk/sbb_binarization/issues/8
https://github.com/qurator-spk/sbb_textline_detection/issues/47
|
4 years ago |
Gerber, Mike
|
568cf60b4c
|
⚙️ Consistently set LOG_LEVEL to INFO by default
|
4 years ago |
Gerber, Mike
|
17c6b15a1b
|
🐛 (Better) Handle missing pip3 in the main script
|
4 years ago |
Gerber, Mike
|
fc853d4d13
|
🐛 Handle missing pip3 in the main script
|
4 years ago |
Gerber, Mike
|
0b1da9a5db
|
🧹 Update Calamari model path
|
4 years ago |
Gerber, Mike
|
d1a2bfe669
|
🐛 Deal with ocrd_olena >= 1.2.0 using one output file group only
|
4 years ago |
Gerber, Mike
|
1a308a5522
|
🧹 Use OCR-D's -P, remove now redundant validation and remove now unnecessary functions
|
4 years ago |
Gerber, Mike
|
efd955c04f
|
🧹 Modernize my_ocrd_workflow and use OCR-D's new --overwrite
|
4 years ago |
Gerber, Mike
|
c5ae23d2ef
|
✨ Validate before even starting, to find data problems
|
4 years ago |
Gerber, Mike
|
a5b4e06a09
|
✨ Allow skipping validation
|
5 years ago |
Gerber, Mike
|
78f632a523
|
✨ Support --input-file-grp/-I command line parameter
|
5 years ago |
Gerber, Mike
|
58282c9e95
|
✨ Include glyph output
|
5 years ago |
Gerber, Mike
|
11a30892c5
|
🔍 Only do pip3 list when LOG_LEVEL >= DEBUG
|
5 years ago |
Gerber, Mike
|
9f111ca362
|
🧹 Do not validate OCR results twice
|
5 years ago |
Gerber, Mike
|
8ca25f3c56
|
🎨 Expose OCR textequiv_level as a environment variable
|
5 years ago |
Gerber, Mike
|
979c7044a8
|
✨ Make OCR-D-IMG-BIN output group explicit
|
5 years ago |
Gerber, Mike
|
28bb482ceb
|
✨ Produce word results
|
5 years ago |
Gerber, Mike
|
6ae85063c5
|
📝 Document do_validate() options better
|
5 years ago |
Gerber, Mike
|
2cf68f149d
|
♻ Extract a main() function for the main stuff
|
5 years ago |
Gerber, Mike
|
be0a0c353a
|
📝 Document the two remaining un-documented functions
|
5 years ago |
Gerber, Mike
|
848dd143fd
|
🎨 Use long command lines again
|
5 years ago |
Gerber, Mike
|
6b83d5ae1e
|
🧹 Update/move some XXXs/TODOs
|
5 years ago |
Gerber, Mike
|
5a55598d0c
|
🧹 Remove image reference fixing remnants - jpageviewer now has --resolve-dir
|
5 years ago |
Gerber, Mike
|
44979e7fa2
|
🧹 do_linesegmentation_sbb: It's now clear that sbb segmentation works with RGB images
|
5 years ago |
Gerber, Mike
|
460b6c34d1
|
✏ Fix typo in $ocrd_olena_binarize_parameters
|
5 years ago |
Gerber, Mike
|
71d54c6978
|
🔧 Set up logging level using /etc/ocrd_logging.py instead of "-l"
|
5 years ago |
Gerber, Mike
|
1a538dce1a
|
🧹 Remove superfluous mets.xml options
|
5 years ago |
Gerber, Mike
|
c192bfdbfe
|
🧹 Remove workaround for TEMP/ directory bug
|
5 years ago |
Gerber, Mike
|
d7a2aac44b
|
♻ Remove file groups using "ocrd workspace remove-group"
|
5 years ago |
Gerber, Mike
|
c8039db686
|
🎨 Put validate options into a variable
|
5 years ago |
Gerber, Mike
|
5ece7f1b0a
|
🧹 Remove remnants of ocrd-ocropy-segment
|
5 years ago |
Gerber, Mike
|
135489eaeb
|
🧹 Remove page_downgrade_to_2018
|
5 years ago |
Gerber, Mike
|
423d9c2ed6
|
🚧 do_validate: Skip dimension checking
|
5 years ago |
Gerber, Mike
|
948e9074df
|
⬆ Update to ocrd_calamari 0.0.4
|
5 years ago |
Gerber, Mike
|
1ef850992c
|
🎨 Use same style of specifying parameters for all processors
|
5 years ago |
Gerber, Mike
|
b468d688f2
|
🧹 Remove font identification for now
|
5 years ago |
Gerber, Mike
|
07555e8270
|
🎨 Use new OCR-D JSON string parameters
|
5 years ago |
Gerber, Mike
|
9c31d604e9
|
⬆ Update ocrd-sbb-textline-detector command
|
5 years ago |
Gerber, Mike
|
fd56731464
|
🚧 Do not check PAGE coordinates for now
|
5 years ago |
Gerber, Mike
|
87a2bce93c
|
⬆ Update calamari-models URL + path
|
5 years ago |
Gerber, Mike
|
d166077a55
|
✨ Update to sbb_textline_detector with the fixed AlternativeImage support (= merged PAGE results)
|
5 years ago |
Gerber, Mike
|
de47a3e5b1
|
🔥 Remove now unused page_fix_image_references()
|
5 years ago |
Gerber, Mike
|
1af18c629e
|
🧹 Validate imagefilename again
|
5 years ago |
Gerber, Mike
|
de49aa715b
|
⬆ Update to OCR-D 1.0.0
|
5 years ago |
Gerber, Mike
|
7025d960b4
|
✨ Use ocrd_olena for binarization
|
5 years ago |
Gerber, Mike
|
3687d6d7b4
|
🧹 Do not remove line confidences anymore
|
5 years ago |
Gerber, Mike
|
6454d20998
|
✨ Use sbb_textline_detector to segment lines
|
5 years ago |
Gerber, Mike
|
bdab016e2c
|
✨ Use GT4HistOCR_2000000 model from qurator-data for Tesseract
|
5 years ago |
Gerber, Mike
|
47dd5d3b62
|
🎨 Move XML schemata to a better path
|
5 years ago |