739 Commits (3dcbb20cac8cb0c35b85ae415817ddd0d32570f6)
 

Author SHA1 Message Date
Clemens Neudecker 3dcbb20cac Merge pull request from bertsky/main
update docker
Robert Sachunsky e9179e1d34 docker: use latest core base stage
Robert Sachunsky f8b4d29a59 docker: prepackage ocrd-all-module-dir.json
vahidrezanezhad e2da7a6239 Fix model name to return the correct machine-based model name
vahidrezanezhad b227736094 Fix OCR text cleaning to correctly handle 'U', 'K', and 'N' starting sentence; update text line splitting size
vahidrezanezhad 4cb4414740 Resolve remaining issue with and resolving
vahidrezanezhad 208bde706f resolving issue
Konstantin Baierer 3e8adb86c2 Merge pull request from qurator-spk/kba-patch-1
CI: Use most recent actions/setup-python@v5
Konstantin Baierer 77dae129d5 CI: Use most recent actions/setup-python@v5
Clemens Neudecker b4df978dd5 Merge pull request from qurator-spk/ci-pypi
CI: pypi
kba 30ba234641 CI: pypi
kba 41318f0404 📝 changelog
vahidrezanezhad a22df11ebb Restoring the contour in the original image caused an error due to an empty tuple. This issue has been resolved, and as expected, the confidence score for this contour is set to zero
kba 8080bd823c 📦 v0.4.0
Robert Sachunsky bcf1898aa4 📝 changelog
Robert Sachunsky 177e017167 test_run: ensure exceptions are shown
vahidrezanezhad e2907f67e0 'from PIL.Image import Image' causes an error when using Image.new(), and since Image is already imported, this line can be safely commented out.
Robert Sachunsky 132d3e3d27 CI: use clash-free artifact name for report upload
Robert Sachunsky dc64079b6b CI: fix coverage report calls
Robert Sachunsky 7609c64c8b CI: make coverage cfg work with both editable and dist install
Robert Sachunsky bbc06dbbc1 CI: forgot to (re-)enable verbose logging
Robert Sachunsky a41f18b13d CI: (try to) store/upload coverage results
Robert Sachunsky 4339444e47 binarization CLI: fix option checks, simplify to asserts, fix dir_in mode
Robert Sachunsky 56cc179d35 pytest: add tests for directory mode (layout+bin)
Robert Sachunsky a3e1b3d4d5 pytest: add asserts for results, add binarization
Robert Sachunsky b03116f4a6 pytest: use subtests for various layout options, add coverage
Robert Sachunsky 91a340f619 CLI: simplify option checks to asserts (also avoid stack trace)
Robert Sachunsky e0a7fde537 logger: fix type hint
Robert Sachunsky 108ce1f5a1 Merge remote-tracking branch 'origin/main' into v3-api-release-foreal
(bad-ass difficult diff diffing)
Konstantin Baierer e0d38517d3 Merge pull request from qurator-spk/v3-api
port processor to core v3
vahidrezanezhad 2e3a29f66b In light mode: To determine whether a main region is a header, I adjusted the ratio to achieve better results.
Konstantin Baierer 85566c2186 Merge pull request from bertsky/v3-api
fix, merge, resolve conflicts, apply review, migrate sbb-binarize
Robert Sachunsky 1a0b9d1958 Merge pull request from bertsky/v3-api-refactor-init
refactoring of Eynollah init and model loading
vahidrezanezhad 38a2d60fa2 Confidence value for textregions and in the case of not light version is set to zero. This is done to let the pipeline go through. It will be updated to return the correct value in upcoming commits
vahidrezanezhad 6b52da227c decorating eynollah with textregion confidence score
Robert Sachunsky 559d001eef another fix to avoid frequent warnings
Robert Sachunsky dd478279a4 CLI: also --overwrite in single-image mode
Robert Sachunsky 8159e6336a fix typo (preventing log messages)
Robert Sachunsky 2919538382 minor fixes to avoid frequent warnings
Robert Sachunsky 903c87aca0 update readme (OCR-D section)
Robert Sachunsky dcf2ed5e22 run: also write out XML in single filename mode
Robert Sachunsky fe77171d45 run_single: reduce indentation
Robert Sachunsky c7dc952851 smoke-test: also test dir-in mode and overwrite
Robert Sachunsky 79003a083c CLI: ValueError instead of print+exit
Robert Sachunsky e17d34fafa factor run_single() out of run(), simplify kwargs
Robert Sachunsky 1a0a1cb00b remove session methods and redundant model loaders
Robert Sachunsky ab3da17547 Update requirements.txt
Co-authored-by: Konstantin Baierer <kba@users.noreply.github.com>
Robert Sachunsky dd51f900b9 OCR-D: init Eynollah in 'setup', re-use instance for each page via non-public API
Robert Sachunsky ffeb4a343d Eynollah: remove useless 'pcgts' attr
Robert Sachunsky 9dc33db108 CI: add binarization models to cache