Commit graph

743 commits

Author SHA1 Message Date
vahidrezanezhad
8e2d0f3179
Merge 21ec4fbfb5 into 3dcbb20cac 2025-05-07 12:04:07 +00:00
vahidrezanezhad
21ec4fbfb5 The text region coordinates are now correctly written into the XML output when using the skip layout and reading order option 2025-05-07 14:04:01 +02:00
vahidrezanezhad
83211ae684 In the case of skip_layout_and_reading_order, the confidence value was not set correctly, leading to an error while writing to the XML file. 2025-05-07 12:33:03 +02:00
Clemens Neudecker
3dcbb20cac
Merge pull request #159 from bertsky/main
update docker
2025-05-06 15:14:06 +02:00
Robert Sachunsky
e9179e1d34 docker: use latest core base stage 2025-05-02 00:16:22 +02:00
Robert Sachunsky
f8b4d29a59 docker: prepackage ocrd-all-module-dir.json 2025-05-02 00:16:22 +02:00
vahidrezanezhad
e2da7a6239 Fix model name to return the correct machine-based model name 2025-04-30 16:06:29 +02:00
vahidrezanezhad
b227736094 Fix OCR text cleaning to correctly handle 'U', 'K', and 'N' starting sentence; update text line splitting size 2025-04-30 16:04:34 +02:00
vahidrezanezhad
4cb4414740 Resolve remaining issue with #158 and resolving #124 2025-04-30 16:01:52 +02:00
vahidrezanezhad
208bde706f resolving issue #158 2025-04-30 13:55:09 +02:00
Konstantin Baierer
3e8adb86c2
Merge pull request #157 from qurator-spk/kba-patch-1
CI: Use most recent actions/setup-python@v5
2025-04-29 11:42:18 +02:00
Konstantin Baierer
77dae129d5
CI: Use most recent actions/setup-python@v5 2025-04-22 13:22:28 +02:00
vahidrezanezhad
192b9111e3 updating eynollah README, how to use it for use cases 2025-04-22 00:23:01 +02:00
Clemens Neudecker
b4df978dd5
Merge pull request #154 from qurator-spk/ci-pypi
CI: pypi
2025-04-17 17:01:20 +02:00
kba
30ba234641 CI: pypi 2025-04-16 19:27:17 +02:00
kba
41318f0404 📝 changelog 2025-04-15 11:14:26 +02:00
vahidrezanezhad
a22df11ebb Restoring the contour in the original image caused an error due to an empty tuple. This issue has been resolved, and as expected, the confidence score for this contour is set to zero 2025-04-14 00:42:08 +02:00
kba
8080bd823c 📦 v0.4.0 2025-04-07 16:48:57 +02:00
Robert Sachunsky
bcf1898aa4 📝 changelog 2025-04-07 16:46:58 +02:00
Robert Sachunsky
177e017167 test_run: ensure exceptions are shown 2025-04-07 10:39:50 +00:00
vahidrezanezhad
e2907f67e0 'from PIL.Image import Image' causes an error when using Image.new(), and since Image is already imported, this line can be safely commented out. 2025-04-06 00:33:36 +02:00
Robert Sachunsky
132d3e3d27 CI: use clash-free artifact name for report upload 2025-04-05 11:36:21 +02:00
Robert Sachunsky
dc64079b6b CI: fix coverage report calls 2025-04-05 03:40:02 +02:00
Robert Sachunsky
7609c64c8b CI: make coverage cfg work with both editable and dist install 2025-04-05 03:05:26 +02:00
Robert Sachunsky
bbc06dbbc1 CI: forgot to (re-)enable verbose logging 2025-04-05 02:10:52 +02:00
Robert Sachunsky
a41f18b13d CI: (try to) store/upload coverage results 2025-04-05 01:34:28 +02:00
Robert Sachunsky
4339444e47 binarization CLI: fix option checks, simplify to asserts, fix dir_in mode 2025-04-05 01:21:08 +02:00
Robert Sachunsky
56cc179d35 pytest: add tests for directory mode (layout+bin) 2025-04-05 01:20:38 +02:00
Robert Sachunsky
a3e1b3d4d5 pytest: add asserts for results, add binarization 2025-04-04 23:37:00 +02:00
Robert Sachunsky
b03116f4a6 pytest: use subtests for various layout options, add coverage 2025-04-04 22:22:50 +02:00
Robert Sachunsky
91a340f619 CLI: simplify option checks to asserts (also avoid stack trace) 2025-04-04 20:42:28 +02:00
Robert Sachunsky
e0a7fde537 logger: fix type hint 2025-04-04 20:27:15 +02:00
Robert Sachunsky
108ce1f5a1 Merge remote-tracking branch 'origin/main' into v3-api-release-foreal
(bad-ass difficult diff diffing)
2025-04-04 20:23:23 +02:00
Konstantin Baierer
e0d38517d3
Merge pull request #130 from qurator-spk/v3-api
port processor to core v3
2025-04-04 16:01:45 +02:00
vahidrezanezhad
2e3a29f66b In light mode: To determine whether a main region is a header, I adjusted the ratio to achieve better results. 2025-04-04 15:36:31 +02:00
Konstantin Baierer
85566c2186
Merge pull request #148 from bertsky/v3-api
fix, merge, resolve conflicts, apply review, migrate sbb-binarize
2025-04-04 13:31:00 +02:00
Robert Sachunsky
1a0b9d1958
Merge pull request #1 from bertsky/v3-api-refactor-init
refactoring of Eynollah init and model loading
2025-04-04 13:30:23 +02:00
vahidrezanezhad
38a2d60fa2 Confidence value for textregions and in the case of not light version is set to zero. This is done to let the pipeline go through. It will be updated to return the correct value in upcomming commits 2025-04-03 12:47:27 +02:00
vahidrezanezhad
6b52da227c docorating eynollah with textregion confidence score #135 2025-04-03 00:39:21 +02:00
Robert Sachunsky
559d001eef another fix to avoid frequent warnings 2025-04-02 05:45:34 +00:00
Robert Sachunsky
dd478279a4 CLI: also --overwrite in single-image mode 2025-04-02 05:40:21 +00:00
Robert Sachunsky
8159e6336a fix typo (preventing log messages) 2025-04-02 00:01:02 +00:00
Robert Sachunsky
2919538382 minor fixes to avoid frequent warnings 2025-04-01 23:33:26 +00:00
Robert Sachunsky
903c87aca0 update readme (OCR-D section) 2025-04-01 23:26:38 +02:00
Robert Sachunsky
dcf2ed5e22 run: also write out XML in single filename mode 2025-04-01 23:13:24 +02:00
Robert Sachunsky
fe77171d45 run_single: reduce indentation 2025-04-01 22:47:33 +02:00
Robert Sachunsky
c7dc952851 smoke-test: also test dir-in mode and overwrite 2025-04-01 22:43:30 +02:00
Robert Sachunsky
79003a083c CLI: ValueError instead of print+exit 2025-04-01 22:43:01 +02:00
Robert Sachunsky
e17d34fafa factor run_single() out of run(), simplify kwargs 2025-04-01 22:12:24 +02:00
Robert Sachunsky
1a0a1cb00b remove session methods and redundant model loaders 2025-04-01 21:15:41 +02:00