Commit Graph

  • 51b3cb5fb0
    Merge 77dae129d5 into b4df978dd5 Konstantin Baierer 2025-04-22 13:24:12 +0200
  • 77dae129d5
    CI: Use most recent actions/setup-python@v5 kba-patch-1 Konstantin Baierer 2025-04-22 13:22:28 +0200
  • 9a888d69f5
    Merge 192b9111e3 into b4df978dd5 vahidrezanezhad 2025-04-22 00:23:58 +0200
  • 192b9111e3 updating eynollah README, how to use it for use cases updating_readme_for_eynollah_use_cases vahidrezanezhad 2025-04-22 00:23:01 +0200
  • b4df978dd5
    Merge pull request #154 from qurator-spk/ci-pypi main Clemens Neudecker 2025-04-17 17:01:20 +0200
  • 30ba234641 CI: pypi ci-pypi kba 2025-04-16 19:27:17 +0200
  • 41318f0404 📝 changelog kba 2025-04-15 11:14:26 +0200
  • 87e02c2a4d
    Merge 3b1886973d into a22df11ebb Konstantin Baierer 2025-04-14 11:40:32 +0000
  • a22df11ebb Restoring the contour in the original image caused an error due to an empty tuple. This issue has been resolved, and as expected, the confidence score for this contour is set to zero vahidrezanezhad 2025-04-14 00:42:08 +0200
  • 3b1886973d gha snippets from codecov sample page test-codecov kba 2025-04-08 14:52:38 +0200
  • 60a05711bb test kba 2025-04-08 14:46:22 +0200
  • 8080bd823c 📦 v0.4.0 v0.4.0 kba 2025-04-07 16:48:57 +0200
  • bcf1898aa4 📝 changelog Robert Sachunsky 2025-04-07 16:46:58 +0200
  • 177e017167 test_run: ensure exceptions are shown v3-api-release-foreal-codecov Robert Sachunsky 2025-04-06 18:24:56 +0000
  • e2907f67e0 'from PIL.Image import Image' causes an error when using Image.new(), and since Image is already imported, this line can be safely commented out. vahidrezanezhad 2025-04-06 00:33:36 +0200
  • 132d3e3d27 CI: use clash-free artifact name for report upload Robert Sachunsky 2025-04-05 11:36:21 +0200
  • dc64079b6b CI: fix coverage report calls Robert Sachunsky 2025-04-05 03:40:02 +0200
  • 7609c64c8b CI: make coverage cfg work with both editable and dist install Robert Sachunsky 2025-04-05 03:05:26 +0200
  • bbc06dbbc1 CI: forgot to (re-)enable verbose logging Robert Sachunsky 2025-04-05 02:10:52 +0200
  • a41f18b13d CI: (try to) store/upload coverage results Robert Sachunsky 2025-04-05 01:34:28 +0200
  • 4339444e47 binarization CLI: fix option checks, simplify to asserts, fix dir_in mode Robert Sachunsky 2025-04-05 01:21:08 +0200
  • 56cc179d35 pytest: add tests for directory mode (layout+bin) Robert Sachunsky 2025-04-04 23:48:30 +0200
  • a3e1b3d4d5 pytest: add asserts for results, add binarization Robert Sachunsky 2025-04-04 23:37:00 +0200
  • b03116f4a6 pytest: use subtests for various layout options, add coverage Robert Sachunsky 2025-04-04 22:22:50 +0200
  • 91a340f619 CLI: simplify option checks to asserts (also avoid stack trace) Robert Sachunsky 2025-04-04 20:42:28 +0200
  • e0a7fde537 logger: fix type hint Robert Sachunsky 2025-04-04 20:27:15 +0200
  • 108ce1f5a1 Merge remote-tracking branch 'origin/main' into v3-api-release-foreal Robert Sachunsky 2025-04-04 20:23:23 +0200
  • 5c45cb4aee Merge remote-tracking branch 'origin/main' into v3-api-release kba 2025-04-04 17:20:49 +0200
  • e0d38517d3
    Merge pull request #130 from qurator-spk/v3-api machine_based_reading_order_integration Konstantin Baierer 2025-04-04 16:01:45 +0200
  • 2e3a29f66b In light mode: To determine whether a main region is a header, I adjusted the ratio to achieve better results. vahidrezanezhad 2025-04-04 15:36:31 +0200
  • 85566c2186
    Merge pull request #148 from bertsky/v3-api Konstantin Baierer 2025-04-04 13:31:00 +0200
  • 1a0b9d1958
    Merge pull request #1 from bertsky/v3-api-refactor-init Robert Sachunsky 2025-04-04 13:30:23 +0200
  • e9a4324b8f Merge branch 'v3-api-refactor-init' into bertsky-v3-api kba 2025-04-04 13:14:12 +0200
  • 38a2d60fa2 Confidence value for textregions and in the case of not light version is set to zero. This is done to let the pipeline go through. It will be updated to return the correct value in upcomming commits vahidrezanezhad 2025-04-03 12:47:27 +0200
  • 6b52da227c docorating eynollah with textregion confidence score #135 vahidrezanezhad 2025-04-03 00:39:21 +0200
  • 559d001eef another fix to avoid frequent warnings Robert Sachunsky 2025-04-02 05:45:34 +0000
  • dd478279a4 CLI: also --overwrite in single-image mode Robert Sachunsky 2025-04-02 05:40:21 +0000
  • 8159e6336a fix typo (preventing log messages) Robert Sachunsky 2025-04-02 00:01:02 +0000
  • 2919538382 minor fixes to avoid frequent warnings Robert Sachunsky 2025-04-01 23:33:26 +0000
  • 903c87aca0 update readme (OCR-D section) Robert Sachunsky 2025-04-01 23:26:38 +0200
  • dcf2ed5e22 run: also write out XML in single filename mode Robert Sachunsky 2025-04-01 23:13:24 +0200
  • fe77171d45 run_single: reduce indentation Robert Sachunsky 2025-04-01 22:47:33 +0200
  • c7dc952851 smoke-test: also test dir-in mode and overwrite Robert Sachunsky 2025-04-01 22:43:30 +0200
  • 79003a083c CLI: ValueError instead of print+exit Robert Sachunsky 2025-04-01 22:43:01 +0200
  • e17d34fafa factor run_single() out of run(), simplify kwargs Robert Sachunsky 2025-04-01 22:12:24 +0200
  • 1a0a1cb00b remove session methods and redundant model loaders Robert Sachunsky 2025-04-01 21:15:41 +0200
  • ab3da17547
    Update requirements.txt Robert Sachunsky 2025-04-01 18:13:28 +0200
  • dd51f900b9 OCR-D: init Eynollah in 'setup', re-use instance for each page via non-public API Robert Sachunsky 2025-04-01 13:02:30 +0200
  • ffeb4a343d Eynollah: remove useless 'pcgts' attr Robert Sachunsky 2025-04-01 13:00:41 +0200
  • 9dc33db108 CI: add binarization models to cache Robert Sachunsky 2025-04-01 11:36:56 +0200
  • 9c769d4cc5 CI: run CLI tests, too Robert Sachunsky 2025-04-01 11:13:16 +0200
  • 250fc02606 add tests for binarization, remove dependency on deps-test Robert Sachunsky 2025-04-01 11:13:04 +0200
  • 91b2201b07 cnnrnn Ocr: width of input textline image can not be zero! vahidrezanezhad 2025-04-01 10:55:40 +0200
  • 515b4023f6 sbb_binarize: fix missing reference Robert Sachunsky 2025-04-01 10:54:36 +0200
  • 95a681aa8c add Continuous Deployment via Dockerhub and GHCR Robert Sachunsky 2025-04-01 01:27:10 +0200
  • df3510750c
    Github Actions CI: no more Docker clean or build Robert Sachunsky 2025-04-01 00:28:16 +0200
  • 45e3ab9692
    Github Actions: free space: all existing Docker images Robert Sachunsky 2025-04-01 00:23:53 +0200
  • 4de441eaaa OCR prediction is now enabled to integrate results from both RGB and binarized images or to be performed on each individually vahidrezanezhad 2025-03-31 21:28:05 +0200
  • b1da0a3327 In OCR, the predicted text is now drawn on the image, and the results are saved in a specified directory. This makes it easier to review the predicted output vahidrezanezhad 2025-03-31 18:43:14 +0200
  • 31aeb9629d
    Github Actions: free space more aggressively Robert Sachunsky 2025-03-31 18:16:17 +0200
  • 7430b57b65 dockerfile: add smoke test Robert Sachunsky 2025-03-31 16:56:47 +0200
  • f35f49376e run CLI test in TMPDIR, add ocrd-test Robert Sachunsky 2025-03-31 16:55:57 +0200
  • ae066388ea docker: no need for g++, but install w/ 'EXTRAS=OCR' Robert Sachunsky 2025-03-31 15:58:57 +0200
  • 722b5c6bf1 add make variable EXTRAS for optional dependencies Robert Sachunsky 2025-03-31 15:58:12 +0200
  • c01609ff4e allow even more empty imports for optional dependencies Robert Sachunsky 2025-03-31 15:57:22 +0200
  • 51e9bfd6d7 improve+extend dockerfile Robert Sachunsky 2025-03-31 14:14:08 +0200
  • 09248d4829 improve+extend makefile Robert Sachunsky 2025-03-31 14:13:16 +0200
  • 46618f4229 allow more empty imports for optional dependencies Robert Sachunsky 2025-03-31 14:11:50 +0200
  • 4be89910a2 CLI: fix arg vs kwarg from merge Robert Sachunsky 2025-03-31 02:38:24 +0200
  • 9d61acf173 simplify Robert Sachunsky 2025-03-31 02:02:30 +0200
  • a1068ff2eb OCR-D: move sbb-binarize to ocrd-tool.json, update to v3 Robert Sachunsky 2025-03-31 01:47:32 +0200
  • c794d4d29f OCR-D: fix typo light_mode→light_version Robert Sachunsky 2025-03-31 01:46:29 +0200
  • 4338259ca1 OCR-D: ensure page image gets replaced in result as well if not the original file Robert Sachunsky 2025-03-31 01:17:14 +0200
  • 55969b0173 OCR-D: add docstring Robert Sachunsky 2025-03-31 01:15:26 +0200
  • 3916474b8b OCR-D: require >=v3.1 Robert Sachunsky 2025-03-31 01:15:12 +0200
  • 6d02e90570 OCR-D: restrict max_workers=1 Robert Sachunsky 2025-03-31 01:14:54 +0200
  • efd3fa6775 allow empty imports for optional dependencies Robert Sachunsky 2025-03-31 00:32:26 +0200
  • 238132e260 use 'image_filename' for pseudo-iteration outside 'dir_in' mode Robert Sachunsky 2025-03-31 00:31:49 +0200
  • af4e2a4ffc do not require 'dir_out' outside 'dir_in' mode Robert Sachunsky 2025-03-31 00:31:09 +0200
  • ea136e3ddd 'overwrite' check: only in 'dir_in' mode Robert Sachunsky 2025-03-31 00:30:06 +0200
  • 1f4a17b60d Merge remote-tracking branch 'origin/machine_based_reading_order_integration' into v3-api Robert Sachunsky 2025-03-30 20:59:01 +0200
  • edf924c2cb ocrd-tool: add dockerhub Robert Sachunsky 2025-03-30 19:47:25 +0200
  • 9b04688ebc The rotate_image function has been updated. Additionally, the reading order is now correct in the case of the light version, provided that slope_deskew exceeds the slope_threshold. vahidrezanezhad 2025-03-30 15:34:27 +0200
  • cf40f9ecc5 The rotate_image function produces the exact same rotation as Imutils. Therefore, there is no need to retain the remove-imutils-1 branch. vahidrezanezhad 2025-03-28 20:58:32 +0100
  • b55389ac62
    Update requirements.txt vahidrezanezhad 2025-03-28 14:59:31 +0100
  • 8bf70d905f
    Merge pull request #147 from qurator-spk/revert-146-remove-imutils-1 vahidrezanezhad 2025-03-28 14:58:04 +0100
  • f756b08c9b
    Revert "replace usages of `imutils` with opencv equivalents" revert-146-remove-imutils-1 vahidrezanezhad 2025-03-28 14:57:40 +0100
  • c9de578d4d removing imutils from requirements vahidrezanezhad 2025-03-28 11:25:03 +0100
  • 52c605185a
    Merge pull request #146 from qurator-spk/remove-imutils-1 vahidrezanezhad 2025-03-28 11:10:25 +0100
  • 0e9a72ea52 consolidate usage documentation cneud 2025-03-27 23:14:59 +0100
  • 3a55b6ce91 consolidate usage documentation cneud 2025-03-27 23:11:18 +0100
  • e9fa691308 add model and training documentation cneud 2025-03-27 22:41:10 +0100
  • 6f36c7177f For OCR, the splitting ratio of text lines is adjusted vahidrezanezhad 2025-03-27 18:24:47 +0100
  • b29b137986 not needle in haystack -> needle not in haystack code-suggestions kba 2025-03-27 14:35:08 +0100
  • 181c0c584f bbox rotation with opencv cneud 2025-03-26 22:25:22 +0100
  • eaff9e3537 Merge branch 'main' into remove-imutils-1 cneud 2025-03-26 20:16:46 +0100
  • 7df0427b04 In the context of OCR, if Page-XML files already contain text, the new predicted text will replace the existing text. vahidrezanezhad 2025-03-26 18:42:06 +0100
  • 370d44a66b Slope deskew in the light version is set to zero because when the slope_deskew value exceeds the slope_threshold, the reading order becomes incorrect. This issue needs to be addressed. Additionally, the textlines order within text region in the light version was reversed, and this has been corrected. vahidrezanezhad 2025-03-26 10:45:34 +0100
  • 4179a2ea3a integrate binarization in ocrd-tool.json cneud 2025-03-26 01:27:02 +0100
  • 284c2aed7f unresolved references cneud 2025-03-26 01:06:19 +0100