Commit graph

  • d14bd162ca saving enhanced image in org or scaled resolution vahidrezanezhad 2025-06-01 22:10:13 +02:00
  • 3b475915c7 image enhancer is integrated vahidrezanezhad 2025-06-01 15:53:04 +02:00
  • cc36694dfd image enhancer is integrated vahidrezanezhad 2025-06-01 15:53:04 +02:00
  • df903aa1b4 Parametrize OCR for handling curved lines vahidrezanezhad 2025-05-31 01:09:14 +02:00
  • 928a548b70 Parametrize OCR for handling curved lines vahidrezanezhad 2025-05-31 01:09:14 +02:00
  • 1e7cecfcf9 updating ocr vahidrezanezhad 2025-05-28 01:17:21 +02:00
  • 48285ce3f5 updating ocr vahidrezanezhad 2025-05-28 01:17:21 +02:00
  • 03f52e7a46 updating ocr vahidrezanezhad 2025-05-27 23:45:22 +02:00
  • b93fc112bf updating ocr vahidrezanezhad 2025-05-27 23:45:22 +02:00
  • 31d9fa0c80 strings alignment function is added + new changes needed for prediction with both bin and rgb inputs is implemented vahidrezanezhad 2025-05-25 21:44:36 +02:00
  • 0f154c605a strings alignment function is added + new changes needed for prediction with both bin and rgb inputs is implemented vahidrezanezhad 2025-05-25 21:44:36 +02:00
  • b18691f96a rnn ocr for all layout textregion types vahidrezanezhad 2025-05-25 03:33:54 +02:00
  • 097520bfd2 rnn ocr for all layout textregion types vahidrezanezhad 2025-05-25 03:33:54 +02:00
  • ba3420b2d8 Drop capitals are written separately and are not attached to their corresponding text line. The OCR use case also supports single-image input. vahidrezanezhad 2025-05-25 01:12:58 +02:00
  • 27c4b0d0e0 Drop capitals are written separately and are not attached to their corresponding text line. The OCR use case also supports single-image input. vahidrezanezhad 2025-05-25 01:12:58 +02:00
  • 25e3a2a99f visualizing ro for single xml file vahidrezanezhad 2025-05-23 18:30:51 +02:00
  • 0250a6d3d0 enhancing ocr vahidrezanezhad 2025-05-23 18:06:53 +02:00
  • adcf03c7b7 enhancing ocr vahidrezanezhad 2025-05-23 18:06:53 +02:00
  • 089029cec7 commit 21ec4fb is picked + rnn ocr at the same time with segmentation + enhancement of mb reading order vahidrezanezhad 2025-05-23 15:55:03 +02:00
  • d4f6e10251 commit 21ec4fb is picked + rnn ocr at the same time with segmentation + enhancement of mb reading order vahidrezanezhad 2025-05-23 15:55:03 +02:00
  • ee2c7e9013 enhancing curved lines OCR vahidrezanezhad 2025-05-21 17:42:44 +02:00
  • a0647eff93 enhancing curved lines OCR vahidrezanezhad 2025-05-21 17:42:44 +02:00
  • 14b70c2556 Implement hyphenated textline merging in OCR engine and a bug fixed for curved textline OCR vahidrezanezhad 2025-05-21 14:39:31 +02:00
  • f94fc9973b Implement hyphenated textline merging in OCR engine and a bug fixed for curved textline OCR vahidrezanezhad 2025-05-21 14:39:31 +02:00
  • 3ad621e956 ocr for curved lines vahidrezanezhad 2025-05-20 19:01:52 +02:00
  • c0835665a9 ocr for curved lines vahidrezanezhad 2025-05-20 19:01:52 +02:00
  • 44ff51f5c1 mb reading order now can be done faster. Text regions are clustered using dilation, and mb reading order needs to be implemented for fewer regions vahidrezanezhad 2025-05-20 16:51:08 +02:00
  • 848156dd9d mb reading order now can be done faster. Text regions are clustered using dilation, and mb reading order needs to be implemented for fewer regions vahidrezanezhad 2025-05-20 16:51:08 +02:00
  • 5016039cd7 enhancing marginal detection for light version vahidrezanezhad 2025-05-18 02:48:05 +02:00
  • 7a34bbb493 enhancing marginal detection for light version vahidrezanezhad 2025-05-18 02:48:05 +02:00
  • f298643fcf
    Fix ReduceONPlateau wrong logic johnlockejrr 2025-05-17 23:24:40 +03:00
  • 7661080899
    LR Warmup and Optimization Implementation johnlockejrr 2025-05-17 16:17:38 +03:00
  • f9390c71e7 updating inference for mb reading order vahidrezanezhad 2025-05-17 02:18:27 +02:00
  • 1cbc669d36 marginals detection enhanced for light version vahidrezanezhad 2025-05-15 15:33:50 +02:00
  • 0819730355 marginals detection enhanced for light version vahidrezanezhad 2025-05-15 15:33:50 +02:00
  • 1b229ba7ae enhancement for vertical textlines vahidrezanezhad 2025-05-15 00:45:22 +02:00
  • adee1dc55c enhancement for vertical textlines vahidrezanezhad 2025-05-15 00:45:22 +02:00
  • 7a22e51f5d resolve some comments from review cneud 2025-05-14 21:56:03 +02:00
  • ed46615f00 enhance ocr for vertical textlines vahidrezanezhad 2025-05-14 18:34:58 +02:00
  • a9cdd56e9a enhance ocr for vertical textlines vahidrezanezhad 2025-05-14 18:34:58 +02:00
  • 25abc0fabc Update gt_gen_utils.py johnlockejrr 2025-05-14 03:34:51 -07:00
  • 1bf801985b
    Update gt_gen_utils.py johnlockejrr 2025-05-14 03:34:51 -07:00
  • 88e0315321 Accurately writing text line contours into xml file when the deskewing exceeds 45 degrees and the text line is in light mode vahidrezanezhad 2025-05-13 15:53:05 +02:00
  • 1ccd3fb7cf Accurately writing text line contours into xml file when the deskewing exceeds 45 degrees and the text line is in light mode vahidrezanezhad 2025-05-13 15:53:05 +02:00
  • 54088c6b04 The initial attempt at reading heavily deskewed or vertically aligned lines. vahidrezanezhad 2025-05-13 14:40:57 +02:00
  • 07f5b52fa7 The initial attempt at reading heavily deskewed or vertically aligned lines. vahidrezanezhad 2025-05-13 14:40:57 +02:00
  • 4a7728bb34 visuliazation layout from eynollah page-xml output vahidrezanezhad 2025-05-12 22:39:47 +02:00
  • 4ddc84dee8 visulizing textline detection from eynollah page-xml output vahidrezanezhad 2025-05-12 18:31:40 +02:00
  • c12b09a868 I have tried to address the issues #163 and #161 . The changes have also improved marginal detection and enhanced the isolation of headers. vahidrezanezhad 2025-05-12 00:10:18 +02:00
  • 02a679a145 I have tried to address the issues #163 and #161 . The changes have also improved marginal detection and enhanced the isolation of headers. vahidrezanezhad 2025-05-12 00:10:18 +02:00
  • 3a9fc0efde Update utils.py johnlockejrr 2025-05-11 06:09:17 -07:00
  • 102b04c84d
    Update utils.py johnlockejrr 2025-05-11 06:09:17 -07:00
  • 6fa766d6a5 Update utils.py johnlockejrr 2025-05-11 05:31:34 -07:00
  • be57f137d7
    Update utils.py johnlockejrr 2025-05-11 05:31:34 -07:00
  • 21ec4fbfb5 The text region coordinates are now correctly written into the XML output when using the skip layout and reading order option vahidrezanezhad 2025-05-07 14:04:01 +02:00
  • 83211ae684 In the case of skip_layout_and_reading_order, the confidence value was not set correctly, leading to an error while writing to the XML file. vahidrezanezhad 2025-05-07 12:33:03 +02:00
  • 92954b1b7b resolving issued with saving model by steps vahidrezanezhad 2025-05-05 16:13:38 +02:00
  • 5694d971c5 saving model by steps is added to reading order and pixel wise segmentation use cases training vahidrezanezhad 2025-05-05 15:39:05 +02:00
  • 3b123b039c adding min_early parameter for generating training dataset for machine based reading order model vahidrezanezhad 2025-05-03 19:25:32 +02:00
  • 89aa545049 let to add dataset abbrevation to extracted textline images and text vahidrezanezhad 2025-05-03 02:59:16 +02:00
  • 5d447abcc4 let to add dataset abbrevation to extracted textline images and text vahidrezanezhad 2025-05-03 02:59:16 +02:00
  • 48e8dd4ab3 machine based model name changed to public one vahidrezanezhad 2025-05-02 12:57:26 +02:00
  • 8c8fa461bb machine based model name changed to public one vahidrezanezhad 2025-05-02 12:57:26 +02:00
  • a1a004b19d inference batch size for ocr is passed as an argument vahidrezanezhad 2025-05-02 12:53:33 +02:00
  • a4defbb04d inference batch size for ocr is passed as an argument vahidrezanezhad 2025-05-02 12:53:33 +02:00
  • 5d8c864c08 adding space between splitted textline predicted text in the case of trocr vahidrezanezhad 2025-05-02 01:02:32 +02:00
  • fd375e15d5 adding space between splitted textline predicted text in the case of trocr vahidrezanezhad 2025-05-02 01:02:32 +02:00
  • 184af46664 displaying detexted text on an image is provided for trocr case vahidrezanezhad 2025-05-02 00:30:36 +02:00
  • a2359ea4c4
    Merge pull request #171 from bertsky/ocrd-machine-based-ro Clemens Neudecker 2025-08-15 18:40:13 +02:00
  • 21615a986d OCR-D processor: expose reading_order_machine_based Robert Sachunsky 2025-08-13 14:14:37 +02:00
  • 6b8893b188
    Merge pull request #167 from qurator-spk/ocrd-fixes vahidrezanezhad 2025-07-22 14:46:25 +02:00
  • b7b218ff11 OCR-D processor: same behavior as standalone wrt light_version/textline_light kba 2025-06-12 15:30:17 +02:00
  • c194a20c9c Fixed duplicate textline_light assignments (true and false) in the OCR-D framework for the Eynollah light version, which caused rectangles to be used instead of contours for textlines vahidrezanezhad 2025-06-11 18:57:08 +02:00
  • 3dcbb20cac
    Merge pull request #159 from bertsky/main Clemens Neudecker 2025-05-06 15:14:06 +02:00
  • 5c8084a397 displaying detexted text on an image is provided for trocr case vahidrezanezhad 2025-05-02 00:30:36 +02:00
  • e9179e1d34 docker: use latest core base stage Robert Sachunsky 2025-05-02 00:13:06 +02:00
  • f8b4d29a59 docker: prepackage ocrd-all-module-dir.json Robert Sachunsky 2025-05-02 00:13:11 +02:00
  • e2da7a6239 Fix model name to return the correct machine-based model name vahidrezanezhad 2025-04-30 16:06:29 +02:00
  • b227736094 Fix OCR text cleaning to correctly handle 'U', 'K', and 'N' starting sentence; update text line splitting size vahidrezanezhad 2025-04-30 16:04:34 +02:00
  • 4cb4414740 Resolve remaining issue with #158 and resolving #124 vahidrezanezhad 2025-04-30 16:01:52 +02:00
  • 208bde706f resolving issue #158 vahidrezanezhad 2025-04-30 13:55:09 +02:00
  • 3e8adb86c2
    Merge pull request #157 from qurator-spk/kba-patch-1 Konstantin Baierer 2025-04-29 11:42:18 +02:00
  • 77dae129d5
    CI: Use most recent actions/setup-python@v5 Konstantin Baierer 2025-04-22 13:22:28 +02:00
  • 192b9111e3 updating eynollah README, how to use it for use cases vahidrezanezhad 2025-04-22 00:23:01 +02:00
  • b4df978dd5
    Merge pull request #154 from qurator-spk/ci-pypi Clemens Neudecker 2025-04-17 17:01:20 +02:00
  • 44d02687c6 Merge pull request #18 from johnlockejrr/unifying-training-models Clemens Neudecker 2025-04-17 15:43:28 +02:00
  • 4635dd219d updating:rotation augmentation is provided for machine based reading order vahidrezanezhad 2025-04-17 00:12:30 +02:00
  • dd21a3b33a updating:rotation augmentation is provided for machine based reading order vahidrezanezhad 2025-04-17 00:05:59 +02:00
  • 825b2634f9 rotation augmentation is provided for machine based reading order vahidrezanezhad 2025-04-16 23:36:41 +02:00
  • 30ba234641 CI: pypi kba 2025-04-16 19:27:17 +02:00
  • 41318f0404 📝 changelog kba 2025-04-15 11:14:26 +02:00
  • a22df11ebb Restoring the contour in the original image caused an error due to an empty tuple. This issue has been resolved, and as expected, the confidence score for this contour is set to zero vahidrezanezhad 2025-04-14 00:42:08 +02:00
  • 3b1886973d gha snippets from codecov sample page test-codecov kba 2025-04-08 14:52:38 +02:00
  • 60a05711bb test kba 2025-04-08 14:46:22 +02:00
  • 8080bd823c 📦 v0.4.0 v0.4.0 kba 2025-04-07 16:48:57 +02:00
  • bcf1898aa4 📝 changelog Robert Sachunsky 2025-04-07 16:46:58 +02:00
  • 177e017167 test_run: ensure exceptions are shown Robert Sachunsky 2025-04-06 18:24:56 +00:00
  • e2907f67e0 'from PIL.Image import Image' causes an error when using Image.new(), and since Image is already imported, this line can be safely commented out. vahidrezanezhad 2025-04-06 00:33:36 +02:00
  • 132d3e3d27 CI: use clash-free artifact name for report upload Robert Sachunsky 2025-04-05 11:36:21 +02:00
  • dc64079b6b CI: fix coverage report calls Robert Sachunsky 2025-04-05 03:40:02 +02:00