Commit graph

  • d968a306e4 should merged text for the whole page be written in xml? vahidrezanezhad 2025-07-21 14:50:05 +02:00
  • 673e67a847 update model names vahidrezanezhad 2025-07-21 10:54:20 +02:00
  • 920705c3b1 update model names vahidrezanezhad 2025-07-21 10:54:20 +02:00
  • fee40049cd ocr model renamed - image text font for ocr result is now using Charis-7.000 font (downloaded from here https://software.sil.org/charis/download/) vahidrezanezhad 2025-07-16 14:00:12 +02:00
  • e0f4a007e4 ocr model renamed - image text font for ocr result is now using Charis-7.000 font (downloaded from here https://software.sil.org/charis/download/) vahidrezanezhad 2025-07-16 14:00:12 +02:00
  • 04fead348f ocr: make sure that image height or width is not zero vahidrezanezhad 2025-07-03 15:24:52 +02:00
  • e54ebaa23e ocr: make sure that image height or width is not zero vahidrezanezhad 2025-07-03 15:24:52 +02:00
  • 53dd4b26a9 decorated with confidence value for cnnrnn ocr model vahidrezanezhad 2025-07-03 11:50:47 +02:00
  • 59ea493803 decorated with confidence value for cnnrnn ocr model vahidrezanezhad 2025-07-03 11:50:47 +02:00
  • 1b222594d6 Update README.md: how to train model using docker image vahidrezanezhad 2025-06-25 18:33:55 +02:00
  • f5a1d1a255 docker file to train model with desired cuda and cudnn vahidrezanezhad 2025-06-25 18:24:16 +02:00
  • 42a3cc2335 cv2pil: limit color depth on output imgs Robert Sachunsky 2025-06-13 01:21:25 +02:00
  • dadb879376 pil2cv: allow (and drop) alpha channels on input imgs Robert Sachunsky 2025-06-12 20:55:37 +02:00
  • 32889ef1e0 adapt binarization CLI according to #156 kba 2025-06-12 13:57:41 +02:00
  • 9b4e78c55c Fixed duplicate textline_light assignments (true and false) in the OCR-D framework for the Eynollah light version, which caused rectangles to be used instead of contours for textlines vahidrezanezhad 2025-06-11 18:57:08 +02:00
  • 0e7de52f5e Merge pull request #24 from johnlockejrr/unifying-training-models Clemens Neudecker 2025-06-03 09:00:56 +02:00
  • eb91000490 layout visualization updated vahidrezanezhad 2025-06-02 18:23:34 +02:00
  • f79af201ab Fix: Resolved OCR bug when text region type is undefined vahidrezanezhad 2025-06-02 18:21:33 +02:00
  • 065f1f9a93 Fix: Resolved OCR bug when text region type is undefined vahidrezanezhad 2025-06-02 18:21:33 +02:00
  • e26c4ab9b4 image enhancer updated vahidrezanezhad 2025-06-01 22:44:50 +02:00
  • 7996afac69 image enhancer updated vahidrezanezhad 2025-06-01 22:44:50 +02:00
  • 9342b76038 saving enhanced image in org or scaled resolution vahidrezanezhad 2025-06-01 22:10:13 +02:00
  • d14bd162ca saving enhanced image in org or scaled resolution vahidrezanezhad 2025-06-01 22:10:13 +02:00
  • 3b475915c7 image enhancer is integrated vahidrezanezhad 2025-06-01 15:53:04 +02:00
  • cc36694dfd image enhancer is integrated vahidrezanezhad 2025-06-01 15:53:04 +02:00
  • df903aa1b4 Parametrize OCR for handling curved lines vahidrezanezhad 2025-05-31 01:09:14 +02:00
  • 928a548b70 Parametrize OCR for handling curved lines vahidrezanezhad 2025-05-31 01:09:14 +02:00
  • 1e7cecfcf9 updating ocr vahidrezanezhad 2025-05-28 01:17:21 +02:00
  • 48285ce3f5 updating ocr vahidrezanezhad 2025-05-28 01:17:21 +02:00
  • 03f52e7a46 updating ocr vahidrezanezhad 2025-05-27 23:45:22 +02:00
  • b93fc112bf updating ocr vahidrezanezhad 2025-05-27 23:45:22 +02:00
  • 31d9fa0c80 strings alignment function is added + new changes needed for prediction with both bin and rgb inputs is implemented vahidrezanezhad 2025-05-25 21:44:36 +02:00
  • 0f154c605a strings alignment function is added + new changes needed for prediction with both bin and rgb inputs is implemented vahidrezanezhad 2025-05-25 21:44:36 +02:00
  • b18691f96a rnn ocr for all layout textregion types vahidrezanezhad 2025-05-25 03:33:54 +02:00
  • 097520bfd2 rnn ocr for all layout textregion types vahidrezanezhad 2025-05-25 03:33:54 +02:00
  • ba3420b2d8 Drop capitals are written separately and are not attached to their corresponding text line. The OCR use case also supports single-image input. vahidrezanezhad 2025-05-25 01:12:58 +02:00
  • 27c4b0d0e0 Drop capitals are written separately and are not attached to their corresponding text line. The OCR use case also supports single-image input. vahidrezanezhad 2025-05-25 01:12:58 +02:00
  • 25e3a2a99f visualizing ro for single xml file vahidrezanezhad 2025-05-23 18:30:51 +02:00
  • 0250a6d3d0 enhancing ocr vahidrezanezhad 2025-05-23 18:06:53 +02:00
  • adcf03c7b7 enhancing ocr vahidrezanezhad 2025-05-23 18:06:53 +02:00
  • 089029cec7 commit 21ec4fb is picked + rnn ocr at the same time with segmentation + enhancement of mb reading order vahidrezanezhad 2025-05-23 15:55:03 +02:00
  • d4f6e10251 commit 21ec4fb is picked + rnn ocr at the same time with segmentation + enhancement of mb reading order vahidrezanezhad 2025-05-23 15:55:03 +02:00
  • ee2c7e9013 enhancing curved lines OCR vahidrezanezhad 2025-05-21 17:42:44 +02:00
  • a0647eff93 enhancing curved lines OCR vahidrezanezhad 2025-05-21 17:42:44 +02:00
  • 14b70c2556 Implement hyphenated textline merging in OCR engine and a bug fixed for curved textline OCR vahidrezanezhad 2025-05-21 14:39:31 +02:00
  • f94fc9973b Implement hyphenated textline merging in OCR engine and a bug fixed for curved textline OCR vahidrezanezhad 2025-05-21 14:39:31 +02:00
  • 3ad621e956 ocr for curved lines vahidrezanezhad 2025-05-20 19:01:52 +02:00
  • c0835665a9 ocr for curved lines vahidrezanezhad 2025-05-20 19:01:52 +02:00
  • 44ff51f5c1 mb reading order now can be done faster. Text regions are clustered using dilation, and mb reading order needs to be implemented for fewer regions vahidrezanezhad 2025-05-20 16:51:08 +02:00
  • 848156dd9d mb reading order now can be done faster. Text regions are clustered using dilation, and mb reading order needs to be implemented for fewer regions vahidrezanezhad 2025-05-20 16:51:08 +02:00
  • 5016039cd7 enhancing marginal detection for light version vahidrezanezhad 2025-05-18 02:48:05 +02:00
  • 7a34bbb493 enhancing marginal detection for light version vahidrezanezhad 2025-05-18 02:48:05 +02:00
  • f298643fcf Fix ReduceONPlateau wrong logic johnlockejrr 2025-05-17 23:24:40 +03:00
  • 7661080899 LR Warmup and Optimization Implementation johnlockejrr 2025-05-17 16:17:38 +03:00
  • f9390c71e7 updating inference for mb reading order vahidrezanezhad 2025-05-17 02:18:27 +02:00
  • 1cbc669d36 marginals detection enhanced for light version vahidrezanezhad 2025-05-15 15:33:50 +02:00
  • 0819730355 marginals detection enhanced for light version vahidrezanezhad 2025-05-15 15:33:50 +02:00
  • 1b229ba7ae enhancement for vertical textlines vahidrezanezhad 2025-05-15 00:45:22 +02:00
  • adee1dc55c enhancement for vertical textlines vahidrezanezhad 2025-05-15 00:45:22 +02:00
  • 7a22e51f5d resolve some comments from review cneud 2025-05-14 21:56:03 +02:00
  • ed46615f00 enhance ocr for vertical textlines vahidrezanezhad 2025-05-14 18:34:58 +02:00
  • a9cdd56e9a enhance ocr for vertical textlines vahidrezanezhad 2025-05-14 18:34:58 +02:00
  • 25abc0fabc Update gt_gen_utils.py johnlockejrr 2025-05-14 03:34:51 -07:00
  • 1bf801985b Update gt_gen_utils.py johnlockejrr 2025-05-14 03:34:51 -07:00
  • 88e0315321 Accurately writing text line contours into xml file when the deskewing exceeds 45 degrees and the text line is in light mode vahidrezanezhad 2025-05-13 15:53:05 +02:00
  • 1ccd3fb7cf Accurately writing text line contours into xml file when the deskewing exceeds 45 degrees and the text line is in light mode vahidrezanezhad 2025-05-13 15:53:05 +02:00
  • 54088c6b04 The initial attempt at reading heavily deskewed or vertically aligned lines. vahidrezanezhad 2025-05-13 14:40:57 +02:00
  • 07f5b52fa7 The initial attempt at reading heavily deskewed or vertically aligned lines. vahidrezanezhad 2025-05-13 14:40:57 +02:00
  • 4a7728bb34 visualization layout from eynollah page-xml output vahidrezanezhad 2025-05-12 22:39:47 +02:00
  • 4ddc84dee8 visualizing textline detection from eynollah page-xml output vahidrezanezhad 2025-05-12 18:31:40 +02:00
  • c12b09a868 I have tried to address the issues #163 and #161 . The changes have also improved marginal detection and enhanced the isolation of headers. vahidrezanezhad 2025-05-12 00:10:18 +02:00
  • 02a679a145 I have tried to address the issues #163 and #161 . The changes have also improved marginal detection and enhanced the isolation of headers. vahidrezanezhad 2025-05-12 00:10:18 +02:00
  • 3a9fc0efde Update utils.py johnlockejrr 2025-05-11 06:09:17 -07:00
  • 102b04c84d Update utils.py johnlockejrr 2025-05-11 06:09:17 -07:00
  • 6fa766d6a5 Update utils.py johnlockejrr 2025-05-11 05:31:34 -07:00
  • be57f137d7 Update utils.py johnlockejrr 2025-05-11 05:31:34 -07:00
  • 21ec4fbfb5 The text region coordinates are now correctly written into the XML output when using the skip layout and reading order option vahidrezanezhad 2025-05-07 14:04:01 +02:00
  • 83211ae684 In the case of skip_layout_and_reading_order, the confidence value was not set correctly, leading to an error while writing to the XML file. vahidrezanezhad 2025-05-07 12:33:03 +02:00
  • 92954b1b7b resolving issues with saving model by steps vahidrezanezhad 2025-05-05 16:13:38 +02:00
  • 5694d971c5 saving model by steps is added to reading order and pixel wise segmentation use cases training vahidrezanezhad 2025-05-05 15:39:05 +02:00
  • 3b123b039c adding min_early parameter for generating training dataset for machine based reading order model vahidrezanezhad 2025-05-03 19:25:32 +02:00
  • 89aa545049 let to add dataset abbreviation to extracted textline images and text vahidrezanezhad 2025-05-03 02:59:16 +02:00
  • 5d447abcc4 let to add dataset abbreviation to extracted textline images and text vahidrezanezhad 2025-05-03 02:59:16 +02:00
  • 48e8dd4ab3 machine based model name changed to public one vahidrezanezhad 2025-05-02 12:57:26 +02:00
  • 8c8fa461bb machine based model name changed to public one vahidrezanezhad 2025-05-02 12:57:26 +02:00
  • a1a004b19d inference batch size for ocr is passed as an argument vahidrezanezhad 2025-05-02 12:53:33 +02:00
  • a4defbb04d inference batch size for ocr is passed as an argument vahidrezanezhad 2025-05-02 12:53:33 +02:00
  • 5d8c864c08 adding space between splitted textline predicted text in the case of trocr vahidrezanezhad 2025-05-02 01:02:32 +02:00
  • fd375e15d5 adding space between splitted textline predicted text in the case of trocr vahidrezanezhad 2025-05-02 01:02:32 +02:00
  • 184af46664 displaying detected text on an image is provided for trocr case vahidrezanezhad 2025-05-02 00:30:36 +02:00
  • a2359ea4c4 Merge pull request #171 from bertsky/ocrd-machine-based-ro Clemens Neudecker 2025-08-15 18:40:13 +02:00
  • 21615a986d OCR-D processor: expose reading_order_machine_based Robert Sachunsky 2025-08-13 14:14:37 +02:00
  • 6b8893b188 Merge pull request #167 from qurator-spk/ocrd-fixes vahidrezanezhad 2025-07-22 14:46:25 +02:00
  • b7b218ff11 OCR-D processor: same behavior as standalone wrt light_version/textline_light kba 2025-06-12 15:30:17 +02:00
  • c194a20c9c Fixed duplicate textline_light assignments (true and false) in the OCR-D framework for the Eynollah light version, which caused rectangles to be used instead of contours for textlines vahidrezanezhad 2025-06-11 18:57:08 +02:00
  • 3dcbb20cac Merge pull request #159 from bertsky/main Clemens Neudecker 2025-05-06 15:14:06 +02:00
  • 5c8084a397 displaying detected text on an image is provided for trocr case vahidrezanezhad 2025-05-02 00:30:36 +02:00
  • e9179e1d34 docker: use latest core base stage Robert Sachunsky 2025-05-02 00:13:06 +02:00
  • f8b4d29a59 docker: prepackage ocrd-all-module-dir.json Robert Sachunsky 2025-05-02 00:13:11 +02:00
  • e2da7a6239 Fix model name to return the correct machine-based model name vahidrezanezhad 2025-04-30 16:06:29 +02:00