Commit graph

768 commits

Author SHA1 Message Date
vahidrezanezhad
920705c3b1 update model names 2025-07-21 10:54:20 +02:00
vahidrezanezhad
e0f4a007e4 ocr model renamed - image text font for ocr result is now using Charis-7.000 font (downloaded from here https://software.sil.org/charis/download/) 2025-07-16 14:00:12 +02:00
vahidrezanezhad
e54ebaa23e ocr: make sure that image height or width is not zero 2025-07-03 15:24:52 +02:00
vahidrezanezhad
59ea493803 decorated with confidence value for cnnrnn ocr model 2025-07-03 11:50:47 +02:00
vahidrezanezhad
065f1f9a93 Fix: Resolved OCR bug when text region type is undefined 2025-06-02 18:21:33 +02:00
vahidrezanezhad
7996afac69 image enhancer updated 2025-06-01 22:44:50 +02:00
vahidrezanezhad
d14bd162ca saving enhanced image in org or scaled resolution 2025-06-01 22:10:13 +02:00
vahidrezanezhad
cc36694dfd image enhancer is integrated 2025-06-01 15:53:04 +02:00
vahidrezanezhad
928a548b70 Parametrize OCR for handling curved lines 2025-05-31 01:09:14 +02:00
vahidrezanezhad
48285ce3f5 updating ocr 2025-05-28 01:17:21 +02:00
vahidrezanezhad
b93fc112bf updating ocr 2025-05-27 23:45:22 +02:00
vahidrezanezhad
0f154c605a strings alignment function is added + new changes needed for prediction with both bin and rgb inputs is implemented 2025-05-25 21:44:36 +02:00
vahidrezanezhad
097520bfd2 rnn ocr for all layout textregion types 2025-05-25 03:33:54 +02:00
vahidrezanezhad
27c4b0d0e0 Drop capitals are written separately and are not attached to their corresponding text line. The OCR use case also supports single-image input. 2025-05-25 01:12:58 +02:00
vahidrezanezhad
adcf03c7b7 enhancing ocr 2025-05-23 18:06:53 +02:00
vahidrezanezhad
d4f6e10251 commit 21ec4fb is picked + rnn ocr at the same time with segmentation + enhancement of mb reading order 2025-05-23 15:55:03 +02:00
vahidrezanezhad
a0647eff93 enhancing curved lines OCR 2025-05-21 17:42:44 +02:00
vahidrezanezhad
f94fc9973b Implement hyphenated textline merging in OCR engine and a bug fixed for curved textline OCR 2025-05-21 14:39:31 +02:00
vahidrezanezhad
c0835665a9 ocr for curved lines 2025-05-20 19:01:52 +02:00
vahidrezanezhad
848156dd9d mb reading order now can be done faster. Text regions are clustered using dilation, and mb reading order needs to be implemented for fewer regions 2025-05-20 16:51:08 +02:00
vahidrezanezhad
7a34bbb493 enhancing marginal detection for light version 2025-05-18 02:48:05 +02:00
vahidrezanezhad
0819730355 marginals detection enhanced for light version 2025-05-15 15:33:50 +02:00
vahidrezanezhad
adee1dc55c enhancement for vertical textlines 2025-05-15 00:45:22 +02:00
vahidrezanezhad
a9cdd56e9a enhance ocr for vertical textlines 2025-05-14 18:34:58 +02:00
vahidrezanezhad
1ccd3fb7cf Accurately writing text line contours into xml file when the deskewing exceeds 45 degrees and the text line is in light mode 2025-05-13 15:53:05 +02:00
vahidrezanezhad
07f5b52fa7 The initial attempt at reading heavily deskewed or vertically aligned lines. 2025-05-13 14:40:57 +02:00
vahidrezanezhad
02a679a145 I have tried to address the issues #163 and #161 . The changes have also improved marginal detection and enhanced the isolation of headers. 2025-05-12 00:10:18 +02:00
vahidrezanezhad
5d447abcc4 let to add dataset abbrevation to extracted textline images and text 2025-05-03 02:59:16 +02:00
vahidrezanezhad
8c8fa461bb machine based model name changed to public one 2025-05-02 12:57:26 +02:00
vahidrezanezhad
a4defbb04d inference batch size for ocr is passed as an argument 2025-05-02 12:53:33 +02:00
vahidrezanezhad
fd375e15d5 adding space between splitted textline predicted text in the case of trocr 2025-05-02 01:02:32 +02:00
vahidrezanezhad
5c8084a397 displaying detexted text on an image is provided for trocr case 2025-05-02 00:30:36 +02:00
vahidrezanezhad
e2da7a6239 Fix model name to return the correct machine-based model name 2025-04-30 16:06:29 +02:00
vahidrezanezhad
b227736094 Fix OCR text cleaning to correctly handle 'U', 'K', and 'N' starting sentence; update text line splitting size 2025-04-30 16:04:34 +02:00
vahidrezanezhad
4cb4414740 Resolve remaining issue with #158 and resolving #124 2025-04-30 16:01:52 +02:00
vahidrezanezhad
208bde706f resolving issue #158 2025-04-30 13:55:09 +02:00
Konstantin Baierer
3e8adb86c2
Merge pull request #157 from qurator-spk/kba-patch-1
CI: Use most recent actions/setup-python@v5
2025-04-29 11:42:18 +02:00
Konstantin Baierer
77dae129d5
CI: Use most recent actions/setup-python@v5 2025-04-22 13:22:28 +02:00
Clemens Neudecker
b4df978dd5
Merge pull request #154 from qurator-spk/ci-pypi
CI: pypi
2025-04-17 17:01:20 +02:00
kba
30ba234641 CI: pypi 2025-04-16 19:27:17 +02:00
kba
41318f0404 📝 changelog 2025-04-15 11:14:26 +02:00
vahidrezanezhad
a22df11ebb Restoring the contour in the original image caused an error due to an empty tuple. This issue has been resolved, and as expected, the confidence score for this contour is set to zero 2025-04-14 00:42:08 +02:00
kba
8080bd823c 📦 v0.4.0 2025-04-07 16:48:57 +02:00
Robert Sachunsky
bcf1898aa4 📝 changelog 2025-04-07 16:46:58 +02:00
Robert Sachunsky
177e017167 test_run: ensure exceptions are shown 2025-04-07 10:39:50 +00:00
vahidrezanezhad
e2907f67e0 'from PIL.Image import Image' causes an error when using Image.new(), and since Image is already imported, this line can be safely commented out. 2025-04-06 00:33:36 +02:00
Robert Sachunsky
132d3e3d27 CI: use clash-free artifact name for report upload 2025-04-05 11:36:21 +02:00
Robert Sachunsky
dc64079b6b CI: fix coverage report calls 2025-04-05 03:40:02 +02:00
Robert Sachunsky
7609c64c8b CI: make coverage cfg work with both editable and dist install 2025-04-05 03:05:26 +02:00
Robert Sachunsky
bbc06dbbc1 CI: forgot to (re-)enable verbose logging 2025-04-05 02:10:52 +02:00