vahidrezanezhad
|
44ff51f5c1
|
The mb (machine-based) reading order can now be done faster: text regions are clustered using dilation, so the mb reading order only needs to be computed for fewer regions
|
2025-05-20 16:51:08 +02:00 |
|
vahidrezanezhad
|
5016039cd7
|
enhancing marginal detection for light version
|
2025-05-18 02:48:05 +02:00 |
|
vahidrezanezhad
|
1cbc669d36
|
marginals detection enhanced for light version
|
2025-05-15 15:33:50 +02:00 |
|
vahidrezanezhad
|
1b229ba7ae
|
enhancement for vertical textlines
|
2025-05-15 00:45:22 +02:00 |
|
vahidrezanezhad
|
ed46615f00
|
enhance ocr for vertical textlines
|
2025-05-14 18:34:58 +02:00 |
|
vahidrezanezhad
|
88e0315321
|
Accurately writing text line contours into xml file when the deskewing exceeds 45 degrees and the text line is in light mode
|
2025-05-13 15:53:05 +02:00 |
|
vahidrezanezhad
|
54088c6b04
|
The initial attempt at reading heavily deskewed or vertically aligned lines.
|
2025-05-13 14:40:57 +02:00 |
|
vahidrezanezhad
|
c12b09a868
|
I have tried to address issues #163 and #161. The changes have also improved marginal detection and enhanced the isolation of headers.
|
2025-05-12 00:10:18 +02:00 |
|
vahidrezanezhad
|
89aa545049
|
allow adding a dataset abbreviation to extracted textline images and text
|
2025-05-03 02:59:16 +02:00 |
|
vahidrezanezhad
|
48e8dd4ab3
|
machine-based model name changed to the public one
|
2025-05-02 12:57:26 +02:00 |
|
vahidrezanezhad
|
a1a004b19d
|
inference batch size for ocr is passed as an argument
|
2025-05-02 12:53:33 +02:00 |
|
vahidrezanezhad
|
5d8c864c08
|
adding a space between the predicted texts of split textlines in the case of trocr
|
2025-05-02 01:02:32 +02:00 |
|
vahidrezanezhad
|
184af46664
|
displaying detected text on an image is provided for the trocr case
|
2025-05-02 00:30:36 +02:00 |
|
Robert Sachunsky
|
21615a986d
|
OCR-D processor: expose reading_order_machine_based
|
2025-08-13 14:14:37 +02:00 |
|
kba
|
b7b218ff11
|
OCR-D processor: same behavior as standalone wrt light_version/textline_light
|
2025-06-12 15:30:17 +02:00 |
|
vahidrezanezhad
|
c194a20c9c
|
Fixed duplicate textline_light assignments (true and false) in the OCR-D framework for the Eynollah light version, which caused rectangles to be used instead of contours for textlines
|
2025-06-12 15:27:22 +02:00 |
|
vahidrezanezhad
|
e2da7a6239
|
Fix model name to return the correct machine-based model name
|
2025-04-30 16:06:29 +02:00 |
|
vahidrezanezhad
|
b227736094
|
Fix OCR text cleaning to correctly handle 'U', 'K', and 'N' at the start of a sentence; update text line splitting size
|
2025-04-30 16:04:34 +02:00 |
|
vahidrezanezhad
|
4cb4414740
|
Resolve remaining issue with #158 and resolve #124
|
2025-04-30 16:01:52 +02:00 |
|
vahidrezanezhad
|
208bde706f
|
resolving issue #158
|
2025-04-30 13:55:09 +02:00 |
|
vahidrezanezhad
|
a22df11ebb
|
Restoring the contour in the original image caused an error due to an empty tuple. This issue has been resolved, and as expected, the confidence score for this contour is set to zero
|
2025-04-14 00:42:08 +02:00 |
|
kba
|
8080bd823c
|
📦 v0.4.0
|
2025-04-07 16:48:57 +02:00 |
|
vahidrezanezhad
|
e2907f67e0
|
'from PIL.Image import Image' causes an error when using Image.new(), and since Image is already imported, this line can be safely commented out.
|
2025-04-06 00:33:36 +02:00 |
|
Robert Sachunsky
|
4339444e47
|
binarization CLI: fix option checks, simplify to asserts, fix dir_in mode
|
2025-04-05 01:21:08 +02:00 |
|
Robert Sachunsky
|
91a340f619
|
CLI: simplify option checks to asserts (also avoid stack trace)
|
2025-04-04 20:42:28 +02:00 |
|
Robert Sachunsky
|
e0a7fde537
|
logger: fix type hint
|
2025-04-04 20:27:15 +02:00 |
|
Robert Sachunsky
|
108ce1f5a1
|
Merge remote-tracking branch 'origin/main' into v3-api-release-foreal
(bad-ass difficult diff diffing)
|
2025-04-04 20:23:23 +02:00 |
|
vahidrezanezhad
|
2e3a29f66b
|
In light mode: To determine whether a main region is a header, I adjusted the ratio to achieve better results.
|
2025-04-04 15:36:31 +02:00 |
|
vahidrezanezhad
|
38a2d60fa2
|
The confidence value for text regions is set to zero when the light version is not used. This is done to let the pipeline run through. It will be updated to return the correct value in upcoming commits
|
2025-04-03 12:47:27 +02:00 |
|
vahidrezanezhad
|
6b52da227c
|
decorating eynollah with textregion confidence score #135
|
2025-04-03 00:39:21 +02:00 |
|
Robert Sachunsky
|
559d001eef
|
another fix to avoid frequent warnings
|
2025-04-02 05:45:34 +00:00 |
|
Robert Sachunsky
|
dd478279a4
|
CLI: also --overwrite in single-image mode
|
2025-04-02 05:40:21 +00:00 |
|
Robert Sachunsky
|
8159e6336a
|
fix typo (preventing log messages)
|
2025-04-02 00:01:02 +00:00 |
|
Robert Sachunsky
|
2919538382
|
minor fixes to avoid frequent warnings
|
2025-04-01 23:33:26 +00:00 |
|
Robert Sachunsky
|
dcf2ed5e22
|
run: also write out XML in single filename mode
|
2025-04-01 23:13:24 +02:00 |
|
Robert Sachunsky
|
fe77171d45
|
run_single: reduce indentation
|
2025-04-01 22:47:33 +02:00 |
|
Robert Sachunsky
|
79003a083c
|
CLI: ValueError instead of print+exit
|
2025-04-01 22:43:01 +02:00 |
|
Robert Sachunsky
|
e17d34fafa
|
factor run_single() out of run(), simplify kwargs
|
2025-04-01 22:12:24 +02:00 |
|
Robert Sachunsky
|
1a0a1cb00b
|
remove session methods and redundant model loaders
|
2025-04-01 21:15:41 +02:00 |
|
Robert Sachunsky
|
dd51f900b9
|
OCR-D: init Eynollah in 'setup', re-use instance for each page via non-public API
|
2025-04-01 13:02:30 +02:00 |
|
Robert Sachunsky
|
ffeb4a343d
|
Eynollah: remove useless 'pcgts' attr
|
2025-04-01 13:00:41 +02:00 |
|
vahidrezanezhad
|
91b2201b07
|
cnnrnn OCR: width of input textline image cannot be zero!
|
2025-04-01 10:55:40 +02:00 |
|
Robert Sachunsky
|
515b4023f6
|
sbb_binarize: fix missing reference
|
2025-04-01 10:54:36 +02:00 |
|
vahidrezanezhad
|
4de441eaaa
|
OCR prediction is now enabled to integrate results from both RGB and binarized images or to be performed on each individually
|
2025-03-31 21:28:05 +02:00 |
|
vahidrezanezhad
|
b1da0a3327
|
In OCR, the predicted text is now drawn on the image, and the results are saved in a specified directory. This makes it easier to review the predicted output
|
2025-03-31 18:43:14 +02:00 |
|
Robert Sachunsky
|
c01609ff4e
|
allow even more empty imports for optional dependencies
|
2025-03-31 15:57:22 +02:00 |
|
Robert Sachunsky
|
46618f4229
|
allow more empty imports for optional dependencies
|
2025-03-31 14:11:50 +02:00 |
|
Robert Sachunsky
|
4be89910a2
|
CLI: fix arg vs kwarg from merge
|
2025-03-31 02:38:24 +02:00 |
|
Robert Sachunsky
|
9d61acf173
|
simplify
|
2025-03-31 02:02:30 +02:00 |
|
Robert Sachunsky
|
a1068ff2eb
|
OCR-D: move sbb-binarize to ocrd-tool.json, update to v3
|
2025-03-31 01:47:32 +02:00 |
|