Commit graph

489 commits

Author SHA1 Message Date
vahidrezanezhad
4de441eaaa OCR prediction is now enabled to integrate results from both RGB and binarized images or to be performed on each individually 2025-03-31 21:28:05 +02:00
vahidrezanezhad
b1da0a3327 In OCR, the predicted text is now drawn on the image, and the results are saved in a specified directory. This makes it easier to review the predicted output 2025-03-31 18:43:14 +02:00
Robert Sachunsky
c01609ff4e allow even more empty imports for optional dependencies 2025-03-31 15:57:22 +02:00
Robert Sachunsky
46618f4229 allow more empty imports for optional dependencies 2025-03-31 14:11:50 +02:00
Robert Sachunsky
4be89910a2 CLI: fix arg vs kwarg from merge 2025-03-31 02:38:24 +02:00
Robert Sachunsky
9d61acf173 simplify 2025-03-31 02:02:30 +02:00
Robert Sachunsky
a1068ff2eb OCR-D: move sbb-binarize to ocrd-tool.json, update to v3 2025-03-31 01:47:32 +02:00
Robert Sachunsky
c794d4d29f OCR-D: fix typo light_mode→light_version 2025-03-31 01:46:29 +02:00
Robert Sachunsky
4338259ca1 OCR-D: ensure page image gets replaced in result as well if not the original file 2025-03-31 01:17:14 +02:00
Robert Sachunsky
55969b0173 OCR-D: add docstring 2025-03-31 01:15:26 +02:00
Robert Sachunsky
6d02e90570 OCR-D: restrict max_workers=1 2025-03-31 01:14:54 +02:00
Robert Sachunsky
efd3fa6775 allow empty imports for optional dependencies 2025-03-31 00:32:26 +02:00
Robert Sachunsky
238132e260 use 'image_filename' for pseudo-iteration outside 'dir_in' mode 2025-03-31 00:31:49 +02:00
Robert Sachunsky
af4e2a4ffc do not require 'dir_out' outside 'dir_in' mode 2025-03-31 00:31:09 +02:00
Robert Sachunsky
ea136e3ddd 'overwrite' check: only in 'dir_in' mode 2025-03-31 00:30:06 +02:00
Robert Sachunsky
1f4a17b60d Merge remote-tracking branch 'origin/machine_based_reading_order_integration' into v3-api 2025-03-30 21:21:59 +02:00
Robert Sachunsky
edf924c2cb ocrd-tool: add dockerhub 2025-03-30 19:47:25 +02:00
vahidrezanezhad
9b04688ebc The rotate_image function has been updated. Additionally, the reading order is now correct in the case of the light version, provided that slope_deskew exceeds the slope_threshold. 2025-03-30 15:34:27 +02:00
vahidrezanezhad
cf40f9ecc5 The rotate_image function produces the exact same rotation as Imutils. Therefore, there is no need to retain the remove-imutils-1 branch. 2025-03-28 20:58:32 +01:00
vahidrezanezhad
f756b08c9b Revert "replace usages of imutils with opencv equivalents" 2025-03-28 14:57:40 +01:00
vahidrezanezhad
52c605185a Merge pull request #146 from qurator-spk/remove-imutils-1: replace usages of `imutils` with opencv equivalents 2025-03-28 11:10:25 +01:00
vahidrezanezhad
6f36c7177f For OCR, the splitting ratio of text lines is adjusted 2025-03-27 18:24:47 +01:00
cneud
181c0c584f bbox rotation with opencv 2025-03-26 22:25:22 +01:00
cneud
eaff9e3537 Merge branch 'main' into remove-imutils-1 2025-03-26 20:16:46 +01:00
vahidrezanezhad
7df0427b04 In the context of OCR, if Page-XML files already contain text, the new predicted text will replace the existing text. 2025-03-26 18:42:06 +01:00
vahidrezanezhad
370d44a66b Slope deskew in the light version is set to zero because when the slope_deskew value exceeds the slope_threshold, the reading order becomes incorrect. This issue needs to be addressed. Additionally, the textlines order within text region in the light version was reversed, and this has been corrected. 2025-03-26 10:45:34 +01:00
vahidrezanezhad
d3a4c06e7f This commit enables the export of cropped text line images along with their corresponding texts from a Page-XML file. These exported text line images and texts can be utilized for training a text line-based OCR model. 2025-03-20 18:21:44 +01:00
vahidrezanezhad
c8b8529951 For the CNN-RNN OCR model, long text lines are split into two segments 2025-03-17 19:50:58 +01:00
vahidrezanezhad
aa72ca3006 Resolved an issue in the OCR-D framework where dir_out received a None value 2025-03-13 15:02:38 +01:00
vahidrezanezhad
a4f1f35125 Resolving test failure 2025-03-07 13:19:56 +01:00
kba
54040c1db4 Merge remote-tracking branch 'bertsky/machine_based_reading_order_integration_fixes' into machine_based_reading_order_integration 2025-03-06 15:48:52 +01:00
cneud
0b2c1b9275 remove imutils dependency 2025-03-03 22:21:57 +01:00
Clemens Neudecker
687aba1fa2 replace usages of imutils with opencv equivalents; should fix https://github.com/qurator-spk/eynollah/issues/141 2025-03-03 22:10:40 +01:00
vahidrezanezhad
7110bd971f resolved an error for light version in the case that slope_deskew is smaller than slope_threshold 2025-02-27 19:11:15 +01:00
vahidrezanezhad
25116a2c79 resolved 2 errors 2025-02-19 00:35:48 +01:00
kba
869110f185 merge main 2025-01-20 14:45:27 +01:00
vahidrezanezhad
33fda2f8be changing cnn ocr model name 2024-12-26 22:45:40 +01:00
Robert Sachunsky
335aa273a1 simplify, wrap extremely long lines 2024-12-23 13:36:29 +00:00
Robert Sachunsky
cfc65128b1 reduce redundancy/indentation 2024-12-22 14:56:32 +00:00
Robert Sachunsky
01376af905 do_order_of_regions_with_model: simplify 2024-12-22 13:10:05 +00:00
vahidrezanezhad
92bfac4b41 Provide OCR as an option to process a directory of XML files, incorporating layout and text line coordinates. 2024-12-20 15:47:21 +01:00
vahidrezanezhad
fbeef79d50 adding scatter_nd inference 2024-12-16 01:11:54 +01:00
Robert Sachunsky
0ae28f7d3e switch from stdlib to loky.ProcessPoolExecutor, ensure shutdown 2024-12-14 12:16:29 +00:00
vahidrezanezhad
f93c6c288d function of patch-wise inference with scatter_nd is added 2024-12-14 02:50:17 +01:00
vahidrezanezhad
0e8c561618 debugging issues 2024-12-14 00:24:29 +01:00
Robert Sachunsky
dcaf796283 change polarity of orientation angle (PAGE schema required cw=positive) 2024-12-11 23:07:56 +00:00
Robert Sachunsky
b4b0890294 add option to overwrite output xml, but skip by default if file exists 2024-12-11 19:52:21 +00:00
Robert Sachunsky
b9ca7a6191 log num_cols-dependent resizing 2024-12-11 18:48:26 +00:00
Robert Sachunsky
9270ea4550 annotate region angles in PAGE 2024-12-11 18:48:26 +00:00
Robert Sachunsky
3b70b11ea6 avoid deskewing patches if binary-empty 2024-12-11 18:48:26 +00:00