vahidrezanezhad
|
38a2d60fa2
|
Confidence value for textregions and in the case of not light version is set to zero. This is done to let the pipeline go through. It will be updated to return the correct value in upcomming commits
|
2025-04-03 12:47:27 +02:00 |
|
vahidrezanezhad
|
6b52da227c
|
docorating eynollah with textregion confidence score #135
|
2025-04-03 00:39:21 +02:00 |
|
vahidrezanezhad
|
91b2201b07
|
cnnrnn Ocr: width of input textline image can not be zero!
|
2025-04-01 10:55:40 +02:00 |
|
vahidrezanezhad
|
4de441eaaa
|
OCR prediction is now enabled to integrate results from both RGB and binarized images or to be performed on each individually
|
2025-03-31 21:28:05 +02:00 |
|
vahidrezanezhad
|
b1da0a3327
|
In OCR, the predicted text is now drawn on the image, and the results are saved in a specified directory. This makes it easier to review the predicted output
|
2025-03-31 18:43:14 +02:00 |
|
vahidrezanezhad
|
9b04688ebc
|
The rotate_image function has been updated. Additionally, the reading order is now correct in the case of the light version, provided that slope_deskew exceeds the slope_threshold.
|
2025-03-30 15:34:27 +02:00 |
|
vahidrezanezhad
|
cf40f9ecc5
|
The rotate_image function produces the exact same rotation as Imutils. Therefore, there is no need to retain the remove-imutils-1 branch.
|
2025-03-28 20:58:32 +01:00 |
|
vahidrezanezhad
|
b55389ac62
|
Update requirements.txt
|
2025-03-28 14:59:31 +01:00 |
|
vahidrezanezhad
|
8bf70d905f
|
Merge pull request #147 from qurator-spk/revert-146-remove-imutils-1
Revert "replace usages of `imutils` with opencv equivalents"
|
2025-03-28 14:58:04 +01:00 |
|
vahidrezanezhad
|
f756b08c9b
|
Revert "replace usages of imutils with opencv equivalents"
|
2025-03-28 14:57:40 +01:00 |
|
vahidrezanezhad
|
c9de578d4d
|
removing imutils from requirements
|
2025-03-28 11:25:03 +01:00 |
|
vahidrezanezhad
|
52c605185a
|
Merge pull request #146 from qurator-spk/remove-imutils-1
replace usages of `imutils` with opencv equivalents
|
2025-03-28 11:10:25 +01:00 |
|
cneud
|
0e9a72ea52
|
consolidate usage documentation
|
2025-03-27 23:14:59 +01:00 |
|
cneud
|
3a55b6ce91
|
consolidate usage documentation
|
2025-03-27 23:11:18 +01:00 |
|
cneud
|
e9fa691308
|
add model and training documentation
|
2025-03-27 22:41:10 +01:00 |
|
vahidrezanezhad
|
6f36c7177f
|
For OCR, the splitting ratio of text lines is adjusted
|
2025-03-27 18:24:47 +01:00 |
|
cneud
|
181c0c584f
|
bbox rotation with opencv
|
2025-03-26 22:25:22 +01:00 |
|
cneud
|
eaff9e3537
|
Merge branch 'main' into remove-imutils-1
|
2025-03-26 20:16:46 +01:00 |
|
vahidrezanezhad
|
7df0427b04
|
In the context of OCR, if Page-XML files already contain text, the new predicted text will replace the existing text.
|
2025-03-26 18:42:06 +01:00 |
|
vahidrezanezhad
|
370d44a66b
|
Slope deskew in the light version is set to zero because when the slope_deskew value exceeds the slope_threshold, the reading order becomes incorrect. This issue needs to be addressed. Additionally, the textlines order within text region in the light version was reversed, and this has been corrected.
|
2025-03-26 10:45:34 +01:00 |
|
Clemens Neudecker
|
005b6988f4
|
Merge pull request #140 from qurator-spk/machine_based_reading_order_integration
Machine based reading order integration
|
2025-03-25 11:00:44 +01:00 |
|
vahidrezanezhad
|
d3a4c06e7f
|
This commit enables the export of cropped text line images along with their corresponding texts from a Page-XML file. These exported text line images and texts can be utilized for training a text line-based OCR model.
|
2025-03-20 18:21:44 +01:00 |
|
vahidrezanezhad
|
c8b8529951
|
For the CNN-RNN OCR model, long text lines are split into two segments
|
2025-03-17 19:50:58 +01:00 |
|
vahidrezanezhad
|
aa72ca3006
|
Resolved an issue in the OCR-D framework where dir_out received a None value
|
2025-03-13 15:02:38 +01:00 |
|
vahidrezanezhad
|
a4f1f35125
|
Resolving test failure
|
2025-03-07 13:19:56 +01:00 |
|
kba
|
54040c1db4
|
Merge remote-tracking branch 'bertsky/machine_based_reading_order_integration_fixes' into machine_based_reading_order_integration
|
2025-03-06 15:48:52 +01:00 |
|
cneud
|
0b2c1b9275
|
remove imutils dependency
|
2025-03-03 22:21:57 +01:00 |
|
Clemens Neudecker
|
687aba1fa2
|
replace usages of imutils with opencv equivalents
should fix https://github.com/qurator-spk/eynollah/issues/141
|
2025-03-03 22:10:40 +01:00 |
|
vahidrezanezhad
|
7110bd971f
|
resolved an error for light version in the case that slope_deskew is smaller than slope_threshold
|
2025-02-27 19:11:15 +01:00 |
|
vahidrezanezhad
|
25116a2c79
|
resolved 2 errors
|
2025-02-19 00:35:48 +01:00 |
|
vahidrezanezhad
|
33fda2f8be
|
changing cnn ocr model name
|
2024-12-26 22:45:40 +01:00 |
|
Robert Sachunsky
|
335aa273a1
|
simplify, wrap extremely long lines
|
2024-12-23 13:36:29 +00:00 |
|
Robert Sachunsky
|
cfc65128b1
|
reduce redundancy/indentation
|
2024-12-22 14:56:32 +00:00 |
|
Robert Sachunsky
|
01376af905
|
do_order_of_regions_with_model: simplify
|
2024-12-22 13:10:05 +00:00 |
|
vahidrezanezhad
|
92bfac4b41
|
Provide OCR as an option to process a directory of XML files, incorporating layout and text line coordinates.
|
2024-12-20 15:47:21 +01:00 |
|
vahidrezanezhad
|
fbeef79d50
|
adding scatter_nd inference
|
2024-12-16 01:11:54 +01:00 |
|
Robert Sachunsky
|
0ae28f7d3e
|
switch from stdlib to loky.ProcessPoolExecutor, ensure shutdown
|
2024-12-14 12:16:29 +00:00 |
|
vahidrezanezhad
|
f93c6c288d
|
function of patch-wise inference with scatter_nd is added
|
2024-12-14 02:50:17 +01:00 |
|
vahidrezanezhad
|
0e8c561618
|
debugging issues
|
2024-12-14 00:24:29 +01:00 |
|
Robert Sachunsky
|
e9c0d716f6
|
CI: install optional dependencies, too
|
2024-12-11 23:48:56 +00:00 |
|
Robert Sachunsky
|
dcaf796283
|
change polarity of orientation angle (PAGE schema required cw=positive)
|
2024-12-11 23:07:56 +00:00 |
|
Robert Sachunsky
|
b4b0890294
|
add option to overwrite output xml, but skip by default if file exists
|
2024-12-11 19:52:21 +00:00 |
|
Robert Sachunsky
|
b9ca7a6191
|
log num_cols-dependent resizing
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
9270ea4550
|
annotate region angles in PAGE
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
3b70b11ea6
|
avoid deskewing patches if binary-empty
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
7e9ee90e6e
|
switch from (ad-hoc) mp.Pool to (attribute) concurrent.futures.ProcessPoolExecutor
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
68456ea002
|
do_work_of_slopes_new*, do_back_rotation_and_get_cnt_back, do_work_of_contours_in_image: use mp.Pool, simplify
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
25e967397d
|
exit early if no text regions found (to avoid segfault)
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
21efea8711
|
no del on function argument
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
5e0c1da711
|
simplify
|
2024-12-11 00:18:58 +00:00 |
|