vahidrezanezhad
|
38a2d60fa2
|
The confidence value for text regions is set to zero in the case of the non-light version. This is done to let the pipeline run through. It will be updated to return the correct value in upcoming commits
|
2025-04-03 12:47:27 +02:00 |
|
vahidrezanezhad
|
6b52da227c
|
decorating eynollah with textregion confidence score #135
|
2025-04-03 00:39:21 +02:00 |
|
vahidrezanezhad
|
91b2201b07
|
CNN-RNN OCR: width of input textline image cannot be zero!
|
2025-04-01 10:55:40 +02:00 |
|
vahidrezanezhad
|
4de441eaaa
|
OCR prediction is now enabled to integrate results from both RGB and binarized images or to be performed on each individually
|
2025-03-31 21:28:05 +02:00 |
|
vahidrezanezhad
|
b1da0a3327
|
In OCR, the predicted text is now drawn on the image, and the results are saved in a specified directory. This makes it easier to review the predicted output
|
2025-03-31 18:43:14 +02:00 |
|
vahidrezanezhad
|
9b04688ebc
|
The rotate_image function has been updated. Additionally, the reading order is now correct in the case of the light version, provided that slope_deskew exceeds the slope_threshold.
|
2025-03-30 15:34:27 +02:00 |
|
vahidrezanezhad
|
cf40f9ecc5
|
The rotate_image function produces the exact same rotation as Imutils. Therefore, there is no need to retain the remove-imutils-1 branch.
|
2025-03-28 20:58:32 +01:00 |
|
vahidrezanezhad
|
f756b08c9b
|
Revert "replace usages of imutils with opencv equivalents"
|
2025-03-28 14:57:40 +01:00 |
|
vahidrezanezhad
|
52c605185a
|
Merge pull request #146 from qurator-spk/remove-imutils-1
replace usages of `imutils` with opencv equivalents
|
2025-03-28 11:10:25 +01:00 |
|
vahidrezanezhad
|
6f36c7177f
|
For OCR, the splitting ratio of text lines is adjusted
|
2025-03-27 18:24:47 +01:00 |
|
cneud
|
181c0c584f
|
bbox rotation with opencv
|
2025-03-26 22:25:22 +01:00 |
|
cneud
|
eaff9e3537
|
Merge branch 'main' into remove-imutils-1
|
2025-03-26 20:16:46 +01:00 |
|
vahidrezanezhad
|
7df0427b04
|
In the context of OCR, if Page-XML files already contain text, the new predicted text will replace the existing text.
|
2025-03-26 18:42:06 +01:00 |
|
vahidrezanezhad
|
370d44a66b
|
Slope deskew in the light version is set to zero because when the slope_deskew value exceeds the slope_threshold, the reading order becomes incorrect. This issue needs to be addressed. Additionally, the textlines order within text region in the light version was reversed, and this has been corrected.
|
2025-03-26 10:45:34 +01:00 |
|
vahidrezanezhad
|
d3a4c06e7f
|
This commit enables the export of cropped text line images along with their corresponding texts from a Page-XML file. These exported text line images and texts can be utilized for training a text line-based OCR model.
|
2025-03-20 18:21:44 +01:00 |
|
vahidrezanezhad
|
c8b8529951
|
For the CNN-RNN OCR model, long text lines are split into two segments
|
2025-03-17 19:50:58 +01:00 |
|
vahidrezanezhad
|
aa72ca3006
|
Resolved an issue in the OCR-D framework where dir_out received a None value
|
2025-03-13 15:02:38 +01:00 |
|
vahidrezanezhad
|
a4f1f35125
|
Resolving test failure
|
2025-03-07 13:19:56 +01:00 |
|
kba
|
54040c1db4
|
Merge remote-tracking branch 'bertsky/machine_based_reading_order_integration_fixes' into machine_based_reading_order_integration
|
2025-03-06 15:48:52 +01:00 |
|
cneud
|
0b2c1b9275
|
remove imutils dependency
|
2025-03-03 22:21:57 +01:00 |
|
Clemens Neudecker
|
687aba1fa2
|
replace usages of imutils with opencv equivalents
should fix https://github.com/qurator-spk/eynollah/issues/141
|
2025-03-03 22:10:40 +01:00 |
|
vahidrezanezhad
|
7110bd971f
|
resolved an error for light version in the case that slope_deskew is smaller than slope_threshold
|
2025-02-27 19:11:15 +01:00 |
|
vahidrezanezhad
|
25116a2c79
|
resolved 2 errors
|
2025-02-19 00:35:48 +01:00 |
|
vahidrezanezhad
|
33fda2f8be
|
changing cnn ocr model name
|
2024-12-26 22:45:40 +01:00 |
|
Robert Sachunsky
|
335aa273a1
|
simplify, wrap extremely long lines
|
2024-12-23 13:36:29 +00:00 |
|
Robert Sachunsky
|
cfc65128b1
|
reduce redundancy/indentation
|
2024-12-22 14:56:32 +00:00 |
|
Robert Sachunsky
|
01376af905
|
do_order_of_regions_with_model: simplify
|
2024-12-22 13:10:05 +00:00 |
|
vahidrezanezhad
|
92bfac4b41
|
Provide OCR as an option to process a directory of XML files, incorporating layout and text line coordinates.
|
2024-12-20 15:47:21 +01:00 |
|
vahidrezanezhad
|
fbeef79d50
|
adding scatter_nd inference
|
2024-12-16 01:11:54 +01:00 |
|
Robert Sachunsky
|
0ae28f7d3e
|
switch from stdlib to loky.ProcessPoolExecutor, ensure shutdown
|
2024-12-14 12:16:29 +00:00 |
|
vahidrezanezhad
|
f93c6c288d
|
function of patch-wise inference with scatter_nd is added
|
2024-12-14 02:50:17 +01:00 |
|
vahidrezanezhad
|
0e8c561618
|
debugging issues
|
2024-12-14 00:24:29 +01:00 |
|
Robert Sachunsky
|
dcaf796283
|
change polarity of orientation angle (PAGE schema required cw=positive)
|
2024-12-11 23:07:56 +00:00 |
|
Robert Sachunsky
|
b4b0890294
|
add option to overwrite output xml, but skip by default if file exists
|
2024-12-11 19:52:21 +00:00 |
|
Robert Sachunsky
|
b9ca7a6191
|
log num_cols-dependent resizing
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
9270ea4550
|
annotate region angles in PAGE
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
3b70b11ea6
|
avoid deskewing patches if binary-empty
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
7e9ee90e6e
|
switch from (ad-hoc) mp.Pool to (attribute) concurrent.futures.ProcessPoolExecutor
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
68456ea002
|
do_work_of_slopes_new*, do_back_rotation_and_get_cnt_back, do_work_of_contours_in_image: use mp.Pool, simplify
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
25e967397d
|
exit early if no text regions found (to avoid segfault)
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
21efea8711
|
no del on function argument
|
2024-12-11 18:48:26 +00:00 |
|
Robert Sachunsky
|
5e0c1da711
|
simplify
|
2024-12-11 00:18:58 +00:00 |
|
Robert Sachunsky
|
54cb15056b
|
do_image_rotation / return_deskew_slop: avoid code duplication, simplify via mp.Pool
|
2024-12-10 09:52:32 +00:00 |
|
Robert Sachunsky
|
6fe02df973
|
do_image_rotation: fix f93fa12 (do return results)
|
2024-12-09 16:35:31 +00:00 |
|
Robert Sachunsky
|
d68017037c
|
do_prediction: trigger GC to avoid CUDA OOM
|
2024-12-09 11:27:11 +00:00 |
|
Robert Sachunsky
|
ad748d0039
|
do_prediction: avoid code duplication
|
2024-12-09 10:55:41 +00:00 |
|
Robert Sachunsky
|
c3163caefd
|
avoid indentation
|
2024-12-05 14:28:17 +00:00 |
|
Robert Sachunsky
|
055463d23a
|
avoid indentation
|
2024-12-05 09:43:30 +00:00 |
|
Robert Sachunsky
|
aaea2ef463
|
simplify
|
2024-12-05 09:40:02 +00:00 |
|
Robert Sachunsky
|
3d88b207fc
|
run: log instead of print
|
2024-12-05 09:39:55 +00:00 |
|