vahidrezanezhad
|
38a2d60fa2
|
The confidence value for text regions is set to zero in the non-light version so that the pipeline can run through. It will be updated to return the correct value in upcoming commits.
|
2 weeks ago |
vahidrezanezhad
|
6b52da227c
|
decorating eynollah with textregion confidence score #135
|
2 weeks ago |
vahidrezanezhad
|
91b2201b07
|
CNN-RNN OCR: width of input textline image cannot be zero!
|
3 weeks ago |
vahidrezanezhad
|
4de441eaaa
|
OCR prediction can now integrate results from both RGB and binarized images, or be performed on each individually
|
3 weeks ago |
vahidrezanezhad
|
b1da0a3327
|
In OCR, the predicted text is now drawn on the image, and the results are saved in a specified directory. This makes it easier to review the predicted output
|
3 weeks ago |
vahidrezanezhad
|
9b04688ebc
|
The rotate_image function has been updated. Additionally, the reading order is now correct in the case of the light version, provided that slope_deskew exceeds the slope_threshold.
|
3 weeks ago |
vahidrezanezhad
|
cf40f9ecc5
|
The rotate_image function produces the exact same rotation as imutils. Therefore, there is no need to retain the remove-imutils-1 branch.
|
3 weeks ago |
vahidrezanezhad
|
b55389ac62
|
Update requirements.txt
|
3 weeks ago |
vahidrezanezhad
|
8bf70d905f
|
Merge pull request #147 from qurator-spk/revert-146-remove-imutils-1
Revert "replace usages of `imutils` with opencv equivalents"
|
3 weeks ago |
vahidrezanezhad
|
f756b08c9b
|
Revert "replace usages of `imutils` with opencv equivalents"
|
3 weeks ago |
vahidrezanezhad
|
c9de578d4d
|
removing imutils from requirements
|
3 weeks ago |
vahidrezanezhad
|
52c605185a
|
Merge pull request #146 from qurator-spk/remove-imutils-1
replace usages of `imutils` with opencv equivalents
|
3 weeks ago |
cneud
|
0e9a72ea52
|
consolidate usage documentation
|
3 weeks ago |
cneud
|
3a55b6ce91
|
consolidate usage documentation
|
3 weeks ago |
cneud
|
e9fa691308
|
add model and training documentation
|
3 weeks ago |
vahidrezanezhad
|
6f36c7177f
|
For OCR, the splitting ratio of text lines is adjusted
|
3 weeks ago |
cneud
|
181c0c584f
|
bbox rotation with opencv
|
3 weeks ago |
cneud
|
eaff9e3537
|
Merge branch 'main' into remove-imutils-1
|
3 weeks ago |
vahidrezanezhad
|
7df0427b04
|
In the context of OCR, if Page-XML files already contain text, the new predicted text will replace the existing text.
|
3 weeks ago |
vahidrezanezhad
|
370d44a66b
|
Slope deskew in the light version is set to zero because the reading order becomes incorrect when the slope_deskew value exceeds the slope_threshold. This issue still needs to be addressed. Additionally, the order of text lines within a text region was reversed in the light version; this has been corrected.
|
3 weeks ago |
Clemens Neudecker
|
005b6988f4
|
Merge pull request #140 from qurator-spk/machine_based_reading_order_integration
Machine based reading order integration
|
4 weeks ago |
vahidrezanezhad
|
d3a4c06e7f
|
This commit enables the export of cropped text line images along with their corresponding texts from a Page-XML file. These exported text line images and texts can be utilized for training a text line-based OCR model.
|
1 month ago |
vahidrezanezhad
|
c8b8529951
|
For the CNN-RNN OCR model, long text lines are split into two segments
|
1 month ago |
vahidrezanezhad
|
aa72ca3006
|
Resolved an issue in the OCR-D framework where dir_out received a None value
|
1 month ago |
vahidrezanezhad
|
a4f1f35125
|
Resolving test failure
|
1 month ago |
kba
|
54040c1db4
|
Merge remote-tracking branch 'bertsky/machine_based_reading_order_integration_fixes' into machine_based_reading_order_integration
|
1 month ago |
cneud
|
0b2c1b9275
|
remove `imutils` dependency
|
2 months ago |
Clemens Neudecker
|
687aba1fa2
|
replace usages of `imutils` with opencv equivalents
should fix https://github.com/qurator-spk/eynollah/issues/141
|
2 months ago |
vahidrezanezhad
|
7110bd971f
|
resolved an error in the light version when slope_deskew is smaller than slope_threshold
|
2 months ago |
vahidrezanezhad
|
25116a2c79
|
resolved 2 errors
|
2 months ago |
vahidrezanezhad
|
33fda2f8be
|
changing cnn ocr model name
|
4 months ago |
Robert Sachunsky
|
335aa273a1
|
simplify, wrap extremely long lines
|
4 months ago |
Robert Sachunsky
|
cfc65128b1
|
reduce redundancy/indentation
|
4 months ago |
Robert Sachunsky
|
01376af905
|
do_order_of_regions_with_model: simplify
|
4 months ago |
vahidrezanezhad
|
92bfac4b41
|
Provide OCR as an option to process a directory of XML files, incorporating layout and text line coordinates.
|
4 months ago |
vahidrezanezhad
|
fbeef79d50
|
adding scatter_nd inference
|
4 months ago |
Robert Sachunsky
|
0ae28f7d3e
|
switch from stdlib to loky.ProcessPoolExecutor, ensure shutdown
|
4 months ago |
vahidrezanezhad
|
f93c6c288d
|
added a function for patch-wise inference with scatter_nd
|
4 months ago |
vahidrezanezhad
|
0e8c561618
|
debugging issues
|
4 months ago |
Robert Sachunsky
|
e9c0d716f6
|
CI: install optional dependencies, too
|
4 months ago |
Robert Sachunsky
|
dcaf796283
|
change polarity of orientation angle (PAGE schema requires cw=positive)
|
4 months ago |
Robert Sachunsky
|
b4b0890294
|
add option to overwrite output xml, but skip by default if file exists
|
4 months ago |
Robert Sachunsky
|
b9ca7a6191
|
log num_cols-dependent resizing
|
4 months ago |
Robert Sachunsky
|
9270ea4550
|
annotate region angles in PAGE
|
4 months ago |
Robert Sachunsky
|
3b70b11ea6
|
avoid deskewing patches if binary-empty
|
4 months ago |
Robert Sachunsky
|
7e9ee90e6e
|
switch from (ad-hoc) mp.Pool to (attribute) concurrent.futures.ProcessPoolExecutor
|
4 months ago |
Robert Sachunsky
|
68456ea002
|
do_work_of_slopes_new*, do_back_rotation_and_get_cnt_back, do_work_of_contours_in_image: use mp.Pool, simplify
|
4 months ago |
Robert Sachunsky
|
25e967397d
|
exit early if no text regions found (to avoid segfault)
|
4 months ago |
Robert Sachunsky
|
21efea8711
|
no del on function argument
|
4 months ago |
Robert Sachunsky
|
5e0c1da711
|
simplify
|
4 months ago |