Robert Sachunsky
|
a1068ff2eb
|
OCR-D: move sbb-binarize to ocrd-tool.json, update to v3
|
2 weeks ago |
Robert Sachunsky
|
c794d4d29f
|
OCR-D: fix typo light_mode→light_version
|
2 weeks ago |
Robert Sachunsky
|
4338259ca1
|
OCR-D: ensure page image gets replaced in result as well if not the original file
|
2 weeks ago |
Robert Sachunsky
|
55969b0173
|
OCR-D: add docstring
|
2 weeks ago |
Robert Sachunsky
|
3916474b8b
|
OCR-D: require >=v3.1
|
2 weeks ago |
Robert Sachunsky
|
6d02e90570
|
OCR-D: restrict max_workers=1
|
2 weeks ago |
Robert Sachunsky
|
efd3fa6775
|
allow empty imports for optional dependencies
|
2 weeks ago |
Robert Sachunsky
|
238132e260
|
use 'image_filename' for pseudo-iteration outside 'dir_in' mode
|
2 weeks ago |
Robert Sachunsky
|
af4e2a4ffc
|
do not require 'dir_out' outside 'dir_in' mode
|
2 weeks ago |
Robert Sachunsky
|
ea136e3ddd
|
'overwrite' check: only in 'dir_in' mode
|
2 weeks ago |
Robert Sachunsky
|
1f4a17b60d
|
Merge remote-tracking branch 'origin/machine_based_reading_order_integration' into v3-api
|
2 weeks ago |
Robert Sachunsky
|
edf924c2cb
|
ocrd-tool: add dockerhub
|
2 weeks ago |
vahidrezanezhad
|
9b04688ebc
|
The rotate_image function has been updated. Additionally, the reading order is now correct in the case of the light version, provided that slope_deskew exceeds the slope_threshold.
|
2 weeks ago |
vahidrezanezhad
|
cf40f9ecc5
|
The rotate_image function produces the exact same rotation as Imutils. Therefore, there is no need to retain the remove-imutils-1 branch.
|
3 weeks ago |
vahidrezanezhad
|
b55389ac62
|
Update requirements.txt
|
3 weeks ago |
vahidrezanezhad
|
8bf70d905f
|
Merge pull request #147 from qurator-spk/revert-146-remove-imutils-1
Revert "replace usages of `imutils` with opencv equivalents"
|
3 weeks ago |
vahidrezanezhad
|
f756b08c9b
|
Revert "replace usages of `imutils` with opencv equivalents"
|
3 weeks ago |
vahidrezanezhad
|
c9de578d4d
|
removing imutils from requirements
|
3 weeks ago |
vahidrezanezhad
|
52c605185a
|
Merge pull request #146 from qurator-spk/remove-imutils-1
replace usages of `imutils` with opencv equivalents
|
3 weeks ago |
cneud
|
0e9a72ea52
|
consolidate usage documentation
|
3 weeks ago |
cneud
|
3a55b6ce91
|
consolidate usage documentation
|
3 weeks ago |
cneud
|
e9fa691308
|
add model and training documentation
|
3 weeks ago |
vahidrezanezhad
|
6f36c7177f
|
For OCR, the splitting ratio of text lines is adjusted
|
3 weeks ago |
cneud
|
181c0c584f
|
bbox rotation with opencv
|
3 weeks ago |
cneud
|
eaff9e3537
|
Merge branch 'main' into remove-imutils-1
|
3 weeks ago |
vahidrezanezhad
|
7df0427b04
|
In the context of OCR, if Page-XML files already contain text, the new predicted text will replace the existing text.
|
3 weeks ago |
vahidrezanezhad
|
370d44a66b
|
Slope deskew in the light version is set to zero because when the slope_deskew value exceeds the slope_threshold, the reading order becomes incorrect. This issue needs to be addressed. Additionally, the textlines order within text region in the light version was reversed, and this has been corrected.
|
3 weeks ago |
Clemens Neudecker
|
005b6988f4
|
Merge pull request #140 from qurator-spk/machine_based_reading_order_integration
Machine based reading order integration
|
3 weeks ago |
vahidrezanezhad
|
d3a4c06e7f
|
This commit enables the export of cropped text line images along with their corresponding texts from a Page-XML file. These exported text line images and texts can be utilized for training a text line-based OCR model.
|
4 weeks ago |
vahidrezanezhad
|
c8b8529951
|
For the CNN-RNN OCR model, long text lines are split into two segments
|
4 weeks ago |
vahidrezanezhad
|
aa72ca3006
|
Resolved an issue in the OCR-D framework where dir_out received a None value
|
1 month ago |
vahidrezanezhad
|
a4f1f35125
|
Resolving test failure
|
1 month ago |
kba
|
54040c1db4
|
Merge remote-tracking branch 'bertsky/machine_based_reading_order_integration_fixes' into machine_based_reading_order_integration
|
1 month ago |
cneud
|
0b2c1b9275
|
remove `imutils` dependency
|
1 month ago |
Clemens Neudecker
|
687aba1fa2
|
replace usages of `imutils` with opencv equivalents
should fix https://github.com/qurator-spk/eynollah/issues/141
|
1 month ago |
vahidrezanezhad
|
7110bd971f
|
resolved an error for light version in the case that slope_deskew is smaller than slope_threshold
|
2 months ago |
vahidrezanezhad
|
25116a2c79
|
resolved 2 errors
|
2 months ago |
kba
|
869110f185
|
merge main
|
3 months ago |
vahidrezanezhad
|
33fda2f8be
|
changing cnn ocr model name
|
4 months ago |
Robert Sachunsky
|
335aa273a1
|
simplify, wrap extremely long lines
|
4 months ago |
Robert Sachunsky
|
cfc65128b1
|
reduce redundancy/indentation
|
4 months ago |
Robert Sachunsky
|
01376af905
|
do_order_of_regions_with_model: simplify
|
4 months ago |
vahidrezanezhad
|
92bfac4b41
|
Provide OCR as an option to process a directory of XML files, incorporating layout and text line coordinates.
|
4 months ago |
vahidrezanezhad
|
fbeef79d50
|
adding scatter_nd inference
|
4 months ago |
Robert Sachunsky
|
0ae28f7d3e
|
switch from stdlib to loky.ProcessPoolExecutor, ensure shutdown
|
4 months ago |
vahidrezanezhad
|
f93c6c288d
|
function of patch-wise inference with scatter_nd is added
|
4 months ago |
vahidrezanezhad
|
0e8c561618
|
debugging issues
|
4 months ago |
Robert Sachunsky
|
e9c0d716f6
|
CI: install optional dependencies, too
|
4 months ago |
Robert Sachunsky
|
dcaf796283
|
change polarity of orientation angle (PAGE schema required cw=positive)
|
4 months ago |
Robert Sachunsky
|
b4b0890294
|
add option to overwrite output xml, but skip by default if file exists
|
4 months ago |