Commit Graph

724 Commits (177e01716790f1a0f21121c4b39123588a089a06)

Author SHA1 Message Date
Robert Sachunsky 51e9bfd6d7 improve+extend dockerfile 2 weeks ago
Robert Sachunsky 09248d4829 improve+extend makefile 2 weeks ago
Robert Sachunsky 46618f4229 allow more empty imports for optional dependencies 2 weeks ago
Robert Sachunsky 4be89910a2 CLI: fix arg vs kwarg from merge 2 weeks ago
Robert Sachunsky 9d61acf173 simplify 2 weeks ago
Robert Sachunsky a1068ff2eb OCR-D: move sbb-binarize to ocrd-tool.json, update to v3 2 weeks ago
Robert Sachunsky c794d4d29f OCR-D: fix typo light_mode→light_version 2 weeks ago
Robert Sachunsky 4338259ca1 OCR-D: ensure page image gets replaced in result as well if not the original file 2 weeks ago
Robert Sachunsky 55969b0173 OCR-D: add docstring 2 weeks ago
Robert Sachunsky 3916474b8b OCR-D: require >=v3.1 2 weeks ago
Robert Sachunsky 6d02e90570 OCR-D: restrict max_workers=1 2 weeks ago
Robert Sachunsky efd3fa6775 allow empty imports for optional dependencies 2 weeks ago
Robert Sachunsky 238132e260 use 'image_filename' for pseudo-iteration outside 'dir_in' mode 2 weeks ago
Robert Sachunsky af4e2a4ffc do not require 'dir_out' outside 'dir_in' mode 2 weeks ago
Robert Sachunsky ea136e3ddd 'overwrite' check: only in 'dir_in' mode 2 weeks ago
Robert Sachunsky 1f4a17b60d Merge remote-tracking branch 'origin/machine_based_reading_order_integration' into v3-api 2 weeks ago
Robert Sachunsky edf924c2cb ocrd-tool: add dockerhub 2 weeks ago
vahidrezanezhad 9b04688ebc The rotate_image function has been updated. Additionally, the reading order is now correct in the case of the light version, provided that slope_deskew exceeds the slope_threshold. 2 weeks ago
vahidrezanezhad cf40f9ecc5 The rotate_image function produces the exact same rotation as Imutils. Therefore, there is no need to retain the remove-imutils-1 branch. 3 weeks ago
vahidrezanezhad b55389ac62 Update requirements.txt 3 weeks ago
vahidrezanezhad 8bf70d905f Merge pull request #147 from qurator-spk/revert-146-remove-imutils-1: Revert "replace usages of `imutils` with opencv equivalents" 3 weeks ago
vahidrezanezhad f756b08c9b Revert "replace usages of `imutils` with opencv equivalents" 3 weeks ago
vahidrezanezhad c9de578d4d removing imutils from requirements 3 weeks ago
vahidrezanezhad 52c605185a Merge pull request #146 from qurator-spk/remove-imutils-1: replace usages of `imutils` with opencv equivalents 3 weeks ago
cneud 0e9a72ea52 consolidate usage documentation 3 weeks ago
cneud 3a55b6ce91 consolidate usage documentation 3 weeks ago
cneud e9fa691308 add model and training documentation 3 weeks ago
vahidrezanezhad 6f36c7177f For OCR, the splitting ratio of text lines is adjusted 3 weeks ago
cneud 181c0c584f bbox rotation with opencv 3 weeks ago
cneud eaff9e3537 Merge branch 'main' into remove-imutils-1 3 weeks ago
vahidrezanezhad 7df0427b04 In the context of OCR, if Page-XML files already contain text, the new predicted text will replace the existing text. 3 weeks ago
vahidrezanezhad 370d44a66b Slope deskew in the light version is set to zero because when the slope_deskew value exceeds the slope_threshold, the reading order becomes incorrect. This issue needs to be addressed. Additionally, the textlines order within text region in the light version was reversed, and this has been corrected. 3 weeks ago
Clemens Neudecker 005b6988f4 Merge pull request #140 from qurator-spk/machine_based_reading_order_integration: Machine based reading order integration 3 weeks ago
vahidrezanezhad d3a4c06e7f This commit enables the export of cropped text line images along with their corresponding texts from a Page-XML file. These exported text line images and texts can be utilized for training a text line-based OCR model. 4 weeks ago
vahidrezanezhad c8b8529951 For the CNN-RNN OCR model, long text lines are split into two segments 4 weeks ago
vahidrezanezhad aa72ca3006 Resolved an issue in the OCR-D framework where dir_out received a None value 1 month ago
vahidrezanezhad a4f1f35125 Resolving test failure 1 month ago
kba 54040c1db4 Merge remote-tracking branch 'bertsky/machine_based_reading_order_integration_fixes' into machine_based_reading_order_integration 1 month ago
cneud 0b2c1b9275 remove `imutils` dependency 1 month ago
Clemens Neudecker 687aba1fa2 replace usages of `imutils` with opencv equivalents; should fix https://github.com/qurator-spk/eynollah/issues/141 1 month ago
vahidrezanezhad 7110bd971f resolved an error for light version in the case that slope_deskew is smaller than slope_threshold 2 months ago
vahidrezanezhad 25116a2c79 resolved 2 errors 2 months ago
kba 869110f185 merge main 3 months ago
vahidrezanezhad 33fda2f8be changing cnn ocr model name 4 months ago
Robert Sachunsky 335aa273a1 simplify, wrap extremely long lines 4 months ago
Robert Sachunsky cfc65128b1 reduce redundancy/indentation 4 months ago
Robert Sachunsky 01376af905 do_order_of_regions_with_model: simplify 4 months ago
vahidrezanezhad 92bfac4b41 Provide OCR as an option to process a directory of XML files, incorporating layout and text line coordinates. 4 months ago
vahidrezanezhad fbeef79d50 adding scatter_nd inference 4 months ago
Robert Sachunsky 0ae28f7d3e switch from stdlib to loky.ProcessPoolExecutor, ensure shutdown 4 months ago