Commit Graph

732 Commits (3e8adb86c295525a0aca40513c4f789880effe3f)
 

Author SHA1 Message Date
vahidrezanezhad 4de441eaaa OCR prediction is now enabled to integrate results from both RGB and binarized images or to be performed on each individually 1 month ago
vahidrezanezhad b1da0a3327 In OCR, the predicted text is now drawn on the image, and the results are saved in a specified directory. This makes it easier to review the predicted output 1 month ago
Robert Sachunsky 31aeb9629d
Github Actions: free space more aggressively 1 month ago
Robert Sachunsky 7430b57b65 dockerfile: add smoke test 1 month ago
Robert Sachunsky f35f49376e run CLI test in TMPDIR, add ocrd-test 1 month ago
Robert Sachunsky ae066388ea docker: no need for g++, but install w/ 'EXTRAS=OCR' 1 month ago
Robert Sachunsky 722b5c6bf1 add make variable EXTRAS for optional dependencies 1 month ago
Robert Sachunsky c01609ff4e allow even more empty imports for optional dependencies 1 month ago
Robert Sachunsky 51e9bfd6d7 improve+extend dockerfile 1 month ago
Robert Sachunsky 09248d4829 improve+extend makefile 1 month ago
Robert Sachunsky 46618f4229 allow more empty imports for optional dependencies 1 month ago
Robert Sachunsky 4be89910a2 CLI: fix arg vs kwarg from merge 1 month ago
Robert Sachunsky 9d61acf173 simplify 1 month ago
Robert Sachunsky a1068ff2eb OCR-D: move sbb-binarize to ocrd-tool.json, update to v3 1 month ago
Robert Sachunsky c794d4d29f OCR-D: fix typo light_mode→light_version 1 month ago
Robert Sachunsky 4338259ca1 OCR-D: ensure page image gets replaced in result as well if not the original file 1 month ago
Robert Sachunsky 55969b0173 OCR-D: add docstring 1 month ago
Robert Sachunsky 3916474b8b OCR-D: require >=v3.1 1 month ago
Robert Sachunsky 6d02e90570 OCR-D: restrict max_workers=1 1 month ago
Robert Sachunsky efd3fa6775 allow empty imports for optional dependencies 1 month ago
Robert Sachunsky 238132e260 use 'image_filename' for pseudo-iteration outside 'dir_in' mode 1 month ago
Robert Sachunsky af4e2a4ffc do not require 'dir_out' outside 'dir_in' mode 1 month ago
Robert Sachunsky ea136e3ddd 'overwrite' check: only in 'dir_in' mode 1 month ago
Robert Sachunsky 1f4a17b60d Merge remote-tracking branch 'origin/machine_based_reading_order_integration' into v3-api 1 month ago
Robert Sachunsky edf924c2cb ocrd-tool: add dockerhub 1 month ago
vahidrezanezhad 9b04688ebc The rotate_image function has been updated. Additionally, the reading order is now correct in the case of the light version, provided that slope_deskew exceeds the slope_threshold. 1 month ago
vahidrezanezhad cf40f9ecc5 The rotate_image function produces the exact same rotation as Imutils. Therefore, there is no need to retain the remove-imutils-1 branch. 1 month ago
vahidrezanezhad b55389ac62
Update requirements.txt 1 month ago
vahidrezanezhad 8bf70d905f
Merge pull request #147 from qurator-spk/revert-146-remove-imutils-1
Revert "replace usages of `imutils` with opencv equivalents"
1 month ago
vahidrezanezhad f756b08c9b
Revert "replace usages of `imutils` with opencv equivalents" 1 month ago
vahidrezanezhad c9de578d4d removing imutils from requirements 1 month ago
vahidrezanezhad 52c605185a
Merge pull request #146 from qurator-spk/remove-imutils-1
replace usages of `imutils` with opencv equivalents
1 month ago
cneud 0e9a72ea52 consolidate usage documentation 1 month ago
cneud 3a55b6ce91 consolidate usage documentation 1 month ago
cneud e9fa691308 add model and training documentation 1 month ago
vahidrezanezhad 6f36c7177f For OCR, the splitting ratio of text lines is adjusted 1 month ago
cneud 181c0c584f bbox rotation with opencv 1 month ago
cneud eaff9e3537 Merge branch 'main' into remove-imutils-1 1 month ago
vahidrezanezhad 7df0427b04 In the context of OCR, if Page-XML files already contain text, the new predicted text will replace the existing text. 1 month ago
vahidrezanezhad 370d44a66b Slope deskew in the light version is set to zero because when the slope_deskew value exceeds the slope_threshold, the reading order becomes incorrect. This issue needs to be addressed. Additionally, the textlines order within text region in the light version was reversed, and this has been corrected. 1 month ago
Clemens Neudecker 005b6988f4
Merge pull request #140 from qurator-spk/machine_based_reading_order_integration
Machine based reading order integration
1 month ago
vahidrezanezhad d3a4c06e7f This commit enables the export of cropped text line images along with their corresponding texts from a Page-XML file. These exported text line images and texts can be utilized for training a text line-based OCR model. 1 month ago
vahidrezanezhad c8b8529951 For the CNN-RNN OCR model, long text lines are split into two segments 2 months ago
vahidrezanezhad aa72ca3006 Resolved an issue in the OCR-D framework where dir_out received a None value 2 months ago
vahidrezanezhad a4f1f35125 Resolving test failure 2 months ago
kba 54040c1db4 Merge remote-tracking branch 'bertsky/machine_based_reading_order_integration_fixes' into machine_based_reading_order_integration 2 months ago
cneud 0b2c1b9275 remove `imutils` dependency 2 months ago
Clemens Neudecker 687aba1fa2
replace usages of `imutils` with opencv equivalents
should fix https://github.com/qurator-spk/eynollah/issues/141
2 months ago
vahidrezanezhad 7110bd971f resolved an error for light version in the case that slope_deskew is smaller than slope_threshold 2 months ago
vahidrezanezhad 25116a2c79 resolved 2 errors 2 months ago