Commit Graph

558 Commits (ce5b6112960f67d7819b11a9b346da0d8f5fdb4d)
 

Author SHA1 Message Date
vahidrezanezhad ce5b611296 tests are passed - new models by the way should be uploaded
vahidrezanezhad f43c49c508 textlines of drop capitals are connected to corresponding textline if possible otherwise they are inserted in corresponding textregion
vahidrezanezhad 22b0b07a73 drop capital and marginals extraction is updated
Clemens Neudecker 1ae77e61c8
Update requirements.txt
vahidrezanezhad 8409de0e58 sbb_binarization is integrated into eynollah works in framework of ocrd - sbb_binarization in ocrd works for individual images by the way as standalone flowing from directory can be used now. For eynollah in ocrd framework I have added -light version as default parameter.
vahidrezanezhad 0914b5ff8a resolve merge conflict of main branch with machine based reading order branch
vahidrezanezhad 6aee70d0cd Resolve merge conflict of main and machine based reading order branch
vahidrezanezhad bceeeb56c1
Merge pull request from qurator-spk/extracting_images_only
Extracting images only
vahidrezanezhad f7e5fb917f resolving merge conflict of machine based reading order and extracting only images branches
vahidrezanezhad 751b0102f7 updating early layout inference for light version
vahidrezanezhad e796a99c5c updating inference for early layout in the case of documents with number of columns bigger than 2
vahidrezanezhad 438df52287 updating
vahidrezanezhad 90ee2d61dc textline segmentation is masked with drop capitals
vahidrezanezhad 5037e9896d Merge branch 'machine_based_reading_order_integration' of https://github.com/qurator-spk/eynollah into machine_based_reading_order_integration
vahidrezanezhad 82281bd6cf fixing a bug occuring with reading order + Slro option with no patch textline model and thresholding artificial class
vahidrezanezhad 328d33e3dc Temporary commit – textline prediction without patches
vahidrezanezhad 70772d4104 binarization as a standalone command
vahidrezanezhad f93fa12441 doing more multiprocessing in order to make the process faster
vahidrezanezhad 3ef4eac24c textlines of textregions are extracted in a faster way + early layout for all documents is done with no patches model and on rgb input
vahidrezanezhad 1da4b7f589 updating light version
vahidrezanezhad 543ed4bc38 -light version need -tll to be enabled otherwise the process will be ended.
Clemens Neudecker 51f6ef63f5
Merge pull request from qurator-spk/dockerfile
Add Dockerfile and make docker target
kba b13759fdcf ci: smoke-test make docker
kba c487be2a1d dockerfile: use src-layout
kba 7eb1390583 Merge branch 'main' into dockerfile
vahidrezanezhad ab63d5ba40 updating light version features
vahidrezanezhad 1774076f4a updating light version. Remove textlines or textregion contours inside a bigger one
vahidrezanezhad ad32316217 updating light version
vahidrezanezhad 133091137d dilation of textregions and marginals are accomplished
vahidrezanezhad 95effe54a0 updating textregions dilation
vahidrezanezhad b33739adee parametriyation in the case of textline contours dilation is accomplished
vahidrezanezhad 6626dc6866 updating textline dilation parameters
vahidrezanezhad 62f8ae4860 updating dilation of textlines and text regions
vahidrezanezhad 7f08458436 dilation of text regions without opencv
vahidrezanezhad 5d680136a4 updating light version
vahidrezanezhad b9e8959c4a update of light versions
vahidrezanezhad 2d18739d9b postprocessing of textline contour dilation + skip layout and reading order passed as an argument
Clemens Neudecker 4af0bc079c
Merge pull request from qurator-spk/extracting_images_only
Extracting images only
vahidrezanezhad 5a07cd9cfa the most effective version of contours dilation without opencv and all at once
michalbubula d168edfd77
Update cli.py to block other processing in the case of extract_image_only
michalbubula 723f27bec4
Add -eoi option to README.md
vahidrezanezhad 74a0699f6b extracting images only now works for a single image input
vahidrezanezhad a1f1f98de3 updating scaling contours
Clemens Neudecker 327b446a16
update Makefile with v0.3.1 models
Clemens Neudecker 351e9a897a
update `ocrd-tool.json` with v0.3.1 models
vahidrezanezhad 21380fc870 scaling contours without dilation
kba 478edc804a Add Dockerfile and make docker target
vahidrezanezhad 1b18ae874b passing number of columns as an argument
vahidrezanezhad 2c93904985 avoiding double binarization
vahidrezanezhad f0b49073b7 adding option for textline detection in printspace