Commit Graph

568 Commits (fbeef79d50412ca15d71766af9c109ee6a16aa10)
 

Author SHA1 Message Date
vahidrezanezhad fbeef79d50 adding scatter_nd inference
vahidrezanezhad 871d7bfc5a fixed: machine based reading order cause tuple index out of range error if number of textregion is one.
vahidrezanezhad 6aad006f4c filter textregions without textline
kba 1083d1c7fb gha: try to free disk space
vahidrezanezhad 8014a9e416
Update Makefile
vahidrezanezhad 3000255a24
Update Makefile
vahidrezanezhad 1746920275
Update Makefile
vahidrezanezhad b622494f34 new table detection model is integrated
vahidrezanezhad d9f79c3404 fixing IndexError by reading order detection
vahidrezanezhad 5fa8ca46a4 updating requirements
vahidrezanezhad ce5b611296 tests are passed - new models by the way should be uploaded
vahidrezanezhad f43c49c508 textlines of drop capitals are connected to corresponding textline if possible otherwise they are inserted in corresponding textregion
vahidrezanezhad 22b0b07a73 drop capital and marginals extraction is updated
Clemens Neudecker 1ae77e61c8
Update requirements.txt
vahidrezanezhad 8409de0e58 sbb_binarization is integrated into eynollah works in framework of ocrd - sbb_binarization in ocrd works for individual images by the way as standalone flowing from directory can be used now. For eynollah in ocrd framework I have added -light version as default parameter.
vahidrezanezhad 0914b5ff8a resolve merge conflict of main branch with machine based reading order branch
vahidrezanezhad 6aee70d0cd Resolve merge conflict of main and machine based reading order branch
vahidrezanezhad bceeeb56c1
Merge pull request from qurator-spk/extracting_images_only
Extracting images only
vahidrezanezhad f7e5fb917f resolving merge conflict of machine based reading order and extracting only images branches
vahidrezanezhad 751b0102f7 updating early layout inference for light version
vahidrezanezhad e796a99c5c updating inference for early layout in the case of documents with number of columns bigger than 2
vahidrezanezhad 438df52287 updating
vahidrezanezhad 90ee2d61dc textline segmentation is masked with drop capitals
vahidrezanezhad 5037e9896d Merge branch 'machine_based_reading_order_integration' of https://github.com/qurator-spk/eynollah into machine_based_reading_order_integration
vahidrezanezhad 82281bd6cf fixing a bug occuring with reading order + Slro option with no patch textline model and thresholding artificial class
vahidrezanezhad 328d33e3dc Temporary commit – textline prediction without patches
vahidrezanezhad 70772d4104 binarization as a standalone command
vahidrezanezhad f93fa12441 doing more multiprocessing in order to make the process faster
vahidrezanezhad 3ef4eac24c textlines of textregions are extracted in a faster way + early layout for all documents is done with no patches model and on rgb input
vahidrezanezhad 1da4b7f589 updating light version
vahidrezanezhad 543ed4bc38 -light version need -tll to be enabled otherwise the process will be ended.
Clemens Neudecker 51f6ef63f5
Merge pull request from qurator-spk/dockerfile
Add Dockerfile and make docker target
kba b13759fdcf ci: smoke-test make docker
kba c487be2a1d dockerfile: use src-layout
kba 7eb1390583 Merge branch 'main' into dockerfile
vahidrezanezhad ab63d5ba40 updating light version features
vahidrezanezhad 1774076f4a updating light version. Remove textlines or textregion contours inside a bigger one
vahidrezanezhad ad32316217 updating light version
vahidrezanezhad 133091137d dilation of textregions and marginals are accomplished
vahidrezanezhad 95effe54a0 updating textregions dilation
vahidrezanezhad b33739adee parametriyation in the case of textline contours dilation is accomplished
vahidrezanezhad 6626dc6866 updating textline dilation parameters
vahidrezanezhad 62f8ae4860 updating dilation of textlines and text regions
vahidrezanezhad 7f08458436 dilation of text regions without opencv
vahidrezanezhad 5d680136a4 updating light version
vahidrezanezhad b9e8959c4a update of light versions
vahidrezanezhad 2d18739d9b postprocessing of textline contour dilation + skip layout and reading order passed as an argument
Clemens Neudecker 4af0bc079c
Merge pull request from qurator-spk/extracting_images_only
Extracting images only
vahidrezanezhad 5a07cd9cfa the most effective version of contours dilation without opencv and all at once
michalbubula d168edfd77
Update cli.py to block other processing in the case of extract_image_only