Commit Graph

569 Commits (machine_based_reading_order_integration)
 

Author SHA1 Message Date
vahidrezanezhad 92bfac4b41 Provide OCR as an option to process a directory of XML files, incorporating layout and text line coordinates. 2 days ago
vahidrezanezhad fbeef79d50 adding scatter_nd inference 6 days ago
vahidrezanezhad 871d7bfc5a fixed: machine based reading order cause tuple index out of range error if number of textregion is one. 3 weeks ago
vahidrezanezhad 6aad006f4c filter textregions without textline 3 weeks ago
kba 1083d1c7fb gha: try to free disk space 4 weeks ago
vahidrezanezhad 8014a9e416 Update Makefile 4 weeks ago
vahidrezanezhad 3000255a24 Update Makefile 4 weeks ago
vahidrezanezhad 1746920275 Update Makefile 1 month ago
vahidrezanezhad b622494f34 new table detection model is integrated 1 month ago
vahidrezanezhad d9f79c3404 fixing IndexError by reading order detection 1 month ago
vahidrezanezhad 5fa8ca46a4 updating requirements 1 month ago
vahidrezanezhad ce5b611296 tests are passed - new models by the way should be uploaded 1 month ago
vahidrezanezhad f43c49c508 textlines of drop capitals are connected to corresponding textline if possible otherwise they are inserted in corresponding textregion 1 month ago
vahidrezanezhad 22b0b07a73 drop capital and marginals extraction is updated 1 month ago
Clemens Neudecker 1ae77e61c8 Update requirements.txt 1 month ago
vahidrezanezhad 8409de0e58 sbb_binarization is integrated into eynollah works in framework of ocrd - sbb_binarization in ocrd works for individual images by the way as standalone flowing from directory can be used now. For eynollah in ocrd framework I have added -light version as default parameter. 1 month ago
vahidrezanezhad 0914b5ff8a resolve merge conflict of main branch with machine based reading order branch 2 months ago
vahidrezanezhad 6aee70d0cd Resolve merge conflict of main and machine based reading order branch 2 months ago
vahidrezanezhad bceeeb56c1 Merge pull request #138 from qurator-spk/extracting_images_only: Extracting images only 2 months ago
vahidrezanezhad f7e5fb917f resolving merge conflict of machine based reading order and extracting only images branches 2 months ago
vahidrezanezhad 751b0102f7 updating early layout inference for light version 2 months ago
vahidrezanezhad e796a99c5c updating inference for early layout in the case of documents with number of columns bigger than 2 2 months ago
vahidrezanezhad 438df52287 updating 2 months ago
vahidrezanezhad 90ee2d61dc textline segmentation is masked with drop capitals 2 months ago
vahidrezanezhad 5037e9896d Merge branch 'machine_based_reading_order_integration' of https://github.com/qurator-spk/eynollah into machine_based_reading_order_integration 2 months ago
vahidrezanezhad 82281bd6cf fixing a bug occuring with reading order + Slro option with no patch textline model and thresholding artificial class 2 months ago
vahidrezanezhad 328d33e3dc Temporary commit – textline prediction without patches 2 months ago
vahidrezanezhad 70772d4104 binarization as a standalone command 2 months ago
vahidrezanezhad f93fa12441 doing more multiprocessing in order to make the process faster 2 months ago
vahidrezanezhad 3ef4eac24c textlines of textregions are extracted in a faster way + early layout for all documents is done with no patches model and on rgb input 2 months ago
vahidrezanezhad 1da4b7f589 updating light version 3 months ago
vahidrezanezhad 543ed4bc38 -light version need -tll to be enabled otherwise the process will be ended. 3 months ago
Clemens Neudecker 51f6ef63f5 Merge pull request #137 from qurator-spk/dockerfile: Add Dockerfile and make docker target 3 months ago
kba b13759fdcf ci: smoke-test make docker 3 months ago
kba c487be2a1d dockerfile: use src-layout 3 months ago
kba 7eb1390583 Merge branch 'main' into dockerfile 3 months ago
vahidrezanezhad ab63d5ba40 updating light version features 3 months ago
vahidrezanezhad 1774076f4a updating light version. Remove textlines or textregion contours inside a bigger one 3 months ago
vahidrezanezhad ad32316217 updating light version 3 months ago
vahidrezanezhad 133091137d dilation of textregions and marginals are accomplished 3 months ago
vahidrezanezhad 95effe54a0 updating textregions dilation 3 months ago
vahidrezanezhad b33739adee parametriyation in the case of textline contours dilation is accomplished 3 months ago
vahidrezanezhad 6626dc6866 updating textline dilation parameters 3 months ago
vahidrezanezhad 62f8ae4860 updating dilation of textlines and text regions 3 months ago
vahidrezanezhad 7f08458436 dilation of text regions without opencv 3 months ago
vahidrezanezhad 5d680136a4 updating light version 3 months ago
vahidrezanezhad b9e8959c4a update of light versions 3 months ago
vahidrezanezhad 2d18739d9b postprocessing of textline contour dilation + skip layout and reading order passed as an argument 3 months ago
Clemens Neudecker 4af0bc079c Merge pull request #132 from qurator-spk/extracting_images_only: Extracting images only 3 months ago
vahidrezanezhad 5a07cd9cfa the most effective version of contours dilation without opencv and all at once 3 months ago