335aa273a1simplify, wrap extremely long lines
Robert Sachunsky
2024-12-23 03:13:21 +0000
cfc65128b1reduce redundancy/indentation
Robert Sachunsky
2024-12-22 14:56:32 +0000
01376af905do_order_of_regions_with_model: simplify
Robert Sachunsky
2024-12-22 13:10:05 +0000
92bfac4b41Provide OCR as an option to process a directory of XML files, incorporating layout and text line coordinates.
vahidrezanezhad
2024-12-20 15:47:21 +0100
329fac23f6do not reload enhancement model in dir_in mode, simplify
Robert Sachunsky
2024-12-04 18:29:49 +0000
3b9a29bc5csimplify dir_in conditionals
Robert Sachunsky
2024-12-04 18:19:54 +0000
7ae64f3717RO model: do not reload when in dir_in mode
Robert Sachunsky
2024-12-04 16:18:35 +0000
f765e2603bmove Torch to optional dependencies (to avoid clash with TF over CuDNN)
Robert Sachunsky
2024-12-04 15:57:13 +0000
871d7bfc5afixed: machine based reading order cause tuple index out of range error if number of textregion is one.
vahidrezanezhad
2024-12-04 16:41:00 +0100
6aad006f4cfilter textregions without textline
vahidrezanezhad
2024-12-02 12:43:57 +0100
1083d1c7fbgha: try to free disk space
kba
2024-11-25 19:32:42 +0100
ce5b611296tests are passed - new models by the way should be uploaded
vahidrezanezhad
2024-11-14 17:18:07 +0100
f43c49c508textlines of drop capitals are connected to corresponding textline if possible otherwise they are inserted in corresponding textregion
vahidrezanezhad
2024-11-13 11:53:56 +0100
22b0b07a73drop capital and marginals extraction is updated
vahidrezanezhad
2024-11-11 19:01:40 +0100
Update requirements.txt
Clemens Neudecker
2024-11-11 14:11:36 +0100
8409de0e58sbb_binarization is integrated into eynollah works in framework of ocrd - sbb_binarization in ocrd works for individual images by the way as standalone flowing from directory can be used now. For eynollah in ocrd framework I have added -light version as default parameter.
vahidrezanezhad
2024-11-10 19:34:43 +0100
0914b5ff8aresolve merge conflict of main branch with machine based reading order branch
vahidrezanezhad
2024-11-06 00:34:00 +0100
6aee70d0cdResolve merge conflict of main and machine based reading order branch
vahidrezanezhad
2024-11-06 00:10:25 +0100
82281bd6cffixing a bug occuring with reading order + Slro option with no patch textline model and thresholding artificial class
vahidrezanezhad
2024-10-25 19:42:48 +0200
70772d4104binarization as a standalone command
vahidrezanezhad
2024-10-21 23:46:38 +0200
f93fa12441doing more multiprocessing in order to make the process faster
vahidrezanezhad
2024-10-18 09:14:42 +0200
3ef4eac24ctextlines of textregions are extracted in a faster way + early layout for all documents is done with no patches model and on rgb input
vahidrezanezhad
2024-10-17 19:12:28 +0200