Commit graph

129 commits

Author SHA1 Message Date
Robert Sachunsky
055463d23a avoid indentation 2024-12-05 09:43:30 +00:00
Robert Sachunsky
aaea2ef463 simplify 2024-12-05 09:40:02 +00:00
Robert Sachunsky
3d88b207fc run: log instead of print 2024-12-05 09:39:55 +00:00
Robert Sachunsky
a520bd1f77 wrap extremely long lines 2024-12-04 23:04:51 +00:00
Robert Sachunsky
cd4e426977 avoid indentation (skip_layout_and_reading_order) 2024-12-04 23:04:48 +00:00
Robert Sachunsky
5b82320707 avoid indentation 2024-12-04 22:09:32 +00:00
Robert Sachunsky
9f12fa241d log-level: only set 'eynollah' logger level 2024-12-04 22:09:15 +00:00
Robert Sachunsky
14beb46224 simplify loading models w/o dir_in mode 2024-12-04 21:07:26 +00:00
Robert Sachunsky
329fac23f6 do not reload enhancement model in dir_in mode, simplify 2024-12-04 18:29:49 +00:00
Robert Sachunsky
3b9a29bc5c simplify dir_in conditionals 2024-12-04 18:19:54 +00:00
Robert Sachunsky
7ae64f3717 RO model: do not reload when in dir_in mode 2024-12-04 16:18:35 +00:00
vahidrezanezhad
871d7bfc5a fixed: machine based reading order cause tuple index out of range error if number of textregion is one. 2024-12-04 16:41:00 +01:00
vahidrezanezhad
6aad006f4c filter textregions without textline 2024-12-02 12:43:57 +01:00
vahidrezanezhad
b622494f34 new table detection model is integrated 2024-11-21 02:16:22 +01:00
vahidrezanezhad
d9f79c3404 fixing IndexError by reading order detection 2024-11-18 10:15:19 +01:00
vahidrezanezhad
f43c49c508 textlines of drop capitals are connected to corresponding textline if possible otherwise they are inserted in corresponding textregion 2024-11-13 11:53:56 +01:00
vahidrezanezhad
22b0b07a73 drop capital and marginals extraction is updated 2024-11-11 19:01:40 +01:00
vahidrezanezhad
8409de0e58 sbb_binarization is integrated into eynollah works in framework of ocrd - sbb_binarization in ocrd works for individual images by the way as standalone flowing from directory can be used now. For eynollah in ocrd framework I have added -light version as default parameter. 2024-11-10 19:34:43 +01:00
vahidrezanezhad
0914b5ff8a resolve merge conflict of main branch with machine based reading order branch 2024-11-06 00:34:00 +01:00
vahidrezanezhad
6aee70d0cd Resolve merge conflict of main and machine based reading order branch 2024-11-06 00:10:25 +01:00
vahidrezanezhad
f7e5fb917f resolving merge conflict of machine based reading order and extracting only images branches 2024-11-05 22:09:39 +01:00
michalbubula
d168edfd77
Update cli.py to block other processing in the case of extract_image_only 2024-09-19 15:20:37 +02:00
vahidrezanezhad
74a0699f6b extracting images only now works for a single image input 2024-09-19 11:20:13 +02:00
Clemens Neudecker
351e9a897a
update ocrd-tool.json with v0.3.1 models 2024-09-17 21:32:23 +02:00
vahidrezanezhad
6b2e5d110e all tests are passed 2024-09-03 13:55:55 +02:00
cneud
b6d3d2bdbf fix indentation 2024-09-02 20:11:42 +02:00
cneud
de32d86fb6 Merge branch 'refs/heads/main' into extracting_images_only
# Conflicts:
#	src/eynollah/eynollah.py
2024-09-02 19:55:33 +02:00
kba
c6e0e058d0 Merge branch 'main' into v3-api
# Conflicts:
#	pyproject.toml
#	src/eynollah/cli.py
2024-09-02 14:53:37 +02:00
kba
84b844203d switch from qurator namespace to src-layout 2024-08-29 17:11:29 +02:00