Commit Graph

54 Commits (main)

Author SHA1 Message Date
vahidrezanezhad 7983a65006 filtering separators in a correct way without missing them 8 months ago
cneud d3b06baa84 fix some typos 10 months ago
Robert Sachunsky 867a7261de
pil_cv2.check_dpi: fix class membership test
(depending on how the `PIL.Image` was instantiated – file plugin or array interface – the previous `isinstance` could fail, provoking a fall-through to `cv2pil` which does not work)
1 year ago
vahid 0b35011847 right2left reading order detection accomplished 1 year ago
vahid 0cda1f3c7a reading order type 1: right to left 1 year ago
vahidrezanezhad 1621532092
Merge branch 'main' into eynollah_light 1 year ago
Robert Sachunsky a56988a35a contours: numpy now needs dtype=object 1 year ago
vahid 8d5079c909 issue #77 is resolved on main branch 2 years ago
vahid 402c5339ac issue #77 is resolved 2 years ago
vahid c606391c31 flow from directory 2 years ago
Gerber, Mike 11d9b00510 🧹 Don't produce spurious TextEquiv elements.
eynollah produces spurious - and empy - pcGts TextEquiv elements. This
is a. unnecessary, b. wrong and c. produces a lot of warning messages
in subsequent OCR processing steps because the OCR processor warns
about already existing text.

Fix this by not generating any TextEquiv elements.

Fixes gh-37.
2 years ago
vahid 254abf4d3d more modifications for tables 3 years ago
vahid a5c940705a tables are integrated 3 years ago
vahid 80b17af40c #47 fixed 3 years ago
vahid becb0c1329 trivial 3 years ago
vahid 059905c9e4 #43 empty textlines caused by newer python-opencv, is resolved 3 years ago
vahid c4b2c71e68 resolving issue https://github.com/qurator-spk/eynollah/issues/38 3 years ago
vahid 7cbecadccc adding the binarization model and option to binarize input document for the cases like dark, stronly bright and other ones 3 years ago
vahid 44dad6a072 strong erosion, more modification 3 years ago
Konstantin Baierer 6c8852eb04 check_dpi: catch Pillow choking on faulty img, return 230 3 years ago
Konstantin Baierer ff265eee5c cv2pil: do COLOR_BGR2RGB conversion 3 years ago
Konstantin Baierer d0b0e23ac6 do DPI calculation as part of caching images 3 years ago
Konstantin Baierer 4897cefdb7 allow passing PIL image to Eynollah w/o disk I/O 3 years ago
Konstantin Baierer d40c453dad
check_dpi: raise exception if resolution == 1 to trigger except clause
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
3 years ago
Konstantin Baierer 1715f0d8b3 allow overriding DPI 3 years ago
Konstantin Baierer 416a84e542 replace lxml with OCR-D/core PAGE API 3 years ago
Konstantin Baierer a678bbf966 counter: add reset(); 3 years ago
Konstantin Baierer a3465ca1a0 eliminate id_of_texts from xml_reading_order, fix plus one error 3 years ago
Konstantin Baierer 6c60d9e90a reading order: fix @index 3 years ago
Konstantin Baierer 02aa31cc66 Merge remote-tracking branch 'origin/main' into xml-rfct 3 years ago
Konstantin Baierer c5736e9b74 fix region counting 3 years ago
vahid 67a9fc8820 .. 3 years ago
vahidrezanezhad 4b3c8a6707
bug in reading order is fixed 3 years ago
Konstantin Baierer 56b688befe counter: allow arbitrary line/region id 3 years ago
Konstantin Baierer 7eb973b3aa xml_reading_order takes id_of_marginals directly 3 years ago
Konstantin Baierer 98568402c7 counter: init-overrideable 3 years ago
Konstantin Baierer 24da879844 add EynollahIdCounter class 3 years ago
Konstantin Baierer 9f5e4af5f0 factor out marginalia ID calc from xml_reading_order 3 years ago
Konstantin Baierer 630002d96d minor clean up xml_reading_order 3 years ago
Konstantin Baierer 62fa710f27 move order_and_id_of_texts to utils.xml 3 years ago
Konstantin Baierer 8d4ce75656 typo s,teil,tile,g 3 years ago
Konstantin Baierer 18475b3c20 remaining typos idenitified by codespell 3 years ago
Konstantin Baierer adb96b5b49 s,hiearchy,hierarchy,g 3 years ago
Konstantin Baierer 10ae13f9b0 typo: s,separater,separator, 3 years ago
Konstantin Baierer 78a7ace863 typo: s,hirarchy,hierarchy, 3 years ago
Konstantin Baierer 526769354a typo: s,seperator,separator, 3 years ago
Konstantin Baierer ec9939c3c7 typo: s,hierachy,hierarchy, 3 years ago
Konstantin Baierer 5f2867bfcf typo: s,contoures,contours,g 3 years ago
Konstantin Baierer ccd83e94b1 typo: s,cprrsp,corresp,g 3 years ago
Konstantin Baierer 1f95a69bec typo: s,rotyate,rotate,g 3 years ago