vahidrezanezhad
7983a65006
filtering separators in a correct way without missing them
8 months ago
cneud
d3b06baa84
fix some typos
10 months ago
Robert Sachunsky
867a7261de
pil_cv2.check_dpi: fix class membership test
...
(depending on how the `PIL.Image` was instantiated – file plugin or array interface – the previous `isinstance` could fail, provoking a fall-through to `cv2pil` which does not work)
1 year ago
vahid
0b35011847
right2left reading order detection accomplished
1 year ago
vahid
0cda1f3c7a
reading order type 1: right to left
1 year ago
vahidrezanezhad
1621532092
Merge branch 'main' into eynollah_light
1 year ago
Robert Sachunsky
a56988a35a
contours: numpy now needs dtype=object
1 year ago
vahid
8d5079c909
issue #77 is resolved on main branch
2 years ago
vahid
402c5339ac
issue #77 is resolved
2 years ago
vahid
c606391c31
flow from directory
2 years ago
Gerber, Mike
11d9b00510
🧹 Don't produce spurious TextEquiv elements.
...
eynollah produces spurious - and empy - pcGts TextEquiv elements. This
is a. unnecessary, b. wrong and c. produces a lot of warning messages
in subsequent OCR processing steps because the OCR processor warns
about already existing text.
Fix this by not generating any TextEquiv elements.
Fixes gh-37.
2 years ago
vahid
254abf4d3d
more modifications for tables
3 years ago
vahid
a5c940705a
tables are integrated
3 years ago
vahid
80b17af40c
#47 fixed
3 years ago
vahid
becb0c1329
trivial
3 years ago
vahid
059905c9e4
#43 empty textlines caused by newer python-opencv, is resolved
3 years ago
vahid
c4b2c71e68
resolving issue https://github.com/qurator-spk/eynollah/issues/38
3 years ago
vahid
7cbecadccc
adding the binarization model and option to binarize input document for the cases like dark, stronly bright and other ones
3 years ago
vahid
44dad6a072
strong erosion, more modification
3 years ago
Konstantin Baierer
6c8852eb04
check_dpi: catch Pillow choking on faulty img, return 230
3 years ago
Konstantin Baierer
ff265eee5c
cv2pil: do COLOR_BGR2RGB conversion
3 years ago
Konstantin Baierer
d0b0e23ac6
do DPI calculation as part of caching images
3 years ago
Konstantin Baierer
4897cefdb7
allow passing PIL image to Eynollah w/o disk I/O
3 years ago
Konstantin Baierer
d40c453dad
check_dpi: raise exception if resolution == 1 to trigger except clause
...
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
3 years ago
Konstantin Baierer
1715f0d8b3
allow overriding DPI
3 years ago
Konstantin Baierer
416a84e542
replace lxml with OCR-D/core PAGE API
3 years ago
Konstantin Baierer
a678bbf966
counter: add reset();
3 years ago
Konstantin Baierer
a3465ca1a0
eliminate id_of_texts from xml_reading_order, fix plus one error
3 years ago
Konstantin Baierer
6c60d9e90a
reading order: fix @index
3 years ago
Konstantin Baierer
02aa31cc66
Merge remote-tracking branch 'origin/main' into xml-rfct
3 years ago
Konstantin Baierer
c5736e9b74
fix region counting
3 years ago
vahid
67a9fc8820
..
3 years ago
vahidrezanezhad
4b3c8a6707
bug in reading order is fixed
3 years ago
Konstantin Baierer
56b688befe
counter: allow arbitrary line/region id
3 years ago
Konstantin Baierer
7eb973b3aa
xml_reading_order takes id_of_marginals directly
3 years ago
Konstantin Baierer
98568402c7
counter: init-overrideable
3 years ago
Konstantin Baierer
24da879844
add EynollahIdCounter class
3 years ago
Konstantin Baierer
9f5e4af5f0
factor out marginalia ID calc from xml_reading_order
3 years ago
Konstantin Baierer
630002d96d
minor clean up xml_reading_order
3 years ago
Konstantin Baierer
62fa710f27
move order_and_id_of_texts to utils.xml
3 years ago
Konstantin Baierer
8d4ce75656
typo s,teil,tile,g
3 years ago
Konstantin Baierer
18475b3c20
remaining typos idenitified by codespell
3 years ago
Konstantin Baierer
adb96b5b49
s,hiearchy,hierarchy,g
3 years ago
Konstantin Baierer
10ae13f9b0
typo: s,separater,separator,
3 years ago
Konstantin Baierer
78a7ace863
typo: s,hirarchy,hierarchy,
3 years ago
Konstantin Baierer
526769354a
typo: s,seperator,separator,
3 years ago
Konstantin Baierer
ec9939c3c7
typo: s,hierachy,hierarchy,
3 years ago
Konstantin Baierer
5f2867bfcf
typo: s,contoures,contours,g
3 years ago
Konstantin Baierer
ccd83e94b1
typo: s,cprrsp,corresp,g
3 years ago
Konstantin Baierer
1f95a69bec
typo: s,rotyate,rotate,g
3 years ago