Commit graph

253 commits

Author SHA1 Message Date
vahid
7cbecadccc adding the binarization model and option to binarize input document for the cases like dark, strongly bright and other ones 2021-04-25 18:20:05 -04:00
vahid
44dad6a072 strong erosion, more modification 2021-04-23 13:48:21 -04:00
Konstantin Baierer
78d29f34c1
Merge pull request #36 from mikegerber/fix/ocrd-core-req
🐛 Fix ocrd core requirement
2021-04-23 09:32:12 +02:00
70e7316907 🐛 Fix ocrd core requirement
eynollah requires ocrd >= 2.22.0 for the resource resolving code,
otherwise it fails with an AttributeError. Fix this by bumping up the
requirement.

I bumped it to 2.23.3 so core *also* includes the latest model resource
for eynollah.
2021-04-22 20:06:31 +02:00
vahidrezanezhad
176c7531ab Update eynollah.py 2021-04-22 11:01:58 -04:00
vahid
f8cc048c76 Merge branch 'main' of https://github.com/qurator-spk/eynollah into main 2021-04-22 11:04:48 -04:00
vahid
c051e22432 fixing again the error raised because of erosion 2021-04-22 11:02:32 -04:00
Konstantin Baierer
12f5b2982c 📦 v0.0.1 2021-04-22 15:41:17 +02:00
vahidrezanezhad
d5be8aece3
Merge pull request #33 from qurator-spk/ocrd-cli
Ocrd cli
2021-04-22 15:22:22 +02:00
Konstantin Baierer
6c8852eb04 check_dpi: catch Pillow choking on faulty img, return 230 2021-04-22 13:12:40 +02:00
Konstantin Baierer
ff265eee5c cv2pil: do COLOR_BGR2RGB conversion 2021-04-22 12:57:04 +02:00
Konstantin Baierer
c7f304dcb6 ocrd processor: pass local filename as image_filename, ht @bertsky 2021-04-22 12:31:00 +02:00
Konstantin Baierer
d0b0e23ac6 do DPI calculation as part of caching images 2021-04-22 12:07:14 +02:00
Konstantin Baierer
ae0b4a825a ocrd cli: catch dpi == 1, return 230 2021-04-22 10:28:01 +02:00
Konstantin Baierer
2e8a3e3bee use Page.imageFilename directly for accurate DPI estimate 2021-04-21 18:30:48 +02:00
Konstantin Baierer
42ccb4711d
Update qurator/eynollah/ocrd-tool.json
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
2021-04-21 10:55:28 +02:00
vahid
1184d3d2fc issue raised by Clemens, strong erosion causing 2021-04-18 17:59:18 -04:00
Konstantin Baierer
4897cefdb7 allow passing PIL image to Eynollah w/o disk I/O 2021-04-15 17:25:05 +02:00
Konstantin Baierer
ba561ec833 Merge branch 'ocrd-cli' of github.com:qurator-spk/eynollah into ocrd-cli 2021-04-15 15:28:18 +02:00
Konstantin Baierer
d40c453dad
check_dpi: raise exception if resolution == 1 to trigger except clause
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
2021-04-14 17:42:37 +02:00
Konstantin Baierer
b11558cd4d Merge branch 'ocrd-cli' of github.com:qurator-spk/eynollah into ocrd-cli 2021-04-14 17:41:19 +02:00
Konstantin Baierer
1367f82605
improve ocrd-tool descriptions
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
2021-04-14 17:31:57 +02:00
vahidrezanezhad
037210b292 update writer.py 2021-04-14 08:55:54 -04:00
Konstantin Baierer
8f7cf5d1fb setup.py: include json data 2021-04-14 11:58:12 +02:00
Konstantin Baierer
5e260eb448 setup.py: include json data 2021-04-14 11:55:44 +02:00
Konstantin Baierer
b8d818ede1 writer: don't create empty PcGts at init 2021-04-14 10:53:32 +02:00
Konstantin Baierer
8c4e9b6068 allow passing pcgts to eynollah and writer 2021-04-14 10:38:07 +02:00
Konstantin Baierer
2bc34891a5 fix CLI call 2021-04-13 17:55:22 +02:00
vahid
98f9272c4b a trivial issue is resolved 2021-04-13 11:43:44 -04:00
Konstantin Baierer
9db6edf51e OCR-D CLI 2021-04-13 17:38:02 +02:00
Konstantin Baierer
1715f0d8b3 allow overriding DPI 2021-04-13 16:03:58 +02:00
vahidrezanezhad
3e4ac11347
Merge pull request #32 from qurator-spk/ocrd-page-api-fix
Ocrd page api fix
2021-04-13 08:09:54 -04:00
Konstantin Baierer
c95529725a 🐛 typo type{,_} 2021-04-13 13:55:51 +02:00
Konstantin Baierer
93f93444ee 🐛 typo {c,C}oords 2021-04-13 13:55:31 +02:00
vahidrezanezhad
b643ced9fb
Merge pull request #31 from qurator-spk/ocrd-page-api
replace lxml with OCR-D/core PAGE API
2021-04-12 07:47:44 -04:00
Konstantin Baierer
f41d267456 Merge remote-tracking branch 'origin/main' into ocrd-page-api 2021-04-12 13:26:51 +02:00
Konstantin Baierer
416a84e542 replace lxml with OCR-D/core PAGE API 2021-04-12 13:25:29 +02:00
vahidrezanezhad
17c0f8ab11
Merge pull request #30 from qurator-spk/empty-page-fix
fix call to build xml for empty pages, fix #29
2021-04-08 05:33:07 -04:00
Konstantin Baierer
517843fe8b fix call to build xml for empty pages, fix #29 2021-04-08 10:39:56 +02:00
vahidrezanezhad
68e6f5c712
Merge pull request #28 from qurator-spk/xml-rfct
Xml rfct
2021-04-07 12:34:35 -04:00
vahidrezanezhad
7a859ffae4 Merge branch 'main' into xml-rfct 2021-04-07 12:34:23 -04:00
vahidrezanezhad
d5a9817390 back on track: freezing problem, memory error, and reading-order issues caused by drop capitals and marginals are resolved 2021-04-07 11:35:42 -04:00
vahidrezanezhad
43b8759acf back on track: freezing problem, memory error, and reading-order issues caused by drop capitals and marginals are resolved 2021-04-07 11:34:35 -04:00
vahid
b473c85a59 OOM error that happened with tensorflow-gpu=1.15.5 is resolved 2021-04-01 11:22:08 -04:00
Konstantin Baierer
3d9da4feaa writer: use a single counter for all regions/lines 2021-03-12 19:29:52 +01:00
Konstantin Baierer
a678bbf966 counter: add reset(); 2021-03-12 18:39:27 +01:00
Konstantin Baierer
a3465ca1a0 eliminate id_of_texts from xml_reading_order, fix plus one error 2021-03-11 19:44:42 +01:00
Konstantin Baierer
6c60d9e90a reading order: fix @index 2021-03-10 17:27:18 +01:00
Konstantin Baierer
02aa31cc66 Merge remote-tracking branch 'origin/main' into xml-rfct 2021-03-10 12:12:31 +01:00
Konstantin Baierer
c5736e9b74 fix region counting 2021-03-10 11:52:01 +01:00