Commit Graph

237 Commits (2e8a3e3bee5bb926121b5bba630270c8044f5fe4)
 

Author SHA1 Message Date
Konstantin Baierer 2e8a3e3bee use Page.imageFilename directly for accurate DPI estimate
Konstantin Baierer 42ccb4711d
Update qurator/eynollah/ocrd-tool.json
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
Konstantin Baierer 4897cefdb7 allow passing PIL image to Eynollah w/o disk I/O
Konstantin Baierer ba561ec833 Merge branch 'ocrd-cli' of github.com:qurator-spk/eynollah into ocrd-cli
Konstantin Baierer d40c453dad
check_dpi: raise exception if resolution == 1 to trigger except clause
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
Konstantin Baierer b11558cd4d Merge branch 'ocrd-cli' of github.com:qurator-spk/eynollah into ocrd-cli
Konstantin Baierer 1367f82605
improve ocrd-tool descriptions
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
vahidrezanezhad 037210b292
update writer.py
Konstantin Baierer 8f7cf5d1fb setup.py: include json data
Konstantin Baierer 5e260eb448 setup.py: include json data
Konstantin Baierer b8d818ede1 writer: don't create empty PcGts at init
Konstantin Baierer 8c4e9b6068 allow passing pcgts to eynollah and writer
Konstantin Baierer 2bc34891a5 fix CLI call
Konstantin Baierer 9db6edf51e OCR-D CLI
Konstantin Baierer 1715f0d8b3 allow overriding DPI
vahidrezanezhad 3e4ac11347
Merge pull request from qurator-spk/ocrd-page-api-fix
Ocrd page api fix
Konstantin Baierer c95529725a 🐛 typo type{,_}
Konstantin Baierer 93f93444ee 🐛 typo {c,C}oords
vahidrezanezhad b643ced9fb
Merge pull request from qurator-spk/ocrd-page-api
replace lxml with OCR-D/core PAGE API
Konstantin Baierer f41d267456 Merge remote-tracking branch 'origin/main' into ocrd-page-api
Konstantin Baierer 416a84e542 replace lxml with OCR-D/core PAGE API
vahidrezanezhad 17c0f8ab11
Merge pull request from qurator-spk/empty-page-fix
fix call to build xml for empty pages, fix 
Konstantin Baierer 517843fe8b fix call to build xml for empty pages, fix
vahidrezanezhad 68e6f5c712
Merge pull request from qurator-spk/xml-rfct
Xml rfct
vahidrezanezhad 7a859ffae4
Merge branch 'main' into xml-rfct
vahidrezanezhad d5a9817390
back on track- freezing problem , memory error and issues with reading order by drop capitals and marginals are resolved
vahidrezanezhad 43b8759acf
back on track- freezing problem , memory error and issues with reading order by drop capitals and marginals are resolved
vahid b473c85a59 OOM error happend with tensorflow-gpu=1.15.5 is resolved
Konstantin Baierer 3d9da4feaa writer: use a single counter for all regions/lines
Konstantin Baierer a678bbf966 counter: add reset();
Konstantin Baierer a3465ca1a0 eliminate id_of_texts from xml_reading_order, fix plus one error
Konstantin Baierer 6c60d9e90a reading order: fix @index
Konstantin Baierer 02aa31cc66 Merge remote-tracking branch 'origin/main' into xml-rfct
Konstantin Baierer c5736e9b74 fix region counting
vahid 67a9fc8820 ..
vahidrezanezhad 4b3c8a6707
bug in reading order is fixed
vahidrezanezhad 73b7c780ab
Update eynollah.py
reading order bug for documents with text regions less than 5: fixed
Konstantin Baierer 03d75f5788 simplify serialize_lines_in_region
Konstantin Baierer d95fcf14c0 id_of_marginalia still necessary
Konstantin Baierer 56b688befe counter: allow arbitrary line/region id
Konstantin Baierer fffa207658 Merge branch 'xml-rfct' of https://github.com/qurator-spk/eynollah into xml-rfct
# Conflicts:
#	qurator/eynollah/writer.py
Konstantin Baierer 7eb973b3aa xml_reading_order takes id_of_marginals directly
vahidrezanezhad 6b2a6588fa
Ein klein bug gefixt
Konstantin Baierer 98568402c7 counter: init-overrideable
Konstantin Baierer 9b1da7c023 use counter for lines too
Konstantin Baierer 1cd3ee1a2e simplify calculate_polygon_coords
Konstantin Baierer 20fcac6232 remove unnecessary if
Konstantin Baierer 24da879844 add EynollahIdCounter class
Konstantin Baierer 9f5e4af5f0 factor out marginalia ID calc from xml_reading_order
Konstantin Baierer 630002d96d minor clean up xml_reading_order