Commit Graph

684 Commits (1a0b9d19589aab33a260b46ad416044b3f2d16e3)
 

Author SHA1 Message Date
Robert Sachunsky 1a0b9d1958
Merge pull request #1 from bertsky/v3-api-refactor-init
refactoring of Eynollah init and model loading
2 weeks ago
Robert Sachunsky 559d001eef another fix to avoid frequent warnings 2 weeks ago
Robert Sachunsky dd478279a4 CLI: also --overwrite in single-image mode 2 weeks ago
Robert Sachunsky 8159e6336a fix typo (preventing log messages) 2 weeks ago
Robert Sachunsky 2919538382 minor fixes to avoid frequent warnings 2 weeks ago
Robert Sachunsky 903c87aca0 update readme (OCR-D section) 2 weeks ago
Robert Sachunsky dcf2ed5e22 run: also write out XML in single filename mode 2 weeks ago
Robert Sachunsky fe77171d45 run_single: reduce indentation 2 weeks ago
Robert Sachunsky c7dc952851 smoke-test: also test dir-in mode and overwrite 2 weeks ago
Robert Sachunsky 79003a083c CLI: ValueError instead of print+exit 2 weeks ago
Robert Sachunsky e17d34fafa factor run_single() out of run(), simplify kwargs 2 weeks ago
Robert Sachunsky 1a0a1cb00b remove session methods and redundant model loaders 2 weeks ago
Robert Sachunsky ab3da17547
Update requirements.txt
Co-authored-by: Konstantin Baierer <kba@users.noreply.github.com>
2 weeks ago
Robert Sachunsky dd51f900b9 OCR-D: init Eynollah in 'setup', re-use instance for each page via non-public API 2 weeks ago
Robert Sachunsky ffeb4a343d Eynollah: remove useless 'pcgts' attr 2 weeks ago
Robert Sachunsky 9dc33db108 CI: add binarization models to cache 2 weeks ago
Robert Sachunsky 9c769d4cc5 CI: run CLI tests, too 2 weeks ago
Robert Sachunsky 250fc02606 add tests for binarization, remove dependency on deps-test 2 weeks ago
Robert Sachunsky 515b4023f6 sbb_binarize: fix missing reference 2 weeks ago
Robert Sachunsky 95a681aa8c add Continuous Deployment via Dockerhub and GHCR 2 weeks ago
Robert Sachunsky df3510750c
Github Actions CI: no more Docker clean or build 2 weeks ago
Robert Sachunsky 45e3ab9692
Github Actions: free space: all existing Docker images 2 weeks ago
Robert Sachunsky 31aeb9629d
Github Actions: free space more aggressively 2 weeks ago
Robert Sachunsky 7430b57b65 dockerfile: add smoke test 2 weeks ago
Robert Sachunsky f35f49376e run CLI test in TMPDIR, add ocrd-test 2 weeks ago
Robert Sachunsky ae066388ea docker: no need for g++, but install w/ 'EXTRAS=OCR' 2 weeks ago
Robert Sachunsky 722b5c6bf1 add make variable EXTRAS for optional dependencies 2 weeks ago
Robert Sachunsky c01609ff4e allow even more empty imports for optional dependencies 2 weeks ago
Robert Sachunsky 51e9bfd6d7 improve+extend dockerfile 2 weeks ago
Robert Sachunsky 09248d4829 improve+extend makefile 2 weeks ago
Robert Sachunsky 46618f4229 allow more empty imports for optional dependencies 2 weeks ago
Robert Sachunsky 4be89910a2 CLI: fix arg vs kwarg from merge 2 weeks ago
Robert Sachunsky 9d61acf173 simplify 2 weeks ago
Robert Sachunsky a1068ff2eb OCR-D: move sbb-binarize to ocrd-tool.json, update to v3 2 weeks ago
Robert Sachunsky c794d4d29f OCR-D: fix typo light_mode→light_version 2 weeks ago
Robert Sachunsky 4338259ca1 OCR-D: ensure page image gets replaced in result as well if not the original file 2 weeks ago
Robert Sachunsky 55969b0173 OCR-D: add docstring 2 weeks ago
Robert Sachunsky 3916474b8b OCR-D: require >=v3.1 2 weeks ago
Robert Sachunsky 6d02e90570 OCR-D: restrict max_workers=1 2 weeks ago
Robert Sachunsky efd3fa6775 allow empty imports for optional dependencies 2 weeks ago
Robert Sachunsky 238132e260 use 'image_filename' for pseudo-iteration outside 'dir_in' mode 2 weeks ago
Robert Sachunsky af4e2a4ffc do not require 'dir_out' outside 'dir_in' mode 2 weeks ago
Robert Sachunsky ea136e3ddd 'overwrite' check: only in 'dir_in' mode 2 weeks ago
Robert Sachunsky 1f4a17b60d Merge remote-tracking branch 'origin/machine_based_reading_order_integration' into v3-api 2 weeks ago
Robert Sachunsky edf924c2cb ocrd-tool: add dockerhub 2 weeks ago
vahidrezanezhad d3a4c06e7f This commit enables the export of cropped text line images along with their corresponding texts from a Page-XML file. These exported text line images and texts can be utilized for training a text line-based OCR model. 4 weeks ago
vahidrezanezhad c8b8529951 For the CNN-RNN OCR model, long text lines are split into two segments 4 weeks ago
vahidrezanezhad aa72ca3006 Resolved an issue in the OCR-D framework where dir_out received a None value 1 month ago
vahidrezanezhad a4f1f35125 Resolving test failure 1 month ago
kba 54040c1db4 Merge remote-tracking branch 'bertsky/machine_based_reading_order_integration_fixes' into machine_based_reading_order_integration 1 month ago