Commit Graph

540 Commits (addb5729224fb34c658d546e89bf0ee3d5ea1706)

Author SHA1 Message Date
Mike Gerber addb572922
Merge pull request #143 from qurator-spk/chore/update-pre-commit
⚙ pre-commit: update
5 days ago
Mike Gerber 1ebb004386 ⚙ pre-commit: update 5 days ago
Mike Gerber c3aa48ec3b Merge branch 'master' of https://github.com/qurator-spk/dinglehopper 6 days ago
Mike Gerber 628594ef98 📦 v0.11.0 6 days ago
Mike Gerber d7814db705
Merge pull request #142 from qurator-spk/feat/flex-line-dirs
Feat/flex line dirs
6 days ago
Mike Gerber 5639f3db7f ✔ Add a test that checks if plain text files with BOM are read correctly 6 days ago
Mike Gerber 9fc8937324 ✒ README: Mention dinglehopper-line-dirs --help 6 days ago
Mike Gerber 14a4bc56d8 🐛 Add --plain-encoding option to dinglehopper-extract 1 week ago
Mike Gerber a70260c10e 🐛 Use warning() to fix DeprecationWarning 1 week ago
Gerber, Mike 224aa02163 🚧 Fix help text 1 week ago
Gerber, Mike 9db5b4caf5 🚧 Add OCR-D parameter for plain text encoding 1 week ago
Gerber, Mike 5578ce83a3 🚧 Add option for text encoding to line dir cli 1 week ago
Gerber, Mike cf59b951a3 🚧 Add option for text encoding to line dir cli 1 week ago
Gerber, Mike 480b3cf864 ✔ Test that CLI produces a complete HTML report 1 week ago
Gerber, Mike f1a586cff1 ✔ Test line dirs CLI 1 week ago
Gerber, Mike 3b16c14c16 ✔ Properly test line dir finding 1 week ago
Gerber, Mike 322faeb26c 🎨 Sort imports 1 week ago
Gerber, Mike c37316da09 🐛 cli_line_dirs: Fix word differences section
At the time of generation of the section, the {gt,ocr}_words generators
were drained. Fix by using a list.

Fixes gh-124.
1 week ago
Gerber, Mike 9414a92f9f 🐛 cli_line_dirs: Type-annotate functions 1 week ago
Gerber, Mike 68344e48f8 🎨 Reformat cli_line_dirs 1 week ago
Gerber, Mike 73ee16fe51 🚧 Support 'merged' GT+OCR line directories 1 week ago
Gerber, Mike 6980d7a252 🚧 Use our own removesuffix() as we still support Python 3.8 1 week ago
Gerber, Mike 2bf2529c38 🚧 Port new line dir functions 1 week ago
Gerber, Mike ad8e6de36b 🐛 cli_line_dirs: Fix character diff reports 1 week ago
Gerber, Mike 4024e350f7 🚧 Test new flexible line dirs functions 1 week ago
Mike Gerber 3c317cbeaf
Merge pull request #141 from qurator-spk/chore/update-pre-commit
⚙ pre-commit: update
1 week ago
Mike Gerber d8403421fc ⚙ pre-commit: update 1 week ago
Mike Gerber 3305043234
Merge pull request #140 from qurator-spk/fix/vendor-strings
🐛 Fix vendor strings
1 week ago
Mike Gerber 6bf5bd7178 🐛 Fix vendor strings 1 week ago
Mike Gerber 817e0c95f7 📦 v0.10.1 1 week ago
Mike Gerber 3d7c7ee1e3
Merge pull request #139 from bertsky/allow-uniseg-py38
re-allow uniseg 0.8 and py38
1 week ago
Robert Sachunsky a24623b966 re-allow py38 2 weeks ago
Robert Sachunsky ea33602336 CI: reactivate py38 2 weeks ago
Robert Sachunsky 64444dd419 opt out of 7f8a8dd5 (uniseg update that requires py39) 2 weeks ago
Mike Gerber f6dfb77f94 🐛 pyproject.toml: Fix description 2 weeks ago
Mike Gerber ef817cb343 📦 v0.10.0 2 weeks ago
Mike Gerber b1c109baae
Merge pull request #128 from kba/v3-api
V3 api
2 weeks ago
Mike Gerber 13ab1ae150 🐛 Docker: Use same vendor as license for now 2 weeks ago
Mike Gerber d974369e13 🐛 Docker: Fix description 2 weeks ago
Mike Gerber b7bdca4ac8 🐛 Makefile: Make phony targets .PHONY 2 weeks ago
kba 831a24fc4c typo: report_prefix -> file_id 2 weeks ago
Konstantin Baierer f6a2c94520 ocrd_cli: but do check for existing output files
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
2 weeks ago
Konstantin Baierer 4162836612 ocrd_cli: no need to check fileGrp dir exists
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
2 weeks ago
Konstantin Baierer c0aa82d188 OCR-D processor: properly handle missing or non-downloaded GT/OCR file
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
2 weeks ago
kba 8c1b6d65f5 Dockerfile: build ocrd-all-tool.json 2 weeks ago
Mike Gerber f287386c0e 🧹 Don't pin uniseg and rapidfuzz
Breakage with the newest uniseg API was fixed in master.

Can't see any issue with rapidfuzz, so removing that pin, too.
2 weeks ago
kba 63031b30bf Port to OCR-D/core API v3 2 weeks ago
Mike Gerber bf6633be02
Merge pull request #136 from qurator-spk/chore/update-liccheck
⚙ liccheck: update permissible licenses (mit-cmu, psf 2.0, iscl)
2 weeks ago
Mike Gerber d3aa9eb520 ⚙ liccheck: update permissible licenses (mit-cmu, psf 2.0, iscl) 2 weeks ago
Mike Gerber 625686f204
Merge pull request #135 from qurator-spk/chore/update-python-version
⚙ pyproject.toml: Update supported Python version
2 weeks ago
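
Note on commit c37316da09 (cli_line_dirs: Fix word differences section): the message says the {gt,ocr}_words generators were already drained when the section was generated, and that the fix was to use a list. The following is a minimal sketch of that general Python behaviour, not the project's actual code. The words() helper and the sample text are hypothetical and exist only for illustration.

```python
def words(text):
    """Hypothetical stand-in for a word generator: yields whitespace-separated tokens."""
    yield from text.split()


# A generator can be consumed only once.
gt_words = words("the quick brown fox")
first_pass = list(gt_words)   # consumes every item
second_pass = list(gt_words)  # generator is now drained, so this is empty

# Materializing the generator into a list allows repeated iteration.
gt_words_list = list(words("the quick brown fox"))
first_use = [w.upper() for w in gt_words_list]
second_use = [len(w) for w in gt_words_list]

print(first_pass)
print(second_pass)
print(first_use)
print(second_use)
```

A list costs memory proportional to the number of words, which seems an acceptable trade when the same words have to be read by more than one report section.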