1
0
Fork 0
mirror of https://github.com/qurator-spk/dinglehopper.git synced 2025-06-09 11:50:00 +02:00
Commit graph

548 commits

Author SHA1 Message Date
1dbeda8454
Merge 829b165869 into 3443edd6d3 2025-05-13 12:42:13 +02:00
3443edd6d3
Merge pull request #145 from bertsky/master
update docker
2025-05-13 12:41:50 +02:00
Robert Sachunsky
b1ef3af1a8 docker: use latest core base stage 2025-05-02 00:19:11 +02:00
Robert Sachunsky
d09e3969f8 docker: prepackage ocrd-all-module-dir.json 2025-05-02 00:19:11 +02:00
b5e99d96c9
Merge pull request #144 from qurator-spk/fix/make-test-results-clearer
✔  GitHub Actions: Make reporting results clearer
2025-04-25 11:31:29 +02:00
774790c36f ✔ GitHub Actions: Make reporting results clearer
In the "Actions" tab on GitHub, the workflow run that would post test results to the
_original_ workflow run is named "Test Report". This would lead me to click on it to see
the results, just to be disappointed.

This aims to make the naming of the GitHub workflows/jobs clearer.
2025-04-25 11:20:00 +02:00
829b165869 🚧 Install possible Pillow build requirements (if building from source) 2025-04-25 10:15:38 +02:00
07328e1b91 ⚙ GitHub Actions: Test on Python 3.14 2025-04-25 10:15:38 +02:00
addb572922
Merge pull request #143 from qurator-spk/chore/update-pre-commit
⚙  pre-commit: update
2025-04-25 10:14:30 +02:00
1ebb004386 ⚙ pre-commit: update 2025-04-25 10:13:06 +02:00
c3aa48ec3b Merge branch 'master' of https://github.com/qurator-spk/dinglehopper 2025-04-24 17:16:06 +02:00
628594ef98 📦 v0.11.0 2025-04-24 17:14:44 +02:00
d7814db705
Merge pull request #142 from qurator-spk/feat/flex-line-dirs
Feat/flex line dirs
2025-04-24 16:48:22 +02:00
5639f3db7f ✔ Add a tests that checks if plain text files with BOM are read correctly 2025-04-24 16:44:29 +02:00
9fc8937324 ✒ README: Mention dinglehopper-line-dirs --help 2025-04-24 15:13:19 +02:00
14a4bc56d8 🐛 Add --plain-encoding option to dinglehopper-extract 2025-04-22 18:24:35 +02:00
a70260c10e 🐛 Use warning() to fix DeprecationWarning 2025-04-22 13:57:19 +02:00
224aa02163 🚧 Fix help text 2025-04-22 13:57:19 +02:00
9db5b4caf5 🚧 Add OCR-D parameter for plain text encoding 2025-04-22 13:57:19 +02:00
5578ce83a3 🚧 Add option for text encoding to line dir cli 2025-04-22 13:57:19 +02:00
cf59b951a3 🚧 Add option for text encoding to line dir cli 2025-04-22 13:57:19 +02:00
480b3cf864 ✔ Test that CLI produces a complete HTML report 2025-04-22 13:57:19 +02:00
f1a586cff1 ✔ Test line dirs CLI 2025-04-22 13:57:18 +02:00
3b16c14c16 ✔ Properly test line dir finding 2025-04-22 13:57:18 +02:00
322faeb26c 🎨 Sort imports 2025-04-22 13:57:18 +02:00
c37316da09 🐛 cli_line_dirs: Fix word differences section
At the time of generation of the section, the {gt,ocr}_words generators
were drained. Fix by using a list.

Fixes gh-124.
2025-04-22 13:57:18 +02:00
9414a92f9f 🐛 cli_line_dirs: Type-annotate functions 2025-04-22 13:57:18 +02:00
68344e48f8 🎨 Reformat cli_line_dirs 2025-04-22 13:57:18 +02:00
73ee16fe51 🚧 Support 'merged' GT+OCR line directories 2025-04-22 13:57:18 +02:00
6980d7a252 🚧 Use our own removesuffix() as we still support Python 3.8 2025-04-22 13:57:18 +02:00
2bf2529c38 🚧 Port new line dir functions 2025-04-22 13:57:17 +02:00
ad8e6de36b 🐛 cli_line_dirs: Fix character diff reports 2025-04-22 13:57:17 +02:00
4024e350f7 🚧 Test new flexible line dirs functions 2025-04-22 13:57:17 +02:00
3c317cbeaf
Merge pull request #141 from qurator-spk/chore/update-pre-commit
⚙  pre-commit: update
2025-04-22 12:35:14 +02:00
d8403421fc ⚙ pre-commit: update 2025-04-22 12:30:47 +02:00
3305043234
Merge pull request #140 from qurator-spk/fix/vendor-strings
🐛 Fix vendor strings
2025-04-22 11:50:29 +02:00
6bf5bd7178 🐛 Fix vendor strings 2025-04-22 11:48:44 +02:00
817e0c95f7 📦 v0.10.1 2025-04-22 10:32:29 +02:00
3d7c7ee1e3
Merge pull request #139 from bertsky/allow-uniseg-py38
re-allow uniseg 0.8 and py38
2025-04-22 10:09:51 +02:00
Robert Sachunsky
a24623b966 re-allow py38 2025-04-17 16:47:13 +02:00
Robert Sachunsky
ea33602336 CI: reactivate py38 2025-04-17 16:12:42 +02:00
Robert Sachunsky
64444dd419 opt out of 7f8a8dd5 (uniseg update that requires py39) 2025-04-17 16:12:37 +02:00
f6dfb77f94 🐛 pyproject.toml: Fix description 2025-04-17 08:51:32 +02:00
ef817cb343 📦 v0.10.0 2025-04-17 08:37:37 +02:00
b1c109baae
Merge pull request #128 from kba/v3-api
V3 api
2025-04-17 08:34:51 +02:00
13ab1ae150 🐛 Docker: Use same vendor as license for now 2025-04-17 08:26:36 +02:00
d974369e13 🐛 Docker: Fix description 2025-04-17 08:10:56 +02:00
b7bdca4ac8 🐛 Makefile: Make phony targets .PHONY 2025-04-17 08:09:06 +02:00
kba
831a24fc4c typo: report_prefix -> file_id 2025-04-17 08:04:52 +02:00
Konstantin Baierer
f6a2c94520 ocrd_cli: but do check for existing output files
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
2025-04-17 08:04:52 +02:00