Robert Sachunsky
2aafd098fd
Merge b1ef3af1a8
into b5e99d96c9
2025-05-02 00:19:57 +02:00
Robert Sachunsky
b1ef3af1a8
docker: use latest core base stage
2025-05-02 00:19:11 +02:00
Robert Sachunsky
d09e3969f8
docker: prepackage ocrd-all-module-dir.json
2025-05-02 00:19:11 +02:00
b5e99d96c9
Merge pull request #144 from qurator-spk/fix/make-test-results-clearer
...
✔ GitHub Actions: Make reporting results clearer
2025-04-25 11:31:29 +02:00
774790c36f
✔ GitHub Actions: Make reporting results clearer
...
In the "Actions" tab on GitHub, the workflow run that would post test results to the
_original_ workflow run is named "Test Report". This would lead me to click on it to see
the results, just to be disappointed.
This aims to make the naming of the GitHub workflows/jobs clearer.
2025-04-25 11:20:00 +02:00
addb572922
Merge pull request #143 from qurator-spk/chore/update-pre-commit
...
⚙ pre-commit: update
2025-04-25 10:14:30 +02:00
1ebb004386
⚙ pre-commit: update
2025-04-25 10:13:06 +02:00
c3aa48ec3b
Merge branch 'master' of https://github.com/qurator-spk/dinglehopper
2025-04-24 17:16:06 +02:00
628594ef98
📦 v0.11.0
2025-04-24 17:14:44 +02:00
d7814db705
Merge pull request #142 from qurator-spk/feat/flex-line-dirs
...
Feat/flex line dirs
2025-04-24 16:48:22 +02:00
5639f3db7f
✔ Add a test that checks if plain text files with BOM are read correctly
2025-04-24 16:44:29 +02:00
9fc8937324
✒ README: Mention dinglehopper-line-dirs --help
2025-04-24 15:13:19 +02:00
14a4bc56d8
🐛 Add --plain-encoding option to dinglehopper-extract
2025-04-22 18:24:35 +02:00
a70260c10e
🐛 Use warning() to fix DeprecationWarning
2025-04-22 13:57:19 +02:00
224aa02163
🚧 Fix help text
2025-04-22 13:57:19 +02:00
9db5b4caf5
🚧 Add OCR-D parameter for plain text encoding
2025-04-22 13:57:19 +02:00
5578ce83a3
🚧 Add option for text encoding to line dir cli
2025-04-22 13:57:19 +02:00
cf59b951a3
🚧 Add option for text encoding to line dir cli
2025-04-22 13:57:19 +02:00
480b3cf864
✔ Test that CLI produces a complete HTML report
2025-04-22 13:57:19 +02:00
f1a586cff1
✔ Test line dirs CLI
2025-04-22 13:57:18 +02:00
3b16c14c16
✔ Properly test line dir finding
2025-04-22 13:57:18 +02:00
322faeb26c
🎨 Sort imports
2025-04-22 13:57:18 +02:00
c37316da09
🐛 cli_line_dirs: Fix word differences section
...
At the time of generation of the section, the {gt,ocr}_words generators
were drained. Fix by using a list.
Fixes gh-124.
2025-04-22 13:57:18 +02:00
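The generator problem described in commit c37316da09 can be illustrated with a minimal sketch. The names below (`words`, `gt_words`) are hypothetical and not taken from the dinglehopper code; the sketch only shows why a drained generator yields nothing on a second pass, and why materializing it as a list avoids that.

```python
def words(text):
    """Yield whitespace-separated words lazily (hypothetical helper)."""
    for word in text.split():
        yield word


# A generator can be iterated only once.
gt_words = words("the quick brown fox")
first_pass = list(gt_words)   # consumes the generator
second_pass = list(gt_words)  # generator is already drained, nothing left

# Materializing the words as a list allows repeated iteration.
gt_words_list = list(words("the quick brown fox"))
first_pass_list = list(gt_words_list)
second_pass_list = list(gt_words_list)
```

With a list, a later report section can iterate over the same words again after an earlier step has already consumed them once.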
9414a92f9f
🐛 cli_line_dirs: Type-annotate functions
2025-04-22 13:57:18 +02:00
68344e48f8
🎨 Reformat cli_line_dirs
2025-04-22 13:57:18 +02:00
73ee16fe51
🚧 Support 'merged' GT+OCR line directories
2025-04-22 13:57:18 +02:00
6980d7a252
🚧 Use our own removesuffix() as we still support Python 3.8
2025-04-22 13:57:18 +02:00
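For context on commit 6980d7a252: `str.removesuffix()` was added in Python 3.9, so code that still supports Python 3.8 needs its own helper. The following is a minimal sketch of such a fallback, not the project's actual implementation; the signature and the example file name are assumptions.

```python
def removesuffix(text: str, suffix: str) -> str:
    """Return text without the trailing suffix, if present (sketch for Python 3.8)."""
    # The explicit check for an empty suffix matters: text[:-0] would be "".
    if suffix and text.endswith(suffix):
        return text[: -len(suffix)]
    return text


# Hypothetical usage: strip a ground-truth suffix from a line file name.
base = removesuffix("0001.gt.txt", ".gt.txt")
```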
2bf2529c38
🚧 Port new line dir functions
2025-04-22 13:57:17 +02:00
ad8e6de36b
🐛 cli_line_dirs: Fix character diff reports
2025-04-22 13:57:17 +02:00
4024e350f7
🚧 Test new flexible line dirs functions
2025-04-22 13:57:17 +02:00
3c317cbeaf
Merge pull request #141 from qurator-spk/chore/update-pre-commit
...
⚙ pre-commit: update
2025-04-22 12:35:14 +02:00
d8403421fc
⚙ pre-commit: update
2025-04-22 12:30:47 +02:00
3305043234
Merge pull request #140 from qurator-spk/fix/vendor-strings
...
🐛 Fix vendor strings
2025-04-22 11:50:29 +02:00
6bf5bd7178
🐛 Fix vendor strings
2025-04-22 11:48:44 +02:00
817e0c95f7
📦 v0.10.1
2025-04-22 10:32:29 +02:00
3d7c7ee1e3
Merge pull request #139 from bertsky/allow-uniseg-py38
...
re-allow uniseg 0.8 and py38
2025-04-22 10:09:51 +02:00
Robert Sachunsky
a24623b966
re-allow py38
2025-04-17 16:47:13 +02:00
Robert Sachunsky
ea33602336
CI: reactivate py38
2025-04-17 16:12:42 +02:00
Robert Sachunsky
64444dd419
opt out of 7f8a8dd5
(uniseg update that requires py39)
2025-04-17 16:12:37 +02:00
f6dfb77f94
🐛 pyproject.toml: Fix description
2025-04-17 08:51:32 +02:00
ef817cb343
📦 v0.10.0
2025-04-17 08:37:37 +02:00
b1c109baae
Merge pull request #128 from kba/v3-api
...
V3 api
2025-04-17 08:34:51 +02:00
13ab1ae150
🐛 Docker: Use same vendor as license for now
2025-04-17 08:26:36 +02:00
d974369e13
🐛 Docker: Fix description
2025-04-17 08:10:56 +02:00
b7bdca4ac8
🐛 Makefile: Make phony targets .PHONY
2025-04-17 08:09:06 +02:00
kba
831a24fc4c
typo: report_prefix -> file_id
2025-04-17 08:04:52 +02:00
Konstantin Baierer
f6a2c94520
ocrd_cli: but do check for existing output files
...
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
2025-04-17 08:04:52 +02:00
Konstantin Baierer
4162836612
ocrd_cli: no need to check fileGrp dir exists
...
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
2025-04-17 08:04:52 +02:00
Konstantin Baierer
c0aa82d188
OCR-D processor: properly handle missing or non-downloaded GT/OCR file
...
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
2025-04-17 08:04:51 +02:00
kba
8c1b6d65f5
Dockerfile: build ocrd-all-tool.json
2025-04-17 08:04:51 +02:00