1
0
Fork 0
mirror of https://github.com/qurator-spk/dinglehopper.git synced 2025-06-09 03:40:12 +02:00
Commit graph

489 commits

Author SHA1 Message Date
a09a624bde
Merge pull request #132 from qurator-spk/fix/uniseg-removed-index-parameter
🐛 Fix for changed API of uniseg's word_break
2025-04-16 09:28:31 +02:00
badfa9c99e ⚙ GitHub Actions: Don't test on Python 3.8 anymore 2025-04-16 09:25:44 +02:00
7f8a8dd564 🐛 Fix for changed API of uniseg's word_break 2025-04-16 09:10:43 +02:00
b72d4f5af9
Merge pull request #131 from qurator-spk/chore/update-pre-commit
⚙  pre-commit: update
2025-04-16 09:06:05 +02:00
058042accb ⚙ pre-commit: update 2025-04-16 08:59:58 +02:00
071e6a8bd1
Merge pull request #120 from joschrew/dockerfile
Add Dockerfile and Makefile to create ocr-d dockerimage
2024-10-11 18:04:22 +02:00
6b82293670
Update Dockerfile
I fancy-clicked @bertsky's change suggestion, which duplicated some labels. Now fancy-clicking the fix, fingers crossed...
2024-10-07 17:41:59 +02:00
6ecf49a355
Update Dockerfile
Co-authored-by: Robert Sachunsky <38561704+bertsky@users.noreply.github.com>
2024-10-07 17:39:42 +02:00
joschrew
9c7c104dce Add Dockerfile and Makefile to create ocr-d image 2024-10-02 15:29:36 +02:00
2e6fe0c279
Merge pull request #113 from qurator-spk/python-3.13
✔ Test on Python 3.13
2024-09-04 19:15:04 +02:00
1753ed4d13 ✔ Test on Python 3.13 2024-09-04 19:09:45 +02:00
3233dbcc8f ✔ pre-commit: Add license check 2024-07-22 16:54:33 +02:00
f2e290dffe 🐛 Fix --version option in OCR-D CLI 2024-07-19 14:54:46 +02:00
6d1daf1dfe Support --version option in CLI 2024-07-19 14:41:54 +02:00
27ad145c7e ⚙ pyproject.toml: Add license.file 2024-07-19 09:58:01 +02:00
2e9e88cc1e ⚙ pre-commit: Update hooks 2024-07-19 09:56:40 +02:00
129e6eb427 📦 v0.9.7 2024-07-11 17:25:38 +02:00
cf998443c1 ⚙ ruff: Update settings (select → lint.select) 2024-07-11 17:15:24 +02:00
6048107889 Merge branch 'master' of https://github.com/qurator-spk/dinglehopper 2024-07-11 16:26:29 +02:00
2ee37ed4e3 🎨 Sort imports 2024-07-11 16:25:38 +02:00
521f034fba
Merge pull request #116 from stweil/master
Fix typo
2024-07-10 01:13:24 +02:00
d1a2247615 ⚙ pre-commit: Update hooks 2024-07-09 21:07:59 +02:00
4047f8b6e5 🐛 Fix loading ocrd-tool.json for Python 3.12 2024-07-09 21:01:31 +02:00
Stefan Weil
cd68a973cb Fix typo
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-05-26 09:18:00 +02:00
bc5818da9f ✔ GitHub Actions: Update used actions 2024-05-14 15:56:08 +02:00
c91234daba ✔ GitHub Actions: Update used actions 2024-05-13 21:17:42 +02:00
a534b5e28e ⚙ pre-commit: Update hooks 2024-05-13 21:16:29 +02:00
b336f98271 🐛 Fix reading plain text files
As reported by @tallemeersch in gh-107, newlines were not removed for plain text files.
Fix this by stripping the lines as suggested.

Fixes gh-107.
2024-05-06 18:14:16 +02:00
41a0fad352 📦 v0.9.6 2024-05-06 17:48:48 +02:00
e72d1e37ea Revert "✔ Test on Python 3.13"
This reverts commit 0d5c6d5a62.
2024-05-06 17:42:09 +02:00
86e723cd53 🐛 GHA: Install possible shapely build requirements (if building from source) 2024-05-06 17:25:59 +02:00
dc4565fd2d
Merge pull request #111 from stweil/typos
Fix some typos (found by `codespell` and `typos`)
2024-05-06 17:23:31 +02:00
fbcb9160fd 🐛 GHA: Install possible lxml build requirements (if building from source) 2024-05-06 17:09:05 +02:00
0d5c6d5a62 ✔ Test on Python 3.13 2024-05-06 17:02:52 +02:00
e34adbf41c 🐛 Fix Python 3.12 support by requiring ocrd >= 2.65.0 2024-05-06 16:59:18 +02:00
58a688b175 ⚙ pre-commit: Update hooks 2024-05-06 16:57:53 +02:00
Stefan Weil
79701e410d Fix some typos (found by codespell and typos)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2024-04-29 08:42:17 +02:00
2383730a55 ✔ Test using empty files
Test edge cases + empty files, e.g. empty text content and a Unicode BOM character.

See also gh-79.
2024-04-08 20:33:03 +02:00
98d7928f45 ⚙ pre-commit: Update hooks 2024-04-08 20:27:47 +02:00
edabffec7e 🧹 tests: Move comment out of the code (bad style + weird formatting) 2024-04-04 19:46:08 +02:00
32d4037533 ⚙ cli: Annotate types in process_dir() 2024-04-04 19:38:27 +02:00
fe1a713d55 ⚙ pre-commit: Update hooks 2024-04-04 19:33:47 +02:00
be7c1dd25d 🧹 Make from_text_segment()'s textequiv_level keyword-only 2024-03-27 21:09:34 +01:00
932bfafc7d 🧹 Make process_dir() keyword arguments keyword-only 2024-03-27 19:44:09 +01:00
945aec5673 ✒ README-DEV: Releasing a new version 2024-03-27 19:01:49 +01:00
c29a80bc81 📦 v0.9.5 2024-03-27 18:49:13 +01:00
a1c1d0ad49 ⚙ pre-commit: Add mypy dependencies
Closes gh-106.
2024-03-27 18:32:49 +01:00
5d9f0c482f 🐛 Check that we always get a valid ALTO namespace (satifies mypy) 2024-03-27 17:57:53 +01:00
19d1a00817 🎨 Reformat (Black) 2024-03-27 17:36:05 +01:00
4dc6b7dc04 ⚙ pre-commit: Update hooks 2024-03-26 19:40:07 +01:00