462 Commits (master)
 

Author SHA1 Message Date
Mike Gerber b336f98271 🐛 Fix reading plain text files
As reported by @tallemeersch in gh-107, newlines were not removed for plain text files.
Fix this by stripping the lines as suggested.

Fixes gh-107.
5 days ago
Mike Gerber 41a0fad352 📦 v0.9.6 5 days ago
Mike Gerber e72d1e37ea Revert "✔ Test on Python 3.13"
This reverts commit 0d5c6d5a62.
5 days ago
Mike Gerber 86e723cd53 🐛 GHA: Install possible shapely build requirements (if building from source) 5 days ago
Mike Gerber dc4565fd2d Merge pull request #111 from stweil/typos
Fix some typos (found by `codespell` and `typos`)
5 days ago
Mike Gerber fbcb9160fd 🐛 GHA: Install possible lxml build requirements (if building from source) 5 days ago
Mike Gerber 0d5c6d5a62 ✔ Test on Python 3.13 5 days ago
Mike Gerber e34adbf41c 🐛 Fix Python 3.12 support by requiring ocrd >= 2.65.0 5 days ago
Mike Gerber 58a688b175 ⚙ pre-commit: Update hooks 5 days ago
Stefan Weil 79701e410d Fix some typos (found by `codespell` and `typos`)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
2 weeks ago
Mike Gerber 2383730a55 ✔ Test using empty files
Test edge cases + empty files, e.g. empty text content and a Unicode BOM character.

See also gh-79.
1 month ago
Mike Gerber 98d7928f45 ⚙ pre-commit: Update hooks 1 month ago
Mike Gerber edabffec7e 🧹 tests: Move comment out of the code (bad style + weird formatting) 1 month ago
Mike Gerber 32d4037533 ⚙ cli: Annotate types in process_dir() 1 month ago
Mike Gerber fe1a713d55 ⚙ pre-commit: Update hooks 1 month ago
Mike Gerber be7c1dd25d 🧹 Make from_text_segment()'s textequiv_level keyword-only 1 month ago
Mike Gerber 932bfafc7d 🧹 Make process_dir() keyword arguments keyword-only 2 months ago
Mike Gerber 945aec5673 ✒ README-DEV: Releasing a new version 2 months ago
Mike Gerber c29a80bc81 📦 v0.9.5 2 months ago
Mike Gerber a1c1d0ad49 ⚙ pre-commit: Add mypy dependencies
Closes gh-106.
2 months ago
Mike Gerber 5d9f0c482f 🐛 Check that we always get a valid ALTO namespace (satisfies mypy) 2 months ago
Mike Gerber 19d1a00817 🎨 Reformat (Black) 2 months ago
Mike Gerber 4dc6b7dc04 ⚙ pre-commit: Update hooks 2 months ago
Mike Gerber 6b3697c864 Merge branch 'master' of https://github.com/qurator-spk/dinglehopper 2 months ago
Mike Gerber 4d4ead4cc8 🐛 Fix word segmentation with uniseg 0.8.0 2 months ago
Mike Gerber 0e3d24cac1 🐛 README.md: Fix badge (for real) 2 months ago
Mike Gerber 4016c01638 🐛 README.md: Fix test badge 2 months ago
Mike Gerber 4b64398cec 🚧 GitLab CI Test: Depend on child pipeline 4 months ago
Mike Gerber 7e033b6f03 🚧 GitLab CI Test: Depend on child pipeline 4 months ago
Mike Gerber 250ee2b7f2 🚧 GitLab CI Test: Push after pulling 4 months ago
Mike Gerber 76c4533aa5 🚧 GitLab CI Test: Push after pulling 4 months ago
Mike Gerber f8e31089b3 🚧 GitLab CI Test: Push after pulling 4 months ago
Mike Gerber 6cfb49fe39 🚧 GitLab CI Test: Push after pulling 4 months ago
Mike Gerber 5eba65f097 🚧 GitLab CI Test: Trigger only on default branch (and do not hardcode it) 4 months ago
Mike Gerber 83cef3106f 🚧 GitLab CI Test 4 months ago
Mike Gerber a95a85a889 🚧 GitLab CI Test 4 months ago
Mike Gerber ff34c65c1e 🔍 ruff: Remove ignore configuration, we use multimethods in a compatible way now 4 months ago
Mike Gerber 21c44d426e ⚙ pre-commit: Update hooks 4 months ago
Mike Gerber 10ccba989e 🚧 GitLab CI Test 4 months ago
Mike Gerber 10d423f045 🚧 GitLab CI Test 4 months ago
Mike Gerber 6d947a9ca9 🚧 GitLab CI Test 4 months ago
Mike Gerber 484da90d27 🚧 GitLab CI Test 4 months ago
Mike Gerber d0ddfa68a1 🚧 GitLab CI Test 4 months ago
Mike Gerber 81391132f0 🚧 GitLab CI Test 4 months ago
Mike Gerber dc390cd3f8 🚧 GitLab CI Test 4 months ago
Mike Gerber c77e8f51ab 🚧 GitLab CI Test 4 months ago
Mike Gerber e083688c66 🚧 GitLab CI Test 4 months ago
Mike Gerber 6d8afc27b3 🚧 GitLab CI Test 4 months ago
Mike Gerber af83b35f23 🚧 GitLab CI Test 4 months ago
Mike Gerber 344f96dca9 🚧 GitLab CI Test 4 months ago