Commit Graph

502 Commits (d974369e13e3bf5f20e24084a27b912430717150)
 

Author SHA1 Message Date
Mike Gerber 2383730a55 ✔ Test using empty files
Test edge cases + empty files, e.g. empty text content and a Unicode BOM character.

See also gh-79.
Mike Gerber 98d7928f45 ⚙ pre-commit: Update hooks
Mike Gerber edabffec7e 🧹 tests: Move comment out of the code (bad style + weird formatting)
Mike Gerber 32d4037533 ⚙ cli: Annotate types in process_dir()
Mike Gerber fe1a713d55 ⚙ pre-commit: Update hooks
Mike Gerber be7c1dd25d 🧹 Make from_text_segment()'s textequiv_level keyword-only
Mike Gerber 932bfafc7d 🧹 Make process_dir() keyword arguments keyword-only
Mike Gerber 945aec5673 ✒ README-DEV: Releasing a new version
Mike Gerber c29a80bc81 📦 v0.9.5
Mike Gerber a1c1d0ad49 ⚙ pre-commit: Add mypy dependencies
Closes gh-106.
Mike Gerber 5d9f0c482f 🐛 Check that we always get a valid ALTO namespace (satisfies mypy)
Mike Gerber 19d1a00817 🎨 Reformat (Black)
Mike Gerber 4dc6b7dc04 ⚙ pre-commit: Update hooks
Mike Gerber 6b3697c864 Merge branch 'master' of https://github.com/qurator-spk/dinglehopper
Mike Gerber 4d4ead4cc8 🐛 Fix word segmentation with uniseg 0.8.0
Mike Gerber 0e3d24cac1 🐛 README.md: Fix badge (for real)
Mike Gerber 4016c01638 🐛 README.md: Fix test badge
Mike Gerber 4b64398cec 🚧 GitLab CI Test: Depend on child pipeline
Mike Gerber 7e033b6f03 🚧 GitLab CI Test: Depend on child pipeline
Mike Gerber 250ee2b7f2 🚧 GitLab CI Test: Push after pulling
Mike Gerber 76c4533aa5 🚧 GitLab CI Test: Push after pulling
Mike Gerber f8e31089b3 🚧 GitLab CI Test: Push after pulling
Mike Gerber 6cfb49fe39 🚧 GitLab CI Test: Push after pulling
Mike Gerber 5eba65f097 🚧 GitLab CI Test: Trigger only on default branch (and do not hardcode it)
Mike Gerber 83cef3106f 🚧 GitLab CI Test
Mike Gerber a95a85a889 🚧 GitLab CI Test
Mike Gerber ff34c65c1e 🔍 ruff: Remove ignore configuration, we use multimethods in a compatible way now
Mike Gerber 21c44d426e ⚙ pre-commit: Update hooks
Mike Gerber 10ccba989e 🚧 GitLab CI Test
Mike Gerber 10d423f045 🚧 GitLab CI Test
Mike Gerber 6d947a9ca9 🚧 GitLab CI Test
Mike Gerber 484da90d27 🚧 GitLab CI Test
Mike Gerber d0ddfa68a1 🚧 GitLab CI Test
Mike Gerber 81391132f0 🚧 GitLab CI Test
Mike Gerber dc390cd3f8 🚧 GitLab CI Test
Mike Gerber c77e8f51ab 🚧 GitLab CI Test
Mike Gerber e083688c66 🚧 GitLab CI Test
Mike Gerber 6d8afc27b3 🚧 GitLab CI Test
Mike Gerber af83b35f23 🚧 GitLab CI Test
Mike Gerber 344f96dca9 🚧 GitLab CI Test
Mike Gerber 483e809691 🔍 mypy: Use an almost strict mypy configuration, and fix any issues
Mike Gerber ad316aeabc 🔍 mypy: Use a compatible syntax for multimethod
Mike Gerber 8166435958 🔍 mypy: Remove ExtractedText.segments converter
Mike Gerber 24c25b6fcd 🔍 mypy: Avoid using check() for all attr validators
Mike Gerber ac9d360dcd 🔍 mypy: Make cli.process() typed so mypy checks it (and issues no warning)
Mike Gerber 788868b2ac Merge branch 'pr103'
Mike Gerber 59a3882ce5 🧹 GitHub Actions: Clean up whitespace
Sadra Barikbin 4466422cda Fix a typo
Sadra Barikbin 967f833eac Improve report
Sadra Barikbin f4ff6a8f31 Change reporter