Gerber, Mike
195354c6d4
Merge branch 'feat/compare-line-texts'
continuous-integration/drone/push Build encountered an error
Details
3 years ago
Gerber, Mike
8a3f5e48c2
🐛 dinglehopper: Patch word_break only once
...
continuous-integration/drone/push Build encountered an error
Details
Previously, we (accidently) patched uniseg's word_break on every call
to words(). Do it only once.
3 years ago
Gerber, Mike
b6bde2b7ec
📝 dinglehopper: Document dinglehopper-line-dirs in the README
continuous-integration/drone/push Build encountered an error
Details
3 years ago
Gerber, Mike
f77ce857b2
🚧 dinglehopper: Sahre json_float code
continuous-integration/drone/push Build encountered an error
Details
3 years ago
Gerber, Mike
5b394649a7
🚧 dinglehopper: Compute WER in line-dirs CLI
3 years ago
Gerber, Mike
cb2be96179
🚧 dinglehopper: Add word differences in line-dirs report
3 years ago
Gerber, Mike
dbb660615a
🚧 dinglehopper: Compare line text directories (WIP)
continuous-integration/drone/push Build encountered an error
Details
3 years ago
Gerber, Mike
a018006f98
🚧 dinglehopper: Compare line text directories (WIP)
3 years ago
Gerber, Mike
36b36f6986
🚧 dinglehopper: Compare line text directories (WIP)
3 years ago
Gerber, Mike
f0f3cd2d96
⬆️ dinglehopper: Require rapidfuzz >= 1.9.1
...
continuous-integration/drone/push Build encountered an error
Details
See https://github.com/qurator-spk/dinglehopper/issues/64 .
3 years ago
Gerber, Mike
a5c9c7438f
💩 ocrd-galley: Work around OCR-D/core#730
...
continuous-integration/drone/push Build is passing
Details
OCR-D/core currently needs six until the next relaase. Fix the build by
requiring it here.
3 years ago
Gerber, Mike
7d26b049d1
Merge branch 'fix/ci-py310'
3 years ago
Gerber, Mike
51a44895dc
⬆️ CircleCI: Add Python 3.10
3 years ago
Gerber, Mike
1f8fa5176f
Revert " ⬆️ CircleCI: Add Python 3.10"
...
This reverts commit b2b21839c2
.
3 years ago
Gerber, Mike
b2b21839c2
⬆️ CircleCI: Add Python 3.10
3 years ago
Gerber, Mike
7d85e21cbc
⬆️ CircleCI: Switch to the new cimg/python image
3 years ago
Gerber, Mike
dea0c53f88
Merge branch 'rapidfuzz'
3 years ago
Gerber, Mike
06ea38449c
📝 dinglehopper: Update Levenshtein notebook
3 years ago
Gerber, Mike
3ee688001a
🧹 dinglehopper: Directly import levenshtein() from rapidfuzz
3 years ago
Gerber, Mike
5d496df267
⚡ dinglehopper: Remove tests that only test rapidfuzz's levenshtein()
3 years ago
Gerber, Mike
091f069b3c
⚡ dinglehopper: Remove tests that only test rapidfuzz's levenshtein_ops()
3 years ago
Gerber, Mike
af8da1d716
⚡ dinglehopper: Use rapidfuzz for editops
3 years ago
Gerber, Mike
249787686f
Merge branch 'master' of github.com:qurator-spk/dinglehopper
continuous-integration/drone/push Build is failing
Details
4 years ago
Gerber, Mike
2a6cc5823e
🐛 dinglehopper: Call initLogging before logging
...
When using ocrd_utils' getLogger(), we need to call initLogging() before doing any
logging.
Fixes #55 .
4 years ago
Mike Gerber
0b9af3a21e
Merge pull request #58 from kba/unorderedgroupindexed
...
continuous-integration/drone/push Build is passing
Details
ReadingOrder may also contain UnorderedGroupIndexed
4 years ago
Konstantin Baierer
7fde00d911
ReadingOrder may also contain UnorderedGroupIndexed
4 years ago
Gerber, Mike
1778b36a9a
🚧 dinglehopper: Read PAGE UnorderedGroup in XML order
4 years ago
Gerber, Mike
bd324331e6
🚧 dinglehopper: Try out Drone CI
continuous-integration/drone/push Build is passing
Details
4 years ago
Gerber, Mike
a59ecb795c
🚧 dinglehopper: Try out Drone CI
continuous-integration/drone/push Build is failing
Details
4 years ago
Gerber, Mike
14230e073a
🚧 dinglehopper: Try out Drone CI
4 years ago
Gerber, Mike
985666a71c
🚧 dinglehopper: Try out Drone CI
4 years ago
Gerber, Mike
4a73053cfc
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
e3d4493c82
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
27f4c3bdf8
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
8533e6d421
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
e8da8b63f8
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
3b7a1a5631
🚧 Replace Travis with CircleCI
4 years ago
Mike Gerber
691ce371ca
Merge pull request #50 from b2m/fix-table-extraction
...
Fix the extraction of text from Page with TableRegion
4 years ago
Benjamin Rosemann
a68fc269d9
Fix the extraction of text from Page with TableRegion
...
Dinglehopper did not consider `OrderedGroupIndex` in the `ReadingOrder`
element when extracting text regions. As a consequence a `TableRegion`
was not considered for text extraction.
4 years ago
Gerber, Mike
8cd8314c8a
🐛 dinglehopper: Bump up ocrd req for zip_input_files
...
See also GH-49.
4 years ago
Mike Gerber
62670dd0c7
Merge pull request #49 from kba/zip_input_files
...
ocrd cli: use core-provided zip_input_files method
4 years ago
Konstantin Baierer
74e0ac18ed
ocrd cli: use core-provided zip_input_files method
4 years ago
Gerber, Mike
389e253c11
🐛 dinglehopper: Fix alto_extract_lines()'s type annotation
4 years ago
Gerber, Mike
fe3923a8af
🐛 dinglehopper: Fix alto_extract()'s type annotation
4 years ago
Gerber, Mike
132f91d500
✔️ dinglehopper: Add missing integration test markers
4 years ago
Gerber, Mike
c48d7646df
📝 dinglehopper: README-DEV: Massage markdown a bit
4 years ago
Mike Gerber
fed021090d
Merge pull request #46 from b2m/tool-changes
...
Tool changes
4 years ago
Benjamin Rosemann
cb1ac9d260
Add black to developer requirements.
4 years ago
Benjamin Rosemann
03ad413f4a
Added some helpful tools and configurations
4 years ago
Benjamin Rosemann
5cbd4f3d95
Preparation for black code formatter
4 years ago