Commit Graph

270 Commits (e543438496ea0b8e2d7ae75ada56ff47fcabd557)
 

Author SHA1 Message Date
Max Bachmann e543438496 replace usage of deprecated rapidfuzz APIs 2 years ago
Mike Gerber 1febea8c92
Merge pull request #66 from stweil/master
continuous-integration/drone/push Build is passing Details
Ignore Python build artifacts
3 years ago
Stefan Weil 101f50ec88 Ignore Python build artifacts
Signed-off-by: Stefan Weil <sw@weilnetz.de>
3 years ago
Gerber, Mike edc24cd4db ✔️ DroneCI: Build on Python 3.6 → 3.10
continuous-integration/drone/push Build is passing Details
3 years ago
Gerber, Mike d726396002 👷🏾‍♂️ Remove str() on Path objects
As of Python 3.6 we don't need to call str() on Path objects anymore.

See also gh-20.
3 years ago
Gerber, Mike a19224dc46 ✔️ CircleCI: Stop testing using Python 3.5
The latest rapidfuzz updates broke Python 3.5 support. As it is EOL for some time now,
we are stopping testing with it.

See also gh-65 and gh-20.
3 years ago
Gerber, Mike 76bacc0f15 🐛 Bump rapidfuzz dep to >= 2.0.5 (Fixes gh-65) 3 years ago
Gerber, Mike 195354c6d4 Merge branch 'feat/compare-line-texts'
continuous-integration/drone/push Build encountered an error Details
3 years ago
Gerber, Mike 8a3f5e48c2 🐛 dinglehopper: Patch word_break only once
continuous-integration/drone/push Build encountered an error Details
Previously, we (accidently) patched uniseg's word_break on every call
to words(). Do it only once.
3 years ago
Gerber, Mike b6bde2b7ec 📝 dinglehopper: Document dinglehopper-line-dirs in the README
continuous-integration/drone/push Build encountered an error Details
3 years ago
Gerber, Mike f77ce857b2 🚧 dinglehopper: Sahre json_float code
continuous-integration/drone/push Build encountered an error Details
3 years ago
Gerber, Mike 5b394649a7 🚧 dinglehopper: Compute WER in line-dirs CLI 3 years ago
Gerber, Mike cb2be96179 🚧 dinglehopper: Add word differences in line-dirs report 3 years ago
Gerber, Mike dbb660615a 🚧 dinglehopper: Compare line text directories (WIP)
continuous-integration/drone/push Build encountered an error Details
3 years ago
Gerber, Mike a018006f98 🚧 dinglehopper: Compare line text directories (WIP) 3 years ago
Gerber, Mike 36b36f6986 🚧 dinglehopper: Compare line text directories (WIP) 3 years ago
Gerber, Mike f0f3cd2d96 ⬆️ dinglehopper: Require rapidfuzz >= 1.9.1
continuous-integration/drone/push Build encountered an error Details
See https://github.com/qurator-spk/dinglehopper/issues/64.
3 years ago
Gerber, Mike a5c9c7438f 💩 ocrd-galley: Work around OCR-D/core#730
continuous-integration/drone/push Build is passing Details
OCR-D/core currently needs six until the next relaase. Fix the build by
requiring it here.
3 years ago
Gerber, Mike 7d26b049d1 Merge branch 'fix/ci-py310' 3 years ago
Gerber, Mike 51a44895dc ⬆️ CircleCI: Add Python 3.10 3 years ago
Gerber, Mike 1f8fa5176f Revert "⬆️ CircleCI: Add Python 3.10"
This reverts commit b2b21839c2.
3 years ago
Gerber, Mike b2b21839c2 ⬆️ CircleCI: Add Python 3.10 3 years ago
Gerber, Mike 7d85e21cbc ⬆️ CircleCI: Switch to the new cimg/python image 3 years ago
Gerber, Mike dea0c53f88 Merge branch 'rapidfuzz' 3 years ago
Gerber, Mike 06ea38449c 📝 dinglehopper: Update Levenshtein notebook 3 years ago
Gerber, Mike 3ee688001a 🧹 dinglehopper: Directly import levenshtein() from rapidfuzz 3 years ago
Gerber, Mike 5d496df267 dinglehopper: Remove tests that only test rapidfuzz's levenshtein() 3 years ago
Gerber, Mike 091f069b3c dinglehopper: Remove tests that only test rapidfuzz's levenshtein_ops() 3 years ago
Gerber, Mike af8da1d716 dinglehopper: Use rapidfuzz for editops 3 years ago
Gerber, Mike 249787686f Merge branch 'master' of github.com:qurator-spk/dinglehopper
continuous-integration/drone/push Build is failing Details
4 years ago
Gerber, Mike 2a6cc5823e 🐛 dinglehopper: Call initLogging before logging
When using ocrd_utils' getLogger(), we need to call initLogging() before doing any
logging.

Fixes #55.
4 years ago
Mike Gerber 0b9af3a21e
Merge pull request #58 from kba/unorderedgroupindexed
continuous-integration/drone/push Build is passing Details
ReadingOrder may also contain UnorderedGroupIndexed
4 years ago
Konstantin Baierer 7fde00d911 ReadingOrder may also contain UnorderedGroupIndexed 4 years ago
Gerber, Mike 1778b36a9a 🚧 dinglehopper: Read PAGE UnorderedGroup in XML order 4 years ago
Gerber, Mike bd324331e6 🚧 dinglehopper: Try out Drone CI
continuous-integration/drone/push Build is passing Details
4 years ago
Gerber, Mike a59ecb795c 🚧 dinglehopper: Try out Drone CI
continuous-integration/drone/push Build is failing Details
4 years ago
Gerber, Mike 14230e073a 🚧 dinglehopper: Try out Drone CI 4 years ago
Gerber, Mike 985666a71c 🚧 dinglehopper: Try out Drone CI 4 years ago
Gerber, Mike 4a73053cfc 🚧 Replace Travis with CircleCI 4 years ago
Gerber, Mike e3d4493c82 🚧 Replace Travis with CircleCI 4 years ago
Gerber, Mike 27f4c3bdf8 🚧 Replace Travis with CircleCI 4 years ago
Gerber, Mike 8533e6d421 🚧 Replace Travis with CircleCI 4 years ago
Gerber, Mike e8da8b63f8 🚧 Replace Travis with CircleCI 4 years ago
Gerber, Mike 3b7a1a5631 🚧 Replace Travis with CircleCI 4 years ago
Mike Gerber 691ce371ca
Merge pull request #50 from b2m/fix-table-extraction
Fix the extraction of text from Page with TableRegion
4 years ago
Benjamin Rosemann a68fc269d9 Fix the extraction of text from Page with TableRegion
Dinglehopper did not consider `OrderedGroupIndex` in the `ReadingOrder`
element when extracting text regions. As a consequence a `TableRegion`
was not considered for text extraction.
4 years ago
Gerber, Mike 8cd8314c8a 🐛 dinglehopper: Bump up ocrd req for zip_input_files
See also GH-49.
4 years ago
Mike Gerber 62670dd0c7
Merge pull request #49 from kba/zip_input_files
ocrd cli: use core-provided zip_input_files method
4 years ago
Konstantin Baierer 74e0ac18ed ocrd cli: use core-provided zip_input_files method 4 years ago
Gerber, Mike 389e253c11 🐛 dinglehopper: Fix alto_extract_lines()'s type annotation 4 years ago