Mike Gerber
85f751aacc
Merge pull request #67 from maxbachmann/rapidfuzz
...
replace usage of deprecated rapidfuzz APIs
2 years ago
Max Bachmann
e543438496
replace usage of deprecated rapidfuzz APIs
2 years ago
Mike Gerber
1febea8c92
Merge pull request #66 from stweil/master
...
continuous-integration/drone/push Build is passing
Details
Ignore Python build artifacts
3 years ago
Stefan Weil
101f50ec88
Ignore Python build artifacts
...
Signed-off-by: Stefan Weil <sw@weilnetz.de>
3 years ago
Gerber, Mike
edc24cd4db
✔️ DroneCI: Build on Python 3.6 → 3.10
continuous-integration/drone/push Build is passing
Details
3 years ago
Gerber, Mike
d726396002
👷🏾♂️ Remove str() on Path objects
...
As of Python 3.6 we don't need to call str() on Path objects anymore.
See also gh-20.
3 years ago
Gerber, Mike
a19224dc46
✔️ CircleCI: Stop testing using Python 3.5
...
The latest rapidfuzz updates broke Python 3.5 support. As it is EOL for some time now,
we are stopping testing with it.
See also gh-65 and gh-20.
3 years ago
Gerber, Mike
76bacc0f15
🐛 Bump rapidfuzz dep to >= 2.0.5 (Fixes gh-65)
3 years ago
Gerber, Mike
195354c6d4
Merge branch 'feat/compare-line-texts'
continuous-integration/drone/push Build encountered an error
Details
3 years ago
Gerber, Mike
8a3f5e48c2
🐛 dinglehopper: Patch word_break only once
...
continuous-integration/drone/push Build encountered an error
Details
Previously, we (accidently) patched uniseg's word_break on every call
to words(). Do it only once.
3 years ago
Gerber, Mike
b6bde2b7ec
📝 dinglehopper: Document dinglehopper-line-dirs in the README
continuous-integration/drone/push Build encountered an error
Details
3 years ago
Gerber, Mike
f77ce857b2
🚧 dinglehopper: Sahre json_float code
continuous-integration/drone/push Build encountered an error
Details
3 years ago
Gerber, Mike
5b394649a7
🚧 dinglehopper: Compute WER in line-dirs CLI
3 years ago
Gerber, Mike
cb2be96179
🚧 dinglehopper: Add word differences in line-dirs report
3 years ago
Gerber, Mike
dbb660615a
🚧 dinglehopper: Compare line text directories (WIP)
continuous-integration/drone/push Build encountered an error
Details
3 years ago
Gerber, Mike
a018006f98
🚧 dinglehopper: Compare line text directories (WIP)
3 years ago
Gerber, Mike
36b36f6986
🚧 dinglehopper: Compare line text directories (WIP)
3 years ago
Gerber, Mike
f0f3cd2d96
⬆️ dinglehopper: Require rapidfuzz >= 1.9.1
...
continuous-integration/drone/push Build encountered an error
Details
See https://github.com/qurator-spk/dinglehopper/issues/64 .
3 years ago
Gerber, Mike
a5c9c7438f
💩 ocrd-galley: Work around OCR-D/core#730
...
continuous-integration/drone/push Build is passing
Details
OCR-D/core currently needs six until the next relaase. Fix the build by
requiring it here.
3 years ago
Gerber, Mike
7d26b049d1
Merge branch 'fix/ci-py310'
3 years ago
Gerber, Mike
51a44895dc
⬆️ CircleCI: Add Python 3.10
3 years ago
Gerber, Mike
1f8fa5176f
Revert " ⬆️ CircleCI: Add Python 3.10"
...
This reverts commit b2b21839c2
.
3 years ago
Gerber, Mike
b2b21839c2
⬆️ CircleCI: Add Python 3.10
3 years ago
Gerber, Mike
7d85e21cbc
⬆️ CircleCI: Switch to the new cimg/python image
3 years ago
Gerber, Mike
dea0c53f88
Merge branch 'rapidfuzz'
3 years ago
Gerber, Mike
06ea38449c
📝 dinglehopper: Update Levenshtein notebook
3 years ago
Gerber, Mike
3ee688001a
🧹 dinglehopper: Directly import levenshtein() from rapidfuzz
3 years ago
Gerber, Mike
5d496df267
⚡ dinglehopper: Remove tests that only test rapidfuzz's levenshtein()
3 years ago
Gerber, Mike
091f069b3c
⚡ dinglehopper: Remove tests that only test rapidfuzz's levenshtein_ops()
3 years ago
Gerber, Mike
af8da1d716
⚡ dinglehopper: Use rapidfuzz for editops
3 years ago
Gerber, Mike
249787686f
Merge branch 'master' of github.com:qurator-spk/dinglehopper
continuous-integration/drone/push Build is failing
Details
4 years ago
Gerber, Mike
2a6cc5823e
🐛 dinglehopper: Call initLogging before logging
...
When using ocrd_utils' getLogger(), we need to call initLogging() before doing any
logging.
Fixes #55 .
4 years ago
Mike Gerber
0b9af3a21e
Merge pull request #58 from kba/unorderedgroupindexed
...
continuous-integration/drone/push Build is passing
Details
ReadingOrder may also contain UnorderedGroupIndexed
4 years ago
Konstantin Baierer
7fde00d911
ReadingOrder may also contain UnorderedGroupIndexed
4 years ago
Gerber, Mike
1778b36a9a
🚧 dinglehopper: Read PAGE UnorderedGroup in XML order
4 years ago
Gerber, Mike
bd324331e6
🚧 dinglehopper: Try out Drone CI
continuous-integration/drone/push Build is passing
Details
4 years ago
Gerber, Mike
a59ecb795c
🚧 dinglehopper: Try out Drone CI
continuous-integration/drone/push Build is failing
Details
4 years ago
Gerber, Mike
14230e073a
🚧 dinglehopper: Try out Drone CI
4 years ago
Gerber, Mike
985666a71c
🚧 dinglehopper: Try out Drone CI
4 years ago
Gerber, Mike
4a73053cfc
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
e3d4493c82
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
27f4c3bdf8
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
8533e6d421
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
e8da8b63f8
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
3b7a1a5631
🚧 Replace Travis with CircleCI
4 years ago
Mike Gerber
691ce371ca
Merge pull request #50 from b2m/fix-table-extraction
...
Fix the extraction of text from Page with TableRegion
4 years ago
Benjamin Rosemann
a68fc269d9
Fix the extraction of text from Page with TableRegion
...
Dinglehopper did not consider `OrderedGroupIndex` in the `ReadingOrder`
element when extracting text regions. As a consequence a `TableRegion`
was not considered for text extraction.
4 years ago
Gerber, Mike
8cd8314c8a
🐛 dinglehopper: Bump up ocrd req for zip_input_files
...
See also GH-49.
4 years ago
Mike Gerber
62670dd0c7
Merge pull request #49 from kba/zip_input_files
...
ocrd cli: use core-provided zip_input_files method
4 years ago
Konstantin Baierer
74e0ac18ed
ocrd cli: use core-provided zip_input_files method
4 years ago