Commit Graph

427 Commits (a95a85a889a6b4a4f90818f1a35b99c6034c0b05)
 

Author SHA1 Message Date
Gerber, Mike 009fa55c09 Merge branch 'master' of https://github.com/qurator-spk/dinglehopper
Gerber, Mike c20bbbfa25 📝 dinglehopper: Update screenshot to include a region id tooltip
Mike Gerber 252bf9b3e7
📝 dinglehopper: Fix markdown in README.md
Gerber, Mike c6c6b8efab 📝 dinglehopper: Add detail about the text extraction and ExtractedText
Gerber, Mike 7025ea54a8 📝 dinglehopper: Move developer info to README-DEV.md
Gerber, Mike f50591abac Merge branch 'feat/display-segment-id'
Gerber, Mike c514abfb9f 🧹 dinglehopper: Sanitize imports
Gerber, Mike 1077dc64ce ➡️ dinglehopper: Move ExtractedText to its own file
Gerber, Mike 9dd4ff0aae dinglehopper: Extract line IDs for ALTO
Gerber, Mike f3aafb6fdf dinglehopper: Validate ExtractedText.{segments,_text} in both directions
Gerber, Mike 1f9a680fe7 ⚙️ dinglehopper: PyCharm should use dinglehopper-github virtualenv
Gerber, Mike b14c35e147 🎨 dinglehopper: Use multimethod to handle str vs ExtractedText
Gerber, Mike a17ee2afec 🚧 dinglehopper: Guarantee NFC + rename from_text → from_str
Gerber, Mike 7843824eaf 🚧 dinglehopper: Support str & ExtractedText in CER and distance functions
Gerber, Mike 5bee55c896 💩 dinglehopper: Fix OCR-D CLI test by working around ocrd_cli_wrap_processor() check for arguments
Gerber, Mike 96b55f1806 🚧 dinglehopper: Hierarchical text representation
Gerber, Mike db6292611f 🧹 dinglehopper: Remove merged text extraction test code
Gerber, Mike d706ef4621 📝 Document CER/WER and the format detection (Fixes GH-26)
Gerber, Mike da47e41c85 💩 dinglehopper: Fix OCR-D CLI test by working around ocrd_cli_wrap_processor() check for arguments
Mike Gerber 7085ee0fd8
Merge pull request from kba/getlogger
getLogger per method
Gerber, Mike 77154ef256 📝 dinglehopper: Document REPORT_PREFIX (Closes GH-27.)
Gerber, Mike 829b84c66a ⚙️ dinglehopper: Add PyCharm's vcs.xml to git
Konstantin Baierer 12da98e477 getLogger per method
Gerber, Mike 717801bdbb Merge commit '7930ecd42868cb6785a58f8ee95b05882704621d'
Gerber, Mike 7930ecd428 Merge branch 'master' of https://github.com/qurator-spk/dinglehopper
Gerber, Mike 976a042b2b 🔧 dinglehopper: Add PyCharm code style config
Gerber, Mike 7e3dafd3bc 🔧 dinglehopper: Add PyCharm code style config
Mike Gerber 2b98f69afe
Merge pull request from kba/file-ids-and-such
ocrd cli: use make_file_id and assert_file_grp_cardinality
Konstantin Baierer 004ae298ca ocrd cli: use make_file_id and assert_file_grp_cardinality
Gerber, Mike 79253c2640 Merge branch 'feat/display-segment-id' of https://github.com/qurator-spk/dinglehopper into feat/display-segment-id
Gerber, Mike 5a3a74b246 Merge branch 'feat/display-segment-id' of github.com:qurator-spk/dinglehopper into feat/display-segment-id
Gerber, Mike 6ab38f1bda 🎨 dinglehopper: Make PyCharm happier with the type hinting, newlines etc.
Gerber, Mike d484810038 dinglehopper: Validate read segment ids
Gerber, Mike d39f74f11a 🧹 dinglehopper: Remove obsolete normalization-related FIXME
Gerber, Mike 8c5f7c73d5 🧹 dinglehopper: Replace XXX with an actual comment
Gerber, Mike 37edc0336f 🧹 dinglehopper: Remove obsolete XXX that has a GitHub issue
Gerber, Mike 9f05e6ca4c 🧹 dinglehopper: Remove obsolete XXX about None ids
Gerber, Mike 4469af62c8 🎨 dinglehopper: Unfuck substitutions a bit
Gerber, Mike 079be203bd 🐛 dinglehopper: Fix tests to deal with new normalization logic
Gerber, Mike c010a7f05e 🧹 dinglehopper: Calculate segment ids once, on the first call
Gerber, Mike 0cf7ff4721 🧹 dinglehopper: Remove obsolete XXX about the PAGE hierarchy
Gerber, Mike c432cb505a 🧹 dinglehopper: Clean up test_lines_similar()
Gerber, Mike 0c33e84415 📓 dinglehopper: Document editops()
Gerber, Mike a61c935624 🧹 dinglehopper: Move Python 3.5 XXXs to a GitHub issue
See https://github.com/qurator-spk/dinglehopper/issues/20.
Gerber, Mike 257e4986cc 🚧 dinglehopper: Use a Bootstrap tooltip for the segment id
Gerber, Mike a320d5fd8f 🚧 dinglehopper: Re-introduce "substitute_equivalences" as Normalization.NFC_SBB
Gerber, Mike 2579e0220c 🚧 dinglehopper: Remove debug output
Gerber, Mike d4e39d3d26 🚧 dinglehopper: Display segment id in the corresponding column
Gerber, Mike 48ad340428 🚧 dinglehopper: Display segment id when hovering over a character difference
Gerber, Mike 1f6538b44c 🚧 dinglehopper: Extract text while retaining segment id info