mirror of https://github.com/qurator-spk/dinglehopper.git synced 2025-06-09 11:50:00 +02:00
Commit graph

468 commits

Author SHA1 Message Date
f3aafb6fdf dinglehopper: Validate ExtractedText.{segments,_text} in both directions 2020-10-08 12:20:27 +02:00
1f9a680fe7 ⚙️ dinglehopper: PyCharm should use dinglehopper-github virtualenv 2020-10-08 12:16:42 +02:00
b14c35e147 🎨 dinglehopper: Use multimethod to handle str vs ExtractedText 2020-10-08 12:15:58 +02:00
a17ee2afec 🚧 dinglehopper: Guarantee NFC + rename from_text → from_str 2020-10-08 11:25:01 +02:00
7843824eaf 🚧 dinglehopper: Support str & ExtractedText in CER and distance functions 2020-10-08 10:47:20 +02:00
5bee55c896 💩 dinglehopper: Fix OCR-D CLI test by working around ocrd_cli_wrap_processor() check for arguments 2020-10-07 18:40:06 +02:00
96b55f1806 🚧 dinglehopper: Hierarchical text representation 2020-10-07 18:31:52 +02:00
db6292611f 🧹 dinglehopper: Remove merged text extraction test code 2020-10-07 16:07:27 +02:00
d706ef4621 📝 Document CER/WER and the format detection (Fixes GH-26) 2020-09-30 17:58:05 +02:00
da47e41c85 💩 dinglehopper: Fix OCR-D CLI test by working around ocrd_cli_wrap_processor() check for arguments 2020-09-25 14:53:19 +02:00
7085ee0fd8 Merge pull request #29 from kba/getlogger (getLogger per method) 2020-09-25 13:20:58 +02:00
77154ef256 📝 dinglehopper: Document REPORT_PREFIX (Closes GH-27.) 2020-09-24 20:58:15 +02:00
829b84c66a ⚙️ dinglehopper: Add PyCharm's vcs.xml to git 2020-09-24 20:51:42 +02:00
Konstantin Baierer 12da98e477 getLogger per method 2020-09-24 10:16:52 +02:00
717801bdbb Merge commit '7930ecd428' 2020-09-03 14:47:44 +02:00
7930ecd428 Merge branch 'master' of https://github.com/qurator-spk/dinglehopper 2020-08-10 18:03:39 +02:00
976a042b2b 🔧 dinglehopper: Add PyCharm code style config 2020-08-10 18:03:29 +02:00
7e3dafd3bc 🔧 dinglehopper: Add PyCharm code style config 2020-08-10 18:03:00 +02:00
2b98f69afe Merge pull request #23 from kba/file-ids-and-such (ocrd cli: use make_file_id and assert_file_grp_cardinality) 2020-08-07 18:12:07 +02:00
Konstantin Baierer 004ae298ca ocrd cli: use make_file_id and assert_file_grp_cardinality 2020-08-07 18:00:33 +02:00
79253c2640 Merge branch 'feat/display-segment-id' of https://github.com/qurator-spk/dinglehopper into feat/display-segment-id 2020-06-26 17:52:39 +02:00
5a3a74b246 Merge branch 'feat/display-segment-id' of github.com:qurator-spk/dinglehopper into feat/display-segment-id 2020-06-23 17:02:56 +02:00
6ab38f1bda 🎨 dinglehopper: Make PyCharm happier with the type hinting, newlines etc. 2020-06-18 13:27:59 +02:00
d484810038 dinglehopper: Validate read segment ids 2020-06-18 13:27:59 +02:00
d39f74f11a 🧹 dinglehopper: Remove obsolete normalization-related FIXME 2020-06-18 13:27:59 +02:00
8c5f7c73d5 🧹 dinglehopper: Replace XXX with an actual comment 2020-06-18 13:27:59 +02:00
37edc0336f 🧹 dinglehopper: Remove obsolete XXX that has a GitHub issue 2020-06-18 13:27:59 +02:00
9f05e6ca4c 🧹 dinglehopper: Remove obsolete XXX about None ids 2020-06-18 13:27:59 +02:00
4469af62c8 🎨 dinglehopper: Unfuck substitutions a bit 2020-06-18 13:27:59 +02:00
079be203bd 🐛 dinglehopper: Fix tests to deal with new normalization logic 2020-06-18 13:27:59 +02:00
c010a7f05e 🧹 dinglehopper: Calculate segment ids once, on the first call 2020-06-18 13:27:59 +02:00
0cf7ff4721 🧹 dinglehopper: Remove obsolete XXX about the PAGE hierarchy 2020-06-18 13:27:59 +02:00
c432cb505a 🧹 dinglehopper: Clean up test_lines_similar() 2020-06-18 13:27:59 +02:00
0c33e84415 📓 dinglehopper: Document editops() 2020-06-18 13:27:59 +02:00
a61c935624 🧹 dinglehopper: Move Python 3.5 XXXs to a GitHub issue (see https://github.com/qurator-spk/dinglehopper/issues/20) 2020-06-18 13:27:59 +02:00
257e4986cc 🚧 dinglehopper: Use a Bootstrap tooltip for the segment id 2020-06-18 13:27:59 +02:00
a320d5fd8f 🚧 dinglehopper: Re-introduce "substitute_equivalences" as Normalization.NFC_SBB 2020-06-18 13:27:59 +02:00
2579e0220c 🚧 dinglehopper: Remove debug output 2020-06-18 13:27:59 +02:00
d4e39d3d26 🚧 dinglehopper: Display segment id in the corresponding column 2020-06-18 13:27:59 +02:00
48ad340428 🚧 dinglehopper: Display segment id when hovering over a character difference 2020-06-18 13:27:59 +02:00
1f6538b44c 🚧 dinglehopper: Extract text while retaining segment id info 2020-06-18 13:27:59 +02:00
275ff32524 🚧 dinglehopper: Extract text while retaining segment id info 2020-06-18 13:27:59 +02:00
4e182e0794 🚧 dinglehopper: Extract text while retaining segment id info 2020-06-18 13:27:59 +02:00
9f8bb1d8ea 🚧 dinglehopper: Extract text while retaining segment id info 2020-06-18 13:27:59 +02:00
1083dcc5b9 🚧 dinglehopper: Test aligning by character while retaining segment id info 2020-06-18 13:27:59 +02:00
55db2b713f 🚧 dinglehopper: Test aligning by character while retaining segment id info 2020-06-18 13:27:59 +02:00
0d569e81c3 🚧 dinglehopper: Test aligning by character while retaining segment id info 2020-06-18 13:27:59 +02:00
167dad18f4 🚧 dinglehopper: Test aligning by character while retaining segment id info 2020-06-18 13:27:59 +02:00
4cd835ae51 🚧 dinglehopper: Test aligning by character while retaining segment id info 2020-06-18 13:27:59 +02:00
8435d88419 🚧 dinglehopper: WIP data structure for extracted text 2020-06-18 13:27:59 +02:00