| | c6c6b8efab | 📝 dinglehopper: Add detail about the text extraction and ExtractedText | 2020-10-08 17:05:36 +02:00 |
| | f50591abac | Merge branch 'feat/display-segment-id' | 2020-10-08 13:39:38 +02:00 |
| | c514abfb9f | 🧹 dinglehopper: Sanitize imports | 2020-10-08 13:33:19 +02:00 |
| | 1077dc64ce | ➡️ dinglehopper: Move ExtractedText to its own file | 2020-10-08 13:25:20 +02:00 |
| | 9dd4ff0aae | ✨ dinglehopper: Extract line IDs for ALTO | 2020-10-08 12:54:28 +02:00 |
| | f3aafb6fdf | ✨ dinglehopper: Validate ExtractedText.{segments,_text} in both directions | 2020-10-08 12:20:27 +02:00 |
| | b14c35e147 | 🎨 dinglehopper: Use multimethod to handle str vs ExtractedText | 2020-10-08 12:15:58 +02:00 |
| | a17ee2afec | 🚧 dinglehopper: Guarantee NFC + rename from_text → from_str | 2020-10-08 11:25:01 +02:00 |
| | 7843824eaf | 🚧 dinglehopper: Support str & ExtractedText in CER and distance functions | 2020-10-08 10:47:20 +02:00 |
| | 5bee55c896 | 💩 dinglehopper: Fix OCR-D CLI test by working around ocrd_cli_wrap_processor() check for arguments | 2020-10-07 18:40:06 +02:00 |
| | 96b55f1806 | 🚧 dinglehopper: Hierarchical text representation | 2020-10-07 18:31:52 +02:00 |
| | d706ef4621 | 📝 Document CER/WER and the format detection (Fixes GH-26) | 2020-09-30 17:58:05 +02:00 |
| | da47e41c85 | 💩 dinglehopper: Fix OCR-D CLI test by working around ocrd_cli_wrap_processor() check for arguments | 2020-09-25 14:53:19 +02:00 |
| | 7085ee0fd8 | Merge pull request #29 from kba/getlogger: getLogger per method | 2020-09-25 13:20:58 +02:00 |
| | 77154ef256 | 📝 dinglehopper: Document REPORT_PREFIX (Closes GH-27.) | 2020-09-24 20:58:15 +02:00 |
| Konstantin Baierer | 12da98e477 | getLogger per method | 2020-09-24 10:16:52 +02:00 |
| Konstantin Baierer | 004ae298ca | ocrd cli: use make_file_id and assert_file_grp_cardinality | 2020-08-07 18:00:33 +02:00 |
| | 6ab38f1bda | 🎨 dinglehopper: Make PyCharm happier with the type hinting, newlines etc. | 2020-06-18 13:27:59 +02:00 |
| | d484810038 | ✨ dinglehopper: Validate read segment ids | 2020-06-18 13:27:59 +02:00 |
| | d39f74f11a | 🧹 dinglehopper: Remove obsolete normalization-related FIXME | 2020-06-18 13:27:59 +02:00 |
| | 8c5f7c73d5 | 🧹 dinglehopper: Replace XXX with an actual comment | 2020-06-18 13:27:59 +02:00 |
| | 37edc0336f | 🧹 dinglehopper: Remove obsolete XXX that has a GitHub issue | 2020-06-18 13:27:59 +02:00 |
| | 9f05e6ca4c | 🧹 dinglehopper: Remove obsolete XXX about None ids | 2020-06-18 13:27:59 +02:00 |
| | 4469af62c8 | 🎨 dinglehopper: Unfuck substitutions a bit | 2020-06-18 13:27:59 +02:00 |
| | 079be203bd | 🐛 dinglehopper: Fix tests to deal with new normalization logic | 2020-06-18 13:27:59 +02:00 |
| | c010a7f05e | 🧹 dinglehopper: Calculate segment ids once, on the first call | 2020-06-18 13:27:59 +02:00 |
| | 0cf7ff4721 | 🧹 dinglehopper: Remove obsolete XXX about the PAGE hierarchy | 2020-06-18 13:27:59 +02:00 |
| | c432cb505a | 🧹 dinglehopper: Clean up test_lines_similar() | 2020-06-18 13:27:59 +02:00 |
| | 0c33e84415 | 📓 dinglehopper: Document editops() | 2020-06-18 13:27:59 +02:00 |
| | a61c935624 | 🧹 dinglehopper: Move Python 3.5 XXXs to a GitHub issue. See https://github.com/qurator-spk/dinglehopper/issues/20. | 2020-06-18 13:27:59 +02:00 |
| | 257e4986cc | 🚧 dinglehopper: Use a Bootstrap tooltip for the segment id | 2020-06-18 13:27:59 +02:00 |
| | a320d5fd8f | 🚧 dinglehopper: Re-introduce "substitute_equivalences" as Normalization.NFC_SBB | 2020-06-18 13:27:59 +02:00 |
| | 2579e0220c | 🚧 dinglehopper: Remove debug output | 2020-06-18 13:27:59 +02:00 |
| | d4e39d3d26 | 🚧 dinglehopper: Display segment id in the corresponding column | 2020-06-18 13:27:59 +02:00 |
| | 48ad340428 | 🚧 dinglehopper: Display segment id when hovering over a character difference | 2020-06-18 13:27:59 +02:00 |
| | 1f6538b44c | 🚧 dinglehopper: Extract text while retaining segment id info | 2020-06-18 13:27:59 +02:00 |
| | 275ff32524 | 🚧 dinglehopper: Extract text while retaining segment id info | 2020-06-18 13:27:59 +02:00 |
| | 4e182e0794 | 🚧 dinglehopper: Extract text while retaining segment id info | 2020-06-18 13:27:59 +02:00 |
| | 9f8bb1d8ea | 🚧 dinglehopper: Extract text while retaining segment id info | 2020-06-18 13:27:59 +02:00 |
| | 668de758a0 | ✨ dinglehopper: Support disabling metrics in the OCR-D interface | 2020-06-09 18:29:59 +02:00 |
| | f699697eb3 | 🐛 dinglehopper: Fix reading OCR-D workspace files when only URLs are provided | 2020-06-09 17:13:22 +02:00 |
| | 22765f02a2 | 🐛 dinglehopper: Fix tests by making metrics a keyword argument | 2020-06-09 13:07:44 +02:00 |
| | 5cbeb7b0dd | ✨ dinglehopper: Support disabling the metrics using CLI option --no-metrics | 2020-06-08 18:26:21 +02:00 |
| | 745095e52c | ✨ dinglehopper: Include number of characters and words in JSON report | 2020-02-21 14:53:16 +01:00 |
| | 48a31ce672 | Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector" This reverts commit 2c89bf3b35ee290d7b830ef270df3a96aa48245e, reversing changes made to 9f7e413148ca5dbac9b555d7b0d0a5fa3a0f5340. | 2019-12-09 12:44:05 +01:00 |
| b-vr103 | 1303a7d92f | Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector | 2019-12-09 11:57:16 +01:00 |
| | f32eb9eb69 | 🐛 dinglehopper: Escape text inserted into HTML (Fixes #8) | 2019-12-06 15:59:09 +01:00 |
| | 82e863fac2 | 📝 dinglehopper: Document seq_editops() | 2019-12-04 13:22:38 +01:00 |
| | 5ccdace1dd | 🎨 dinglehopper: Move working_directory() context manager into tests/util | 2019-12-02 15:18:08 +01:00 |
| | f98c527c93 | 🐛 dinglehopper: Fix working_directory() context manager | 2019-12-02 15:14:16 +01:00 |