Robert Sachunsky
e16438a377
add MetadataItem about processor
2019-11-29 18:15:56 +01:00
Robert Sachunsky
ad8f25666b
back out gracefully when text annotation is missing
2019-11-29 18:06:37 +01:00
Robert Sachunsky
9002606e1c
unify function for 3 levels
2019-11-29 17:46:16 +01:00
Robert Sachunsky
f829015bb5
relax concatenation test: text must be equal irrespective of tokenization/joiner
2019-11-29 17:19:56 +01:00
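The relaxed concatenation test in the commit above might look roughly like the following. This is a minimal sketch, not the repository's code: the function name `same_text_ignoring_joiner` is hypothetical, and it assumes that "irrespective of tokenization/joiner" means whitespace differences between parent and concatenated child text are disregarded.

```python
from typing import List


def _strip_whitespace(text: str) -> str:
    # Drop all whitespace so that the choice of joiner (space, newline, none)
    # and the token boundaries no longer influence the comparison.
    return "".join(text.split())


def same_text_ignoring_joiner(parent_text: str, child_texts: List[str]) -> bool:
    """Hypothetical check: parent text equals the children's text in order,
    regardless of how the children were tokenized or joined."""
    return _strip_whitespace(parent_text) == _strip_whitespace("".join(child_texts))


if __name__ == "__main__":
    print(same_text_ignoring_joiner("foo bar", ["foo", "bar"]))
    print(same_text_ignoring_joiner("foo bar", ["bar", "foo"]))
```

A comparison like this still fails when the children are in the wrong order, which is the inconsistency the tool is meant to detect.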
6ee105b17c
Merge pull request #4 from bertsky/all-orders
All orders
2019-11-29 13:13:24 +01:00
Robert Sachunsky
0dc5bdac2e
generalize to other textLineOrder/readingDirection:
- don't ignore regions / lines / words that are not top-to-bottom and left-to-right;
  instead, only ignore regions that are neither top-to-bottom nor bottom-to-top, and
  lines or words that are neither left-to-right nor right-to-left
  (thus applying each on its appropriate level and allowing reverse sorting,
  but still discounting rotated layouts)
- don't enter segments if they have no more than 1 child
- improve logging: show failed attempts on debug, show pageIds throughout
2019-11-29 12:31:08 +01:00
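The rules described in the commit above can be sketched as follows. This is an illustration under stated assumptions, not the actual implementation: `Segment`, `sort_children`, and the `level` argument are hypothetical names, and each child is reduced to the x/y origin of its bounding box.

```python
from typing import List, Optional, Tuple

# Hypothetical stand-in for a PAGE segment: (id, x, y) of its bounding-box origin.
Segment = Tuple[str, int, int]

# Directions handled on each level; anything else (e.g. rotated layouts) is skipped.
REGION_ORDERS = {"top-to-bottom", "bottom-to-top"}
LINE_ORDERS = {"left-to-right", "right-to-left"}


def sort_children(children: List[Segment], direction: str, level: str) -> Optional[List[Segment]]:
    """Return the children sorted by coordinate, or None if the segment is skipped."""
    # Nothing to re-order when there is at most one child.
    if len(children) <= 1:
        return None
    if level == "region":
        # Lines inside a region are ordered along the y axis.
        if direction not in REGION_ORDERS:
            return None
        return sorted(children, key=lambda s: s[2], reverse=(direction == "bottom-to-top"))
    # Words inside a line (or glyphs inside a word) are ordered along the x axis.
    if direction not in LINE_ORDERS:
        return None
    return sorted(children, key=lambda s: s[1], reverse=(direction == "right-to-left"))


if __name__ == "__main__":
    lines = [("l2", 0, 50), ("l1", 0, 10)]
    print(sort_children(lines, "top-to-bottom", "region"))
    print(sort_children(lines, "bottom-to-top", "region"))
```

Returning `None` for unsupported directions keeps "skipped" distinct from "already in order", which a caller could use to log the failed attempt at debug level.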
Robert Sachunsky
295165692b
fix module tree
2019-11-29 12:16:04 +01:00
1468b6070b
🐛 Use ocrd-repair-inconsistencies where appropriate
2019-11-26 17:11:51 +01:00
2205a4469c
Merge pull request #1 from bertsky/patch-1
Improve description & documented steps & "no underscores"
2019-11-26 17:03:10 +01:00
Robert Sachunsky
30a59d897d
Merge branch 'master' into patch-1
2019-11-26 16:53:47 +01:00
cf3ee27288
📝 README: Fix grammar
2019-11-26 15:27:53 +01:00
Robert Sachunsky
f9125b59d8
better tool description, fix steps, no underscores
2019-11-26 12:49:30 +00:00
Robert Sachunsky
b41278f99b
improve description of behaviour in README
clarify:
- what levels are re-ordered (`ReadingOrder` unaffected/unrelated)
- textual consistency is only the trigger, not the driver
- coordinate-based order respects annotation
2019-11-26 12:43:52 +00:00
7d4afef3f8
📝 README: Clarify
2019-11-26 13:07:57 +01:00
b2845ad254
📝 Update fix script
2019-11-25 18:06:05 +01:00
1c44a198af
✨ Report missing reading direction/textline order
2019-11-25 18:05:10 +01:00
e8d6845e74
📝 Include an example script
2019-11-25 12:00:45 +01:00
e3a1d1e941
📝 README: Include an example script
2019-11-25 11:58:57 +01:00
44fe8a8357
✨ Check reading direction/textline order rather than assuming
2019-11-25 11:06:28 +01:00
dd9f1a3093
🎨 Methods → functions
2019-11-22 16:58:53 +01:00
42484da95a
🎨 Optimize imports
2019-11-22 16:58:34 +01:00
c868ec568d
📝 Add a short README
2019-11-22 16:51:28 +01:00
25437176d4
✨ Fix line order in regions
2019-11-22 16:50:55 +01:00
c7033a5d4d
🎨 Extract methods to fix lines + words
2019-11-22 16:39:23 +01:00
8d21cd8ab9
🎉 Initial commit
2019-11-22 16:18:05 +01:00