Konstantin Baierer
5c7fd26883
processor: self.resolve_resource model
4 years ago
Robert Sachunsky
261db14ec3
ensure valid coordinates by intersection with parent…
...
- Border: intersect with page frame
- text regions: intersect with (new) Border
- text lines: intersect with (new) text region
(and back-transform at all)
4 years ago
Gerber, Mike
665b739fb8
🐛 sbb_textline_detector: Re-base Border coords too
4 years ago
Gerber, Mike
006c7765b0
🐛 sbb_textline_detector: Filter cropped images (OCR-D)
4 years ago
Gerber, Mike
37cc513ce9
🚧 sbb_textline_detector: Translate detected coordinates
4 years ago
Gerber, Mike
a9b9c8a885
🚧 sbb_textline_detector: Get image via image_from_page
4 years ago
Konstantin Baierer
f167f6768c
getLogger per method
4 years ago
Clemens Neudecker
e4798c6811
replace 'PrintSpace' with 'Border'
4 years ago
Konstantin Baierer
05deb03ec8
use make_file_id and assert_file_grp_cardinality
4 years ago
Gerber, Mike
8b01d9e671
🐛 sbb_textline_detection: Set pcGtsId
...
Newest OCR-D workspace validation requires that the pcGtsId of a
PAGE-XML file matches its METS mets:file/ID. Fix this by setting
it correctly.
4 years ago
Gerber, Mike
f90b3cfa86
🔊 sbb_textline_detector: In OCR-D interface, warn if overwriting existing segmentation
5 years ago
Gerber, Mike
11c0e9cee5
🐛 sbb_textline_detector: Do not print PAGE output to stdout
...
ocrd-sbb-textline-detector uses ocrd_page's parse() to parse XML input,
which writes the XML to stdout by default.
Fix this by silencing it using parse()'s silence=True.
5 years ago
wrznr
4fc57d7756
Assign page id
5 years ago
wrznr
9e9163e852
Simplify the iteration over files in the input file group
5 years ago
Mike Gerber
6e0decb5ec
Merge pull request #12 from kba/rename-tool
...
Rename ocrd_sbb.. to ocrd-sbb... in ocrd_cli.py, ht @bertsky
5 years ago
Gerber, Mike
5fb30a7a1f
Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector "
...
This reverts commit 417b9235d5
, reversing
changes made to a74974b7b6
.
5 years ago
Konstantin Baierer
cf6381c148
Rename ocrd_sbb.. to ocrd-sbb... in ocrd_cli.py, ht @bertsky
5 years ago
Rezanezhad, Vahid
19116091f9
Update config_params.json
5 years ago
Gerber, Mike
af5cbe9052
🐛 sbb_textline_detector: Fix making the output file id
5 years ago
Gerber, Mike
4aed06a325
✨ sbb_textline_detection: Preserve input PAGE info by merging segmentation results
...
ocrd_sbb_textline_detection used the output XML by main.py as is, and
– by doing this – threw away any input data from the input PAGE,
including the critical pc:AlternativeImage and the less important
pc:MetadataItem.
Fix this by merging the segmentation results into a file created from
the input file.
Also add a pc:MetadataItem processingStep about the segmentation
operation.
5 years ago
Gerber, Mike
2528573b4f
✨ sbb_textline_detector: Allow PAGE input in OCR-D interface
...
Previous OCR-D processors may output PAGE files instead of image files.
Resolve images file from PAGE files if necessary.
5 years ago
Gerber, Mike
2199bf0d8c
🧹 sbb_textline_detector: Remove extra .xml suffix from METS file id
5 years ago
Gerber, Mike
5fd04677f9
🐛 sbb_textline_detector: Fix filenames of created OCR-D file group
5 years ago
Gerber, Mike
0c915c75de
✨ sbb_textline_detector: Add a OCR-D interface
5 years ago