174 Commits (35e0ae0ef811ec160d07a38b0552f90260b3dec5)
 

Author SHA1 Message Date
Clemens Neudecker 29870f26e1
Merge pull request #4 from cneud/cneud-PAGE2019
PAGE2019
5 years ago
Clemens Neudecker c0989f5b55
Merge pull request #3 from cneud/cneud-opencv-python-headless
Relax requirements.txt
5 years ago
Clemens Neudecker 9b784e3a81
ocrd implies click 5 years ago
Clemens Neudecker 7c7f035b69
matplotlib implies numpy 5 years ago
Clemens Neudecker 1c4ddac3b6
Merge pull request #10 from kba/kebab-snake
kebab-case snake_case executable, fix #9
5 years ago
Konstantin Baierer b6ca1a7c53 kebab-case snake_case executable, fix #9 5 years ago
Gerber, Mike 1b73c3c23e 📝 sbb_textline_detector: Break long line for ocrd_sbb_textline_detector example 5 years ago
Gerber, Mike eb4c8ee99c 📝 sbb_textline_detector: Break long line for ocrd_sbb_textline_detector example 5 years ago
Gerber, Mike b15fed32ff Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector 5 years ago
Gerber, Mike 482c0fd095 📝 sbb_textline_detector: Document OCR-D Usage 5 years ago
Gerber, Mike 3935204338 📝 sbb_textline_detector: Document OCR-D Usage 5 years ago
Clemens Neudecker 3b526ef40d
refactor class name 5 years ago
Clemens Neudecker 6c0bfba686
fix typos 5 years ago
Clemens Neudecker 5113d28e13
do not require sudo 5 years ago
Clemens Neudecker 02388a759d
Update README.md 5 years ago
Clemens Neudecker c8bc468628
fix docstring 5 years ago
Clemens Neudecker 2ecb021870
refactor class name 5 years ago
Clemens Neudecker e696a068cb
Fix typos 5 years ago
Clemens Neudecker d90dad48fd
PAGE2019 5 years ago
Clemens Neudecker 58f5d2b3c5
Update requirements.txt 5 years ago
Clemens Neudecker b22a812979
Improve README.md 5 years ago
Clemens Neudecker eb64cc030f
Create LICENSE 5 years ago
b-vr103 d08712533a Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector 5 years ago
vahidrezanezhad af670b55ac
Update README.md 5 years ago
vahidrezanezhad 1013b7ed64
Update README.md 5 years ago
vahidrezanezhad eeff5a0b2d
Update README.md 5 years ago
vahidrezanezhad fb7c605515
Update README.md 5 years ago
vahidrezanezhad ad4f7acdd8
Update README.md 5 years ago
vahidrezanezhad a836a083c1
Update README.md 5 years ago
vahidrezanezhad b0dc6491c7
Update README.md 5 years ago
Rezanezhad, Vahid 19116091f9 Update config_params.json 5 years ago
Gerber, Mike af5cbe9052 🐛 sbb_textline_detector: Fix making the output file id 5 years ago
Rezanezhad, Vahid 2112bb18c6 fixed the bug: local variable 't4' referenced before assignment 5 years ago
Rezanezhad, Vahid a11f6740cb Update main.py - robust deskewing and better page extraction 5 years ago
Rezanezhad, Vahid 0182b7087f remove multiprocessing bug 5 years ago
Gerber, Mike 8fa7179560 🐛 sbb_textline_detector: Disable multiprocessing to fix race condition
Lines were sorted in the wrong regions. Work around this by disabling
multiprocessing until a proper fix is done.
5 years ago
Gerber, Mike 4aed06a325 sbb_textline_detection: Preserve input PAGE info by merging segmentation results
ocrd_sbb_textline_detection used the output XML by main.py as is, and
– by doing this – threw away any input data from the input PAGE,
including the critical pc:AlternativeImage and the less important
pc:MetadataItem.

Fix this by merging the segmentation results into a file created from
the input file.

Also add a pc:MetadataItem processingStep about the segmentation
operation.
5 years ago
Gerber, Mike 4fb3e70ef6 🧹 sbb_textline_detector: Do not create empty/space-only TextEquivs (again) 5 years ago
Gerber, Mike bf41a29e7b 🐛 sbb_textline_detector: Do not hardcode Created/LastChange elements 5 years ago
Gerber, Mike fbd21cdb81 🧹 sbb_textline_detector: Do not create empty/space-only TextEquivs (again) 5 years ago
Rezanezhad, Vahid 2d6dd92b31 Update main.py 5 years ago
Rezanezhad, Vahid 9f97f34255 Update main.py 5 years ago
Rezanezhad, Vahid 8c954a6c7a Update main.py 5 years ago
Rezanezhad, Vahid 6714481556 Update main.py 5 years ago
Rezanezhad, Vahid 719824f19d Update main.py 5 years ago
Gerber, Mike f94511a1d8 Merge branch 'master' of code.dev.sbb.berlin:qurator/mono-repo 5 years ago
Gerber, Mike 4f28cd905a 🧹 sbb_textline_detector: Do not create empty/space-only TextEquivs
ocrd_tesserocr or ocrd_cis complain about already existing text if
empty/space-only TextEquivs elements exist after segmentation. Also, it
does not make sense to create them in a segmentation step.

Fix by removing the code generating the elements.
5 years ago
Rezanezhad, Vahid 00929ab391 Update main.py 5 years ago
Gerber, Mike f0dd955606 Merge branch 'master' of code.dev.sbb.berlin:qurator/mono-repo 5 years ago
Gerber, Mike 2528573b4f sbb_textline_detector: Allow PAGE input in OCR-D interface
Previous OCR-D processors may output PAGE files instead of image files.
Resolve images file from PAGE files if necessary.
5 years ago