Commit Graph

125 Commits (c15da4cd0f2d1bb8cc5c5b022b7bc79595b58260)
 

Author SHA1 Message Date
Clemens Neudecker 3b51a14600 Merge pull request from cneud/add-license-1
Add LICENSE
Clemens Neudecker 29870f26e1 Merge pull request from cneud/cneud-PAGE2019
PAGE2019
Clemens Neudecker c0989f5b55 Merge pull request from cneud/cneud-opencv-python-headless
Relax requirements.txt
Clemens Neudecker 9b784e3a81 ocrd implies click
Clemens Neudecker 7c7f035b69 matplotlib implies numpy
Clemens Neudecker 1c4ddac3b6 Merge pull request from kba/kebab-snake
kebab-case snake_case executable, fix
Konstantin Baierer b6ca1a7c53 kebab-case snake_case executable, fix
Gerber, Mike 1b73c3c23e 📝 sbb_textline_detector: Break long line for ocrd_sbb_textline_detector example
Gerber, Mike eb4c8ee99c 📝 sbb_textline_detector: Break long line for ocrd_sbb_textline_detector example
Gerber, Mike b15fed32ff Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector
Gerber, Mike 482c0fd095 📝 sbb_textline_detector: Document OCR-D Usage
Gerber, Mike 3935204338 📝 sbb_textline_detector: Document OCR-D Usage
Clemens Neudecker 3b526ef40d refactor class name
Clemens Neudecker 6c0bfba686 fix typos
Clemens Neudecker 5113d28e13 do not require sudo
Clemens Neudecker 02388a759d Update README.md
Clemens Neudecker c8bc468628 fix docstring
Clemens Neudecker 2ecb021870 refactor class name
Clemens Neudecker e696a068cb Fix typos
Clemens Neudecker d90dad48fd PAGE2019
Clemens Neudecker 58f5d2b3c5 Update requirements.txt
Clemens Neudecker b22a812979 Improve README.md
Clemens Neudecker eb64cc030f Create LICENSE
b-vr103 d08712533a Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector
vahidrezanezhad af670b55ac Update README.md
vahidrezanezhad 1013b7ed64 Update README.md
vahidrezanezhad eeff5a0b2d Update README.md
vahidrezanezhad fb7c605515 Update README.md
vahidrezanezhad ad4f7acdd8 Update README.md
vahidrezanezhad a836a083c1 Update README.md
vahidrezanezhad b0dc6491c7 Update README.md
Rezanezhad, Vahid 19116091f9 Update config_params.json
Gerber, Mike af5cbe9052 🐛 sbb_textline_detector: Fix making the output file id
Rezanezhad, Vahid 2112bb18c6 fixed the bug: local variable 't4' referenced before assignment
Rezanezhad, Vahid a11f6740cb Update main.py - robust deskewing and better page extraction
Rezanezhad, Vahid 0182b7087f remove multiprocessing bug
Gerber, Mike 8fa7179560 🐛 sbb_textline_detector: Disable multiprocessing to fix race condition
Lines were sorted in the wrong regions. Work around this by disabling
multiprocessing until a proper fix is done.
Gerber, Mike 4aed06a325 sbb_textline_detection: Preserve input PAGE info by merging segmentation results
ocrd_sbb_textline_detection used the output XML by main.py as is, and
– by doing this – threw away any input data from the input PAGE,
including the critical pc:AlternativeImage and the less important
pc:MetadataItem.

Fix this by merging the segmentation results into a file created from
the input file.

Also add a pc:MetadataItem processingStep about the segmentation
operation.
Gerber, Mike 4fb3e70ef6 🧹 sbb_textline_detector: Do not create empty/space-only TextEquivs (again)
Gerber, Mike bf41a29e7b 🐛 sbb_textline_detector: Do not hardcode Created/LastChange elements
Gerber, Mike fbd21cdb81 🧹 sbb_textline_detector: Do not create empty/space-only TextEquivs (again)
Rezanezhad, Vahid 2d6dd92b31 Update main.py
Rezanezhad, Vahid 9f97f34255 Update main.py
Rezanezhad, Vahid 8c954a6c7a Update main.py
Rezanezhad, Vahid 6714481556 Update main.py
Rezanezhad, Vahid 719824f19d Update main.py
Gerber, Mike f94511a1d8 Merge branch 'master' of code.dev.sbb.berlin:qurator/mono-repo
Gerber, Mike 4f28cd905a 🧹 sbb_textline_detector: Do not create empty/space-only TextEquivs
ocrd_tesserocr or ocrd_cis complain about already existing text if
empty/space-only TextEquivs elements exist after segmentation. Also, it
does not make sense to create them in a segmentation step.

Fix by removing the code generating the elements.
Rezanezhad, Vahid 00929ab391 Update main.py
Gerber, Mike f0dd955606 Merge branch 'master' of code.dev.sbb.berlin:qurator/mono-repo