Update Preprocessing.md

pull/40/head
Clemens Neudecker 5 years ago committed by GitHub
parent 860a3c45f0
commit 564a9ee851
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

@ -4,7 +4,6 @@ The preprocessing pipeline that is developed at the
[Berlin State Library](http://staatsbibliothek-berlin.de/) [Berlin State Library](http://staatsbibliothek-berlin.de/)
comprises the following steps: comprises the following steps:
- textline extraction @[sbb_pixelwise_segmentation](https://github.com/qurator-spk/pixelwise_segmentation_SBB) - textline extraction @[sbb_pixelwise_segmentation](https://github.com/qurator-spk/pixelwise_segmentation_SBB)
- word segmentation @[ocrd_tesserocr](https://github.com/OCR-D/ocrd_tesserocr) - OCR + word segmentation @[ocrd_tesserocr](https://github.com/OCR-D/ocrd_tesserocr)
- OCR @[ocrd_calamari](https://github.com/qurator-spk/ocrd_calamari)
- Tokenization - Tokenization
- Pretagging @[sbb_ner](https://github.com/qurator-spk/sbb_ner) - Pretagging @[sbb_ner](https://github.com/qurator-spk/sbb_ner)

Loading…
Cancel
Save