mirror of
https://github.com/qurator-spk/neat.git
synced 2025-06-09 11:49:54 +02:00
Update Preprocessing.md
This commit is contained in:
parent
860a3c45f0
commit
564a9ee851
1 changed files with 2 additions and 3 deletions
|
@ -4,7 +4,6 @@ The preprocessing pipeline that is developed at the
|
|||
[Berlin State Library](http://staatsbibliothek-berlin.de/)
|
||||
comprises the following steps:
|
||||
- textline extraction @[sbb_pixelwise_segmentation](https://github.com/qurator-spk/pixelwise_segmentation_SBB)
|
||||
- word segmentation @[ocrd_tesserocr](https://github.com/OCR-D/ocrd_tesserocr)
|
||||
- OCR @[ocrd_calamari](https://github.com/qurator-spk/ocrd_calamari)
|
||||
- OCR + word segmentation @[ocrd_tesserocr](https://github.com/OCR-D/ocrd_tesserocr)
|
||||
- Tokenization
|
||||
- Pretagging @[sbb_ner](https://github.com/qurator-spk/sbb_ner)
|
||||
- Pretagging @[sbb_ner](https://github.com/qurator-spk/sbb_ner)
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue