mirror of
				https://github.com/qurator-spk/neat.git
				synced 2025-10-30 16:24:12 +01:00 
			
		
		
		
	Update Preprocessing.md
This commit is contained in:
		
							parent
							
								
									860a3c45f0
								
							
						
					
					
						commit
						564a9ee851
					
				
					 1 changed files with 2 additions and 3 deletions
				
			
		|  | @ -4,7 +4,6 @@ The preprocessing pipeline that is developed at the | |||
| [Berlin State Library](http://staatsbibliothek-berlin.de/)  | ||||
| comprises the following steps: | ||||
| - textline extraction @[sbb_pixelwise_segmentation](https://github.com/qurator-spk/pixelwise_segmentation_SBB) | ||||
| - word segmentation @[ocrd_tesserocr](https://github.com/OCR-D/ocrd_tesserocr) | ||||
| - OCR @[ocrd_calamari](https://github.com/qurator-spk/ocrd_calamari) | ||||
| - OCR + word segmentation @[ocrd_tesserocr](https://github.com/OCR-D/ocrd_tesserocr) | ||||
| - Tokenization | ||||
| - Pretagging @[sbb_ner](https://github.com/qurator-spk/sbb_ner) | ||||
		Loading…
	
	Add table
		Add a link
		
	
		Reference in a new issue