mirror of
				https://github.com/mikegerber/ocrd_calamari.git
				synced 2025-10-30 23:34:13 +01:00 
			
		
		
		
	
		
			
				
	
	
		
			31 lines
		
	
	
	
		
			833 B
		
	
	
	
		
			Markdown
		
	
	
	
	
	
			
		
		
	
	
			31 lines
		
	
	
	
		
			833 B
		
	
	
	
		
			Markdown
		
	
	
	
	
	
| # ocrd_calamari
 | |
| 
 | |
| Recognize text using [Calamari OCR](https://github.com/Calamari-OCR/calamari).
 | |
| 
 | |
| ## Introduction
 | |
| 
 | |
| This offers a OCR-D compliant workspace processor for some of the functionality of Calamari OCR.
 | |
| 
 | |
| This processor only operates on the text line level and so needs a line segmentation (and by extension a binarized 
 | |
| image) as its input.
 | |
| 
 | |
| ## Example Usage
 | |
| 
 | |
| ```sh
 | |
| ocrd-calamari-recognize -p test-parameters.json -m mets.xml -I OCR-D-SEG-LINE -O OCR-D-OCR-CALAMARI
 | |
| ```
 | |
| 
 | |
| With `test-parameters.json`:
 | |
| 
 | |
| ```json
 | |
| {
 | |
|     "checkpoint": "/path/to/some/trained/models/*.ckpt.json"
 | |
| }
 | |
| ```
 | |
| 
 | |
| TODO
 | |
| ----
 | |
| 
 | |
| * Support Calamari's "extended prediction data" output
 | |
| * Currently, the processor only supports a prediction using confidence voting of multiple models. While this is
 | |
|   superior, it makes sense to support single model prediction, too.
 |