From a5d46f0d28c40a5d70004b3042916ec10a6f30f3 Mon Sep 17 00:00:00 2001 From: Mike Gerber Date: Thu, 1 Oct 2020 13:23:44 +0200 Subject: [PATCH] =?UTF-8?q?=F0=9F=9A=A7=20README:=20Mention=20METS?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 3cf8227..cb370b3 100644 --- a/README.md +++ b/README.md @@ -8,7 +8,7 @@ ## Introduction -**ocrd_calamari** offers a [OCR-D](https://ocr-d.de) compliant workspace processor for the functionality of Calamari OCR. It uses [PAGE XML](https://github.com/PRImA-Research-Lab/PAGE-XML) documents as input and output. +**ocrd_calamari** offers a [OCR-D](https://ocr-d.de) compliant workspace processor for the functionality of Calamari OCR. It uses OCR-D workspaces (METS) with [PAGE XML](https://github.com/PRImA-Research-Lab/PAGE-XML) documents as input and output. This processor only operates on the text line level and so needs a line segmentation (and by extension a binarized image) as its input.