From f5a3a4fb2eb7d7484eaca8b840d2ad5b9fc84951 Mon Sep 17 00:00:00 2001 From: Mike Gerber Date: Thu, 13 Feb 2020 11:04:58 +0100 Subject: [PATCH] =?UTF-8?q?=F0=9F=93=9D=20README:=20Update=20to=20reflect?= =?UTF-8?q?=20that=20this=20is=20mainly=20for=20documentation=20of=20the?= =?UTF-8?q?=20trained=20model?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- README.md | 19 +++++++++++++------ 1 file changed, 13 insertions(+), 6 deletions(-) diff --git a/README.md b/README.md index 0648fc3..a644854 100644 --- a/README.md +++ b/README.md @@ -2,14 +2,21 @@ Train a GT4HistOCR Calamari model ================================= `train.sh` trains a Calamari model based on GT4HistOCR. Or rather 5 using -cross-validation to use for confidence voting. - -Requires Calamari 0.3.5. - -`train.sh` is able to download GT4HistOCR from the web if the `data` submodule -is not available, that is if you're not a member of the Qurator team at SBB. +cross-validation to use for confidence voting. This repository mainly +serves as documentation of the providence of the model published at +https://qurator-data.de/calamari-models/, not as the definitive guide to +training such a model. Trained models -------------- For a finished model have a look here: https://qurator-data.de/calamari-models/ + +Training your own model +----------------------- +If you really want to, you can use this script to train your own. It takes +about 1 week on a Nvidia RTX 2080 GPU. Please use [requirements.txt](requirements.txt) +in that case to setup a virtualenv. + +`train.sh` is able to download GT4HistOCR from the web if the `data` submodule +is not available, that is if you're not a member of the Qurator team at SBB.