mirror of https://github.com/qurator-spk/train-calamari-gt4histocr.git synced 2026-07-21 17:49:10 +02:00

Mike Gerber f5a3a4fb2e 📝 README: Update to reflect that this is mainly for documentation of the trained model		2020-02-13 11:04:58 +01:00
data@f817209ba7	✨ calamari-models/train-calamari-gt4histocr: Move from my personal experiments repo	2019-12-05 17:48:45 +01:00
LICENSE	calamari-models/train-calamari-gt4histocr: Add license (Fixes qurator-spk/train-calamari-gt4histocr#1 )	2019-12-13 13:58:39 +01:00
qurator_data_lib.sh	calamari-models/train-calamari-gt4histocr: Update train.sh to use qurator_data_lib.sh	2019-12-05 18:16:43 +01:00
README.md	📝 README: Update to reflect that this is mainly for documentation of the trained model	2020-02-13 11:04:58 +01:00
requirements.txt	⬆️ Make tensorflow-gpu dependency less tight	2020-02-12 15:59:35 +01:00
train.sh	calamari-models/train-calamari-gt4histocr: Update train.sh to use qurator_data_lib.sh	2019-12-05 18:16:43 +01:00
upload-prepare.sh	calamari-models/train-calamari-gt4histocr: Add a script to prepare upload	2019-12-05 18:48:02 +01:00

README.md

Train a GT4HistOCR Calamari model

train.sh trains a Calamari model on GT4HistOCR, or rather five models, using cross-validation, so that their predictions can be combined by confidence voting. This repository mainly serves as documentation of the provenance of the model published at https://qurator-data.de/calamari-models/, not as the definitive guide to training such a model.

Trained models

For a finished model, see https://qurator-data.de/calamari-models/

Training your own model

If you really want to, you can use this script to train your own model. Training takes about one week on an Nvidia RTX 2080 GPU. In that case, please use requirements.txt to set up a virtualenv.
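
The environment setup could look roughly like the sketch below. The helper name setup_env and the directory name venv are hypothetical and not part of this repository, and the sketch uses Python's built-in venv module in place of the virtualenv tool. It assumes a python3 that is compatible with the packages listed in requirements.txt and that it is run from the repository root.

```shell
# Hypothetical helper, not part of this repository.
# Creates an isolated Python environment and installs the dependencies
# listed in requirements.txt from the current directory.
setup_env() {
    env_dir="${1:-venv}"   # target directory; "venv" is an arbitrary default

    # Refuse to continue when not called from the repository root.
    [ -f requirements.txt ] || {
        echo "requirements.txt not found in $(pwd)" >&2
        return 1
    }

    python3 -m venv "$env_dir" || return 2
    # Activate the environment in the current shell.
    . "$env_dir/bin/activate" || return 3
    pip install -r requirements.txt || return 4
}
```

After sourcing this snippet, `setup_env && ./train.sh` would start a training run, assuming train.sh needs no arguments. Check the script first, since a full run occupies the GPU for about a week.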

train.sh can download GT4HistOCR from the web if the data submodule is not available, that is, if you are not a member of the Qurator team at SBB.
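
The fallback can be illustrated with a small sketch. This is not the actual logic in train.sh or qurator_data_lib.sh; the function name data_available is hypothetical. It relies on the fact that a Git submodule that has not been initialized leaves an empty directory behind, and it assumes the submodule lives at the path data, as in the repository listing.

```shell
# Hypothetical sketch, not taken from train.sh.
# Succeeds if the given directory exists and contains at least one entry.
# An uninitialized Git submodule is an empty directory, so it fails the check.
data_available() {
    [ -d "$1" ] && [ -n "$(ls -A "$1" 2>/dev/null)" ]
}

if data_available data; then
    echo "data submodule is populated, using local copy"
else
    echo "data submodule not available, GT4HistOCR would be downloaded instead"
fi
```

The download step itself is left out on purpose; see train.sh for the source it actually uses.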