You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
sbb_pixelwise_segmentation/README.md

51 lines
2.1 KiB
Markdown

5 years ago
# Pixelwise Segmentation
> Pixelwise segmentation for document images
## Introduction
This repository contains the source code for training an encoder model for document image segmentation.
## Installation
Either clone the repository via `git clone https://github.com/qurator-spk/sbb_pixelwise_segmentation.git` or download and unpack the [ZIP](https://github.com/qurator-spk/sbb_pixelwise_segmentation/archive/master.zip).
5 years ago
### Pretrained encoder
Download our pretrained weights and add them to a ``pretrained_model`` folder:
5 years ago
https://qurator-data.de/sbb_pixelwise_segmentation/pretrained_encoder/
5 years ago
## Usage
### Train
To train a model, run: ``python train.py with config_params.json``
### Ground truth format
Lables for each pixel are identified by a number. So if you have a
binary case, ``n_classes`` should be set to ``2`` and labels should
be ``0`` and ``1`` for each class and pixel.
In the case of multiclass, just set ``n_classes`` to the number of classes
you have and the try to produce the labels by pixels set from ``0 , 1 ,2 .., n_classes-1``.
The labels format should be png.
3 years ago
Our lables are 3 channel png images but only information of first channel is used.
If you have an image label with height and width of 10, for a binary case the first channel should look like this:
5 years ago
3 years ago
Label: [ [1, 0, 0, 1, 1, 0, 0, 1, 0, 0],
[0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
...,
[0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
[0, 0, 0, 0, 0, 0, 0, 0, 0, 0] ]
5 years ago
3 years ago
This means that you have an image by `10*10*3` and `pixel[0,0]` belongs
5 years ago
to class `1` and `pixel[0,1]` belongs to class `0`.
5 years ago
5 years ago
### Training , evaluation and output
The train and evaluation folders should contain subfolders of images and labels.
The output folder should be an empty folder where the output model will be written to.
5 years ago
5 years ago
### Patches
5 years ago
If you want to train your model with patches, the height and width of
the patches should be defined and also the number of batches (how many patches
should be seen by the model in each iteration).
In the case that the model should see the image once, like page extraction,
patches should be set to ``false``.
5 years ago