mirror of https://github.com/qurator-spk/sbb_pixelwise_segmentation.git synced 2026-07-16 08:39:19 +02:00

No description

Find a file

b-vr103 3ac99b4e81 Merge commit '`c5e1e2dda7`'		2019-12-10 11:59:24 +01:00
.gitkeep	code to produce models	2019-12-05 12:01:54 +01:00
__init__.py	add files needed for training	2019-12-05 12:10:55 +01:00
config_params.json	Update config_params.json	2019-12-05 14:05:55 +01:00
metrics.py	add files needed for training	2019-12-05 12:10:55 +01:00
models.py	add files needed for training	2019-12-05 12:10:55 +01:00
README.md	Update README.md	2019-12-10 11:58:02 +01:00
train.py	add files needed for training	2019-12-05 14:05:07 +01:00
utils.py	add files needed for training	2019-12-05 14:05:07 +01:00

README.md

Train

just run: python train.py with config_params.json

Ground truth format

Lables for each pixel is identified by a number . So if you have a
binary case n_classes should be set to 2 and 
labels should be 0 and 1 for each class and pixel.
In the case of multiclass just set n_classes to the number of classes 
you have and the try to produce the labels
by pixels set from 0 , 1 ,2 .., n_classes-1.
The labels format should be png. 

If you have an image label for binary case it should look like this:

Label: [ [[1 0 0 1], [1 0 0 1] ,[1 0 0 1]], 
[[1 0 0 1], [1 0 0 1] ,[1 0 0 1]] ,
[[1 0 0 1], [1 0 0 1] ,[1 0 0 1]] ] 
This means that you have an image by 3*4*3 and pixel[0,0] belongs
to class 1 and pixel[0,1] to class 0.

Training , evaluation and output

train and evaluation folder should have subfolder of images and labels.
And output folder should be empty folder which the output model will be written there.

Patches

if you want to train your model with patches, the height and width of
patches should be defined and also number of 
batchs (how many patches should be seen by model by each iteration).
In the case that model should see the image once, like page extraction,
the patches should be set to false.

Pretrained encoder

Download weights from this link and add it to pretrained_model folder. https://file.spk-berlin.de:8443/pretrained_encoder/