Commit graph

137 commits

Author SHA1 Message Date
kba
8a9b4f8f55 remove commented-out requirement for tf == 2.12.1, rely on same version as in eynollah proper 2025-10-02 12:16:26 +02:00
kba
95bb5908bb Merge branch 'integrate-training-from-sbb_pixelwise_segmentation' of https://github.com/qurator-spk/eynollah into integrate-training-from-sbb_pixelwise_segmentation 2025-10-01 18:02:09 +02:00
kba
48266b1ee0 make training dependencies optional-dependencies of eynollah
i.e. `pip install "eynollah[training]"` will install the requirements for training
2025-10-01 18:01:25 +02:00
kba
733af1e9a7 📝 update train/README.md, align with docs/train.md 2025-10-01 17:43:32 +02:00
cneud
91d2a74ac9 remove redundant parentheses 2025-10-01 00:38:01 +02:00
cneud
f2f93e0251 list literal is faster than using list constructor to create a new list 2025-10-01 00:26:27 +02:00
cneud
70af00182b mutable defaults are the source of all evil 2025-10-01 00:20:18 +02:00
cneud
070dafca75 remove duplicate LICENSE 2025-09-29 22:17:27 +02:00
kba
ce02a3553b 🔥 remove obsolete versions of the training document 2025-09-29 15:18:21 +02:00
kba
6d379782ab 📝 align former upstream train.md with wiki train.md syntactically 2025-09-29 15:11:02 +02:00
kba
56c4b7af88 📝 align pre-merge docs/train.md with former upstream train.md syntactically 2025-09-29 14:59:41 +02:00
vahidrezanezhad
a65405bead tables are visulaized within layout 2025-09-22 15:56:14 +02:00
vahidrezanezhad
530897c6c2 renaming argument names 2025-09-19 13:20:26 +02:00
vahidrezanezhad
68a71be8bc Running inference on files in a directory 2025-09-13 22:40:11 +02:00
vahidrezanezhad
cf4983da54 visualize vertical ocr text vertically 2025-08-08 16:12:55 +02:00
vahidrezanezhad
263da755ef loading xmls with UTF-8 encoding 2025-08-07 10:32:49 +02:00
vahidrezanezhad
6462ea5b33 adding visualization of ocr text of xml file 2025-08-06 22:33:42 +02:00
vahidrezanezhad
1b222594d6 Update README.md: how to train model using docker image 2025-06-25 18:33:55 +02:00
vahidrezanezhad
f5a1d1a255 docker file to train model with desired cuda and cudnn 2025-06-25 18:24:16 +02:00
Clemens Neudecker
0e7de52f5e Merge pull request #24 from johnlockejrr/unifying-training-models
Unifying training models
2025-06-03 09:00:56 +02:00
vahidrezanezhad
eb91000490 layout visualization updated 2025-06-02 18:23:34 +02:00
vahidrezanezhad
25e3a2a99f visualizing ro for single xml file 2025-05-23 18:30:51 +02:00
vahidrezanezhad
f9390c71e7 updating inference for mb reading order 2025-05-17 02:18:27 +02:00
johnlockejrr
25abc0fabc Update gt_gen_utils.py
Keep safely the full basename without extension
2025-05-14 03:34:51 -07:00
vahidrezanezhad
4a7728bb34 visuliazation layout from eynollah page-xml output 2025-05-12 22:39:47 +02:00
vahidrezanezhad
4ddc84dee8 visulizing textline detection from eynollah page-xml output 2025-05-12 18:31:40 +02:00
johnlockejrr
3a9fc0efde Update utils.py
Changed unsafe basename extraction:
`file_name = i.split('.')[0]` to `file_name = os.path.splitext(i)[0]`
and
`filename = n[i].split('.')[0]` to `filename = os.path.splitext(n[i])[0]`
because
`"Vat.sam.2_206.jpg` -> `Vat` instead of `"Vat.sam.2_206`
2025-05-11 06:09:17 -07:00
johnlockejrr
6fa766d6a5 Update utils.py 2025-05-11 05:31:34 -07:00
vahidrezanezhad
92954b1b7b resolving issued with saving model by steps 2025-05-05 16:13:38 +02:00
vahidrezanezhad
5694d971c5 saving model by steps is added to reading order and pixel wise segmentation use cases training 2025-05-05 15:39:05 +02:00
vahidrezanezhad
3b123b039c adding min_early parameter for generating training dataset for machine based reading order model 2025-05-03 19:25:32 +02:00
Clemens Neudecker
44d02687c6 Merge pull request #18 from johnlockejrr/unifying-training-models
Deprecations in train.py and check an argument in inference.py
2025-04-17 15:43:28 +02:00
vahidrezanezhad
4635dd219d updating:rotation augmentation is provided for machine based reading order 2025-04-17 00:12:30 +02:00
vahidrezanezhad
dd21a3b33a updating:rotation augmentation is provided for machine based reading order 2025-04-17 00:05:59 +02:00
vahidrezanezhad
825b2634f9 rotation augmentation is provided for machine based reading order 2025-04-16 23:36:41 +02:00
vahidrezanezhad
363c343b37 visualising reaidng order- Overlaying on image is provided 2025-03-17 20:09:48 +01:00
vahidrezanezhad
90a1b186f7 this enables to visualize reading order of textregions provided in page-xml files 2025-03-14 17:20:33 +01:00
vahidrezanezhad
e9b860b275 artificial_class_label for table region 2024-11-18 16:34:53 +01:00
vahidrezanezhad
238ea3bd8e update resizing in inference 2024-11-14 16:26:19 +01:00
vahidrezanezhad
7b4d14b19f addinh shifting augmentation 2024-10-29 17:06:22 +01:00
vahidrezanezhad
fd14e656aa early_erosion is added 2024-10-25 14:01:39 +02:00
johnlockejrr
f09eed1197 Changed deprecated lr to learning_rate and model.fit_generator to model.fit 2024-10-19 13:25:50 -07:00
johnlockejrr
a524f8b1a7 Update inference.py to check if save_layout was passed as argument otherwise can give an cv2 error 2024-10-19 13:21:29 -07:00
vahidrezanezhad
3f354e1c34 new augmentations for patchwise training 2024-08-30 15:30:18 +02:00
vahidrezanezhad
e3da494470 fixing artificial class bug 2024-08-28 17:34:06 +02:00
vahidrezanezhad
a57a31673d adding foreground rgb to augmentation 2024-08-28 02:09:27 +02:00
vahidrezanezhad
5bbd0980b2 early dilation for textline artificial class 2024-08-28 00:04:19 +02:00
vahidrezanezhad
61cdd2acb8 using prepared binarized images in the case of augmentation 2024-08-22 21:58:09 +02:00
vahidrezanezhad
aeb2ee4e3e scaling, channels shuffling, rgb background and red content added to no patch augmentation 2024-08-21 19:33:23 +02:00
vahidrezanezhad
445c45cb87 updating augmentations 2024-08-21 16:17:59 +02:00