Commit graph

123 commits

Author SHA1 Message Date
Clemens Neudecker
d6ccb83bf5
Merge pull request #24 from johnlockejrr/unifying-training-models
Unifying training models
2025-06-03 09:00:56 +02:00
vahidrezanezhad
834cf19cb9 layout visualization updated 2025-06-02 18:23:34 +02:00
vahidrezanezhad
13c6d28aaa visualizing ro for single xml file 2025-05-23 18:30:51 +02:00
vahidrezanezhad
110e077ee2 updating inference for mb reading order 2025-05-17 02:18:27 +02:00
johnlockejrr
1bf801985b
Update gt_gen_utils.py
Keep safely the full basename without extension
2025-05-14 03:34:51 -07:00
vahidrezanezhad
3de24243fc visuliazation layout from eynollah page-xml output 2025-05-12 22:39:47 +02:00
vahidrezanezhad
fc75770b73 visulizing textline detection from eynollah page-xml output 2025-05-12 18:31:40 +02:00
johnlockejrr
102b04c84d
Update utils.py
Changed unsafe basename extraction:
`file_name = i.split('.')[0]` to `file_name = os.path.splitext(i)[0]`
and
`filename = n[i].split('.')[0]` to `filename = os.path.splitext(n[i])[0]`
because
`"Vat.sam.2_206.jpg` -> `Vat` instead of `"Vat.sam.2_206`
2025-05-11 06:09:17 -07:00
johnlockejrr
be57f137d7
Update utils.py 2025-05-11 05:31:34 -07:00
vahidrezanezhad
d554d26739 resolving issued with saving model by steps 2025-05-05 16:13:38 +02:00
vahidrezanezhad
1454bc4f58 saving model by steps is added to reading order and pixel wise segmentation use cases training 2025-05-05 15:39:05 +02:00
vahidrezanezhad
8ae42b7c6e adding min_early parameter for generating training dataset for machine based reading order model 2025-05-03 19:25:32 +02:00
Clemens Neudecker
d57de478eb
Merge pull request #18 from johnlockejrr/unifying-training-models
Deprecations in train.py and check an argument in inference.py
2025-04-17 15:43:28 +02:00
vahidrezanezhad
4cc5aea166 updating:rotation augmentation is provided for machine based reading order 2025-04-17 00:12:30 +02:00
vahidrezanezhad
77ef68a1ec updating:rotation augmentation is provided for machine based reading order 2025-04-17 00:05:59 +02:00
vahidrezanezhad
22f82c5448 rotation augmentation is provided for machine based reading order 2025-04-16 23:36:41 +02:00
vahidrezanezhad
872e5b0b3a visualising reaidng order- Overlaying on image is provided 2025-03-17 20:09:48 +01:00
vahidrezanezhad
822d9292de this enables to visualize reading order of textregions provided in page-xml files 2025-03-14 17:20:33 +01:00
vahidrezanezhad
bd4160408e artificial_class_label for table region 2024-11-18 16:34:53 +01:00
vahidrezanezhad
98868e4a9e update resizing in inference 2024-11-14 16:26:19 +01:00
vahidrezanezhad
3ef865e0b5 addinh shifting augmentation 2024-10-29 17:06:22 +01:00
vahidrezanezhad
4150675621 early_erosion is added 2024-10-25 14:01:39 +02:00
johnlockejrr
451188c3b9
Changed deprecated lr to learning_rate and model.fit_generator to model.fit 2024-10-19 13:25:50 -07:00
johnlockejrr
df4a47ae6f
Update inference.py to check if save_layout was passed as argument otherwise can give an cv2 error 2024-10-19 13:21:29 -07:00
vahidrezanezhad
cca4d17823 new augmentations for patchwise training 2024-08-30 15:30:18 +02:00
vahidrezanezhad
5f456cf508 fixing artificial class bug 2024-08-28 17:34:06 +02:00
vahidrezanezhad
c502e67c14 adding foreground rgb to augmentation 2024-08-28 02:09:27 +02:00
vahidrezanezhad
4f0e3efa2b early dilation for textline artificial class 2024-08-28 00:04:19 +02:00
vahidrezanezhad
9904846776 using prepared binarized images in the case of augmentation 2024-08-22 21:58:09 +02:00
vahidrezanezhad
f31219b1c9 scaling, channels shuffling, rgb background and red content added to no patch augmentation 2024-08-21 19:33:23 +02:00
vahidrezanezhad
95bbdf8040 updating augmentations 2024-08-21 16:17:59 +02:00
vahidrezanezhad
7be326d689 augmentation function for red textlines, rgb background and scaling for no patch case 2024-08-21 00:48:30 +02:00
vahidrezanezhad
85dd59f23e update 2024-08-09 13:20:09 +02:00
vahidrezanezhad
f4bad09083 save only layout output. different from overlayed layout on image 2024-08-09 12:46:18 +02:00
Clemens Neudecker
b6bdf942fd
add documentation from wiki as markdown file to the codebase 2024-08-08 16:35:06 +02:00
vahidrezanezhad
59e5892f25 erosion rate changed 2024-08-01 14:30:51 +02:00
vahidrezanezhad
5fbe941f53 inference updated 2024-07-24 18:00:39 +02:00
vahidrezanezhad
30894ddc75 erosion and dilation parameters are changed & separators are written in label images after artificial label 2024-07-24 16:52:05 +02:00
b-vr103
c340fbb721 increasing margin in the case of pixelwise inference 2024-07-23 11:29:05 +02:00
b-vr103
f2692cf8dd brightness augmentation modified 2024-07-17 18:20:24 +02:00
vahidrezanezhad
9521768774 adding degrading and brightness augmentation to no patches case training 2024-07-17 17:14:20 +02:00
vahidrezanezhad
55f3cb9a84 printspace_as_class_in_layout is integrated. Printspace can be defined as a class for layout segmentation 2024-07-16 18:29:27 +02:00
vahidrezanezhad
647a3f8cc4 resolving typo 2024-07-09 03:04:29 +02:00
vahidrezanezhad
c0faecec2c update inference 2024-06-21 23:42:25 +02:00
vahidrezanezhad
033cf6734b update reading order machine based 2024-06-21 13:06:26 +02:00
vahidrezanezhad
9358657a0d update config 2024-06-12 17:40:40 +02:00
vahidrezanezhad
743f2e97d6 Transformer+CNN structure is added to vision transformer type 2024-06-12 17:39:57 +02:00
vahidrezanezhad
f1fd74c7eb transformer patch size is dynamic now. 2024-06-12 13:26:27 +02:00
vahidrezanezhad
2aa216e388 binarization as a separate task of segmentation 2024-06-11 17:48:30 +02:00
vahidrezanezhad
41a0e15e79 updating train.py nontransformer backend 2024-06-10 22:15:30 +02:00