vahidrezanezhad
|
6462ea5b33
|
adding visualization of ocr text of xml file
|
2025-08-06 22:33:42 +02:00 |
|
vahidrezanezhad
|
1b222594d6
|
Update README.md: how to train model using docker image
|
2025-06-25 18:33:55 +02:00 |
|
vahidrezanezhad
|
f5a1d1a255
|
docker file to train model with desired cuda and cudnn
|
2025-06-25 18:24:16 +02:00 |
|
Clemens Neudecker
|
0e7de52f5e
|
Merge pull request #24 from johnlockejrr/unifying-training-models
Unifying training models
|
2025-06-03 09:00:56 +02:00 |
|
vahidrezanezhad
|
eb91000490
|
layout visualization updated
|
2025-06-02 18:23:34 +02:00 |
|
vahidrezanezhad
|
25e3a2a99f
|
visualizing ro for single xml file
|
2025-05-23 18:30:51 +02:00 |
|
vahidrezanezhad
|
f9390c71e7
|
updating inference for mb reading order
|
2025-05-17 02:18:27 +02:00 |
|
johnlockejrr
|
25abc0fabc
|
Update gt_gen_utils.py
Keep safely the full basename without extension
|
2025-05-14 03:34:51 -07:00 |
|
vahidrezanezhad
|
4a7728bb34
|
visuliazation layout from eynollah page-xml output
|
2025-05-12 22:39:47 +02:00 |
|
vahidrezanezhad
|
4ddc84dee8
|
visulizing textline detection from eynollah page-xml output
|
2025-05-12 18:31:40 +02:00 |
|
johnlockejrr
|
3a9fc0efde
|
Update utils.py
Changed unsafe basename extraction:
`file_name = i.split('.')[0]` to `file_name = os.path.splitext(i)[0]`
and
`filename = n[i].split('.')[0]` to `filename = os.path.splitext(n[i])[0]`
because
`"Vat.sam.2_206.jpg` -> `Vat` instead of `"Vat.sam.2_206`
|
2025-05-11 06:09:17 -07:00 |
|
johnlockejrr
|
6fa766d6a5
|
Update utils.py
|
2025-05-11 05:31:34 -07:00 |
|
vahidrezanezhad
|
92954b1b7b
|
resolving issued with saving model by steps
|
2025-05-05 16:13:38 +02:00 |
|
vahidrezanezhad
|
5694d971c5
|
saving model by steps is added to reading order and pixel wise segmentation use cases training
|
2025-05-05 15:39:05 +02:00 |
|
vahidrezanezhad
|
3b123b039c
|
adding min_early parameter for generating training dataset for machine based reading order model
|
2025-05-03 19:25:32 +02:00 |
|
Clemens Neudecker
|
44d02687c6
|
Merge pull request #18 from johnlockejrr/unifying-training-models
Deprecations in train.py and check an argument in inference.py
|
2025-04-17 15:43:28 +02:00 |
|
vahidrezanezhad
|
4635dd219d
|
updating:rotation augmentation is provided for machine based reading order
|
2025-04-17 00:12:30 +02:00 |
|
vahidrezanezhad
|
dd21a3b33a
|
updating:rotation augmentation is provided for machine based reading order
|
2025-04-17 00:05:59 +02:00 |
|
vahidrezanezhad
|
825b2634f9
|
rotation augmentation is provided for machine based reading order
|
2025-04-16 23:36:41 +02:00 |
|
vahidrezanezhad
|
363c343b37
|
visualising reaidng order- Overlaying on image is provided
|
2025-03-17 20:09:48 +01:00 |
|
vahidrezanezhad
|
90a1b186f7
|
this enables to visualize reading order of textregions provided in page-xml files
|
2025-03-14 17:20:33 +01:00 |
|
vahidrezanezhad
|
e9b860b275
|
artificial_class_label for table region
|
2024-11-18 16:34:53 +01:00 |
|
vahidrezanezhad
|
238ea3bd8e
|
update resizing in inference
|
2024-11-14 16:26:19 +01:00 |
|
vahidrezanezhad
|
7b4d14b19f
|
addinh shifting augmentation
|
2024-10-29 17:06:22 +01:00 |
|
vahidrezanezhad
|
fd14e656aa
|
early_erosion is added
|
2024-10-25 14:01:39 +02:00 |
|
johnlockejrr
|
f09eed1197
|
Changed deprecated lr to learning_rate and model.fit_generator to model.fit
|
2024-10-19 13:25:50 -07:00 |
|
johnlockejrr
|
a524f8b1a7
|
Update inference.py to check if save_layout was passed as argument otherwise can give an cv2 error
|
2024-10-19 13:21:29 -07:00 |
|
vahidrezanezhad
|
3f354e1c34
|
new augmentations for patchwise training
|
2024-08-30 15:30:18 +02:00 |
|
vahidrezanezhad
|
e3da494470
|
fixing artificial class bug
|
2024-08-28 17:34:06 +02:00 |
|
vahidrezanezhad
|
a57a31673d
|
adding foreground rgb to augmentation
|
2024-08-28 02:09:27 +02:00 |
|
vahidrezanezhad
|
5bbd0980b2
|
early dilation for textline artificial class
|
2024-08-28 00:04:19 +02:00 |
|
vahidrezanezhad
|
61cdd2acb8
|
using prepared binarized images in the case of augmentation
|
2024-08-22 21:58:09 +02:00 |
|
vahidrezanezhad
|
aeb2ee4e3e
|
scaling, channels shuffling, rgb background and red content added to no patch augmentation
|
2024-08-21 19:33:23 +02:00 |
|
vahidrezanezhad
|
445c45cb87
|
updating augmentations
|
2024-08-21 16:17:59 +02:00 |
|
vahidrezanezhad
|
5e1821a741
|
augmentation function for red textlines, rgb background and scaling for no patch case
|
2024-08-21 00:48:30 +02:00 |
|
vahidrezanezhad
|
bf5837bf6e
|
update
|
2024-08-09 13:20:09 +02:00 |
|
vahidrezanezhad
|
3b90347a94
|
save only layout output. different from overlayed layout on image
|
2024-08-09 12:46:18 +02:00 |
|
Clemens Neudecker
|
2d83b8faad
|
add documentation from wiki as markdown file to the codebase
|
2024-08-08 16:35:06 +02:00 |
|
vahidrezanezhad
|
6fb28d6ce8
|
erosion rate changed
|
2024-08-01 14:30:51 +02:00 |
|
vahidrezanezhad
|
381976099f
|
inference updated
|
2024-07-24 18:00:39 +02:00 |
|
vahidrezanezhad
|
2c822dae4e
|
erosion and dilation parameters are changed & separators are written in label images after artificial label
|
2024-07-24 16:52:05 +02:00 |
|
b-vr103
|
840d7c2283
|
increasing margin in the case of pixelwise inference
|
2024-07-23 11:29:05 +02:00 |
|
b-vr103
|
861f0b1ebd
|
brightness augmentation modified
|
2024-07-17 18:20:24 +02:00 |
|
vahidrezanezhad
|
453d0fbf92
|
adding degrading and brightness augmentation to no patches case training
|
2024-07-17 17:14:20 +02:00 |
|
vahidrezanezhad
|
3bceec9c19
|
printspace_as_class_in_layout is integrated. Printspace can be defined as a class for layout segmentation
|
2024-07-16 18:29:27 +02:00 |
|
vahidrezanezhad
|
9260d2962a
|
resolving typo
|
2024-07-09 03:04:29 +02:00 |
|
vahidrezanezhad
|
fe69b9c4a8
|
update inference
|
2024-06-21 23:42:25 +02:00 |
|
vahidrezanezhad
|
b3cd01de37
|
update reading order machine based
|
2024-06-21 13:06:26 +02:00 |
|
vahidrezanezhad
|
66022cf771
|
update config
|
2024-06-12 17:40:40 +02:00 |
|
vahidrezanezhad
|
22d7359db2
|
Transformer+CNN structure is added to vision transformer type
|
2024-06-12 17:39:57 +02:00 |
|