Commit graph

28 commits

Author SHA1 Message Date
vahidrezanezhad
263da755ef loading xmls with UTF-8 encoding 2025-08-07 10:32:49 +02:00
vahidrezanezhad
6462ea5b33 adding visualization of ocr text of xml file 2025-08-06 22:33:42 +02:00
Clemens Neudecker
0e7de52f5e Merge pull request #24 from johnlockejrr/unifying-training-models
Unifying training models
2025-06-03 09:00:56 +02:00
vahidrezanezhad
eb91000490 layout visualization updated 2025-06-02 18:23:34 +02:00
vahidrezanezhad
f9390c71e7 updating inference for mb reading order 2025-05-17 02:18:27 +02:00
johnlockejrr
25abc0fabc Update gt_gen_utils.py
Keep safely the full basename without extension
2025-05-14 03:34:51 -07:00
vahidrezanezhad
4a7728bb34 visuliazation layout from eynollah page-xml output 2025-05-12 22:39:47 +02:00
vahidrezanezhad
4ddc84dee8 visulizing textline detection from eynollah page-xml output 2025-05-12 18:31:40 +02:00
vahidrezanezhad
3b123b039c adding min_early parameter for generating training dataset for machine based reading order model 2025-05-03 19:25:32 +02:00
vahidrezanezhad
363c343b37 visualising reaidng order- Overlaying on image is provided 2025-03-17 20:09:48 +01:00
vahidrezanezhad
e9b860b275 artificial_class_label for table region 2024-11-18 16:34:53 +01:00
vahidrezanezhad
fd14e656aa early_erosion is added 2024-10-25 14:01:39 +02:00
vahidrezanezhad
e3da494470 fixing artificial class bug 2024-08-28 17:34:06 +02:00
vahidrezanezhad
5bbd0980b2 early dilation for textline artificial class 2024-08-28 00:04:19 +02:00
vahidrezanezhad
6fb28d6ce8 erosion rate changed 2024-08-01 14:30:51 +02:00
vahidrezanezhad
2c822dae4e erosion and dilation parameters are changed & separators are written in label images after artificial label 2024-07-24 16:52:05 +02:00
vahidrezanezhad
3bceec9c19 printspace_as_class_in_layout is integrated. Printspace can be defined as a class for layout segmentation 2024-07-16 18:29:27 +02:00
vahidrezanezhad
9260d2962a resolving typo 2024-07-09 03:04:29 +02:00
vahidrezanezhad
4c376289e9 just defined graphic region types can be extracted as label 2024-06-06 18:55:22 +02:00
vahidrezanezhad
5a5914e06c just defined textregion types can be extracted as label 2024-06-06 18:45:47 +02:00
vahidrezanezhad
13ebe71d13 replacement in a list done correctly 2024-06-06 14:38:29 +02:00
vahidrezanezhad
3ef0dbdd42 scaling and cropping of labels and org images 2024-05-30 16:59:50 +02:00
vahidrezanezhad
47a1646451 modifying xml parsing 2024-05-30 12:56:56 +02:00
vahidrezanezhad
09789619a8 min_area size of regions considered for reading order detection passed as an argument for inference 2024-05-29 13:07:06 +02:00
vahidrezanezhad
06ed006193 reading order detection on xml with layout + result will be written in an output directory with the same file name 2024-05-29 11:18:35 +02:00
vahidrezanezhad
4fb45a6711 inference for reading order 2024-05-28 16:48:51 +02:00
vahidrezanezhad
ccf520d3c7 adding rest_as_paragraph and rest_as_graphic to elements 2024-05-27 17:23:49 +02:00
vahidrezanezhad
d346b317fb machine based reading order training dataset generator is added 2024-05-24 14:42:58 +02:00