Commit graph

1017 commits

Author SHA1 Message Date
vahidrezanezhad
8e2cdad1be extracting images only - avoid artifacts with heuristics 2024-08-07 23:22:27 +02:00
vahidrezanezhad
00bf2b64d0 1&2 column images only printspace 2024-08-07 19:07:54 +02:00
vahidrezanezhad
be144db9f8 updating 1&2 columns images + full layout 2024-08-07 18:13:10 +02:00
vahidrezanezhad
a62ae370c3 new full layout model and early layout for 1&2 column images are integrated - light version 2024-08-07 02:21:01 +02:00
vahidrezanezhad
9170a9f21c only images extraction - update inference parameters 2024-08-06 16:11:32 +02:00
vahidrezanezhad
6fb28d6ce8 erosion rate changed 2024-08-01 14:30:51 +02:00
cneud
f0e7f75499 Update README.md 2024-08-01 00:30:25 +02:00
cneud
7ded54a8d2 rename GH action 2024-08-01 00:25:31 +02:00
cneud
c9f63826c0 create draft pyproject.toml 2024-08-01 00:13:42 +02:00
cneud
8862df9156 format options table 2024-07-31 22:53:36 +02:00
cneud
38698c6609 Update README.md 2024-07-31 21:16:02 +02:00
cneud
40f5408b1e improve huggingface url 2024-07-31 20:02:56 +02:00
cneud
3cfa447e84 remove CircleCI 2024-07-31 20:01:36 +02:00
cneud
ad133e3425 Update model download url 2024-07-31 19:49:43 +02:00
vahidrezanezhad
381976099f inference updated 2024-07-24 18:00:39 +02:00
vahidrezanezhad
2c822dae4e erosion and dilation parameters are changed & separators are written in label images after artificial label 2024-07-24 16:52:05 +02:00
b-vr103
840d7c2283 increasing margin in the case of pixelwise inference 2024-07-23 11:29:05 +02:00
b-vr103
861f0b1ebd brightness augmentation modified 2024-07-17 18:20:24 +02:00
vahidrezanezhad
453d0fbf92 adding degrading and brightness augmentation to no patches case training 2024-07-17 17:14:20 +02:00
vahidrezanezhad
5144668834 ocr engine first integration 2024-07-17 10:01:37 +02:00
vahidrezanezhad
3bceec9c19 printspace_as_class_in_layout is integrated. Printspace can be defined as a class for layout segmentation 2024-07-16 18:29:27 +02:00
vahidrezanezhad
9260d2962a resolving typo 2024-07-09 03:04:29 +02:00
vahidrezanezhad
fe69b9c4a8 update inference 2024-06-21 23:42:25 +02:00
vahidrezanezhad
b3cd01de37 update reading order machine based 2024-06-21 13:06:26 +02:00
vahidrezanezhad
66022cf771 update config 2024-06-12 17:40:40 +02:00
vahidrezanezhad
22d7359db2 Transformer+CNN structure is added to vision transformer type 2024-06-12 17:39:57 +02:00
vahidrezanezhad
95faf1a4c8 transformer patch size is dynamic now. 2024-06-12 13:26:27 +02:00
vahidrezanezhad
29da23da76 binarization as a separate task of segmentation 2024-06-11 17:48:30 +02:00
vahidrezanezhad
1921e6754f updating train.py nontransformer backend 2024-06-10 22:15:30 +02:00
vahidrezanezhad
cc91e4b12c updating train.py 2024-06-07 16:24:31 +02:00
vahidrezanezhad
4c376289e9 just defined graphic region types can be extracted as label 2024-06-06 18:55:22 +02:00
vahidrezanezhad
0e4dd0b9ef just defined textregion types can be extracted as label 2024-06-06 18:47:30 +02:00
vahidrezanezhad
5a5914e06c just defined textregion types can be extracted as label 2024-06-06 18:45:47 +02:00
vahidrezanezhad
742e3c2aa2 Update README.md 2024-06-06 14:46:06 +02:00
vahidrezanezhad
13ebe71d13 replacement in a list done correctly 2024-06-06 14:38:29 +02:00
vahidrezanezhad
3ef0dbdd42 scaling and cropping of labels and org images 2024-05-30 16:59:50 +02:00
vahidrezanezhad
47a1646451 modifying xml parsing 2024-05-30 12:56:56 +02:00
vahidrezanezhad
09789619a8 min_area size of regions considered for reading order detection passed as an argument for inference 2024-05-29 13:07:06 +02:00
vahidrezanezhad
06ed006193 reading order detection on xml with layout + result will be written in an output directory with the same file name 2024-05-29 11:18:35 +02:00
vahidrezanezhad
4fb45a6711 inference for reading order 2024-05-28 16:48:51 +02:00
vahidrezanezhad
cc7577d2c1 min area size of text region passes as an argument for machine based reading order 2024-05-28 10:14:16 +02:00
vahidrezanezhad
467bbb2884 pass degrading scales for image enhancement as a json file 2024-05-28 10:01:17 +02:00
vahidrezanezhad
ccf520d3c7 adding rest_as_paragraph and rest_as_graphic to elements 2024-05-27 17:23:49 +02:00
vahidrezanezhad
721d3f70a0 Merge pull request #127 from bertsky/new-namespace-pkg (non-legacy namespace package) 2024-05-24 16:47:48 +02:00
vahidrezanezhad
9638098ae7 machine based reading order training is integrated 2024-05-24 16:39:48 +02:00
Robert Sachunsky
45bd76f5e8 fix namespace pkg setup 2024-05-24 14:27:56 +00:00
vahidrezanezhad
d346b317fb machine based reading order training dataset generator is added 2024-05-24 14:42:58 +02:00
Robert Sachunsky
f88ee99f3c non-legacy namespace package 2024-05-23 21:17:38 +02:00
vahidrezanezhad
61487bf782 use case printspace is added 2024-05-23 17:36:23 +02:00
vahidrezanezhad
a83d53c27d use cases like textline, word and glyph are added 2024-05-23 17:14:31 +02:00