Commit graph

674 commits

Author SHA1 Message Date
vahidrezanezhad
438df52287 updating 2024-10-30 00:52:09 +01:00
vahidrezanezhad
90ee2d61dc textline segmentation is masked with drop capitals 2024-10-28 20:56:06 +01:00
vahidrezanezhad
5037e9896d Merge branch 'machine_based_reading_order_integration' of https://github.com/qurator-spk/eynollah into machine_based_reading_order_integration 2024-10-25 19:47:20 +02:00
vahidrezanezhad
82281bd6cf fixing a bug occurring with reading order + Slro option with no patch textline model and thresholding artificial class 2024-10-25 19:42:48 +02:00
vahidrezanezhad
328d33e3dc Temporary commit – textline prediction without patches 2024-10-23 16:55:41 +02:00
vahidrezanezhad
70772d4104 binarization as a standalone command 2024-10-21 23:46:38 +02:00
vahidrezanezhad
f93fa12441 doing more multiprocessing in order to make the process faster 2024-10-18 09:14:42 +02:00
vahidrezanezhad
3ef4eac24c textlines of textregions are extracted in a faster way + early layout for all documents is done with no patches model and on rgb input 2024-10-17 19:12:28 +02:00
Clemens Neudecker
bc9dddd2c0 Update README.md 2024-10-16 14:21:48 +02:00
Clemens Neudecker
21893910b8 relax tf2 requirement to < 2.13 2024-10-16 14:20:53 +02:00
vahidrezanezhad
1da4b7f589 updating light version 2024-10-07 10:55:10 +02:00
vahidrezanezhad
543ed4bc38 -light version needs -tll to be enabled, otherwise the process will be ended. 2024-10-02 14:09:13 +02:00
Clemens Neudecker
51f6ef63f5 Merge pull request #137 from qurator-spk/dockerfile
Add Dockerfile and make docker target
2024-10-01 17:08:22 +02:00
kba
b13759fdcf ci: smoke-test make docker 2024-10-01 15:38:39 +02:00
kba
c487be2a1d dockerfile: use src-layout 2024-10-01 15:38:01 +02:00
kba
7eb1390583 Merge branch 'main' into dockerfile 2024-10-01 15:25:56 +02:00
vahidrezanezhad
ab63d5ba40 updating light version features 2024-09-30 21:28:39 +02:00
vahidrezanezhad
1774076f4a updating light version. Remove textlines or textregion contours inside a bigger one 2024-09-30 16:10:29 +02:00
vahidrezanezhad
ad32316217 updating light version 2024-09-27 20:59:01 +02:00
vahidrezanezhad
133091137d dilation of textregions and marginals are accomplished 2024-09-27 13:57:01 +02:00
vahidrezanezhad
95effe54a0 updating textregions dilation 2024-09-25 20:00:53 +02:00
vahidrezanezhad
b33739adee parametrization in the case of textline contours dilation is accomplished 2024-09-24 16:06:27 +02:00
vahidrezanezhad
6626dc6866 updating textline dilation parameters 2024-09-23 15:50:37 +02:00
vahidrezanezhad
62f8ae4860 updating dilation of textlines and text regions 2024-09-23 14:03:07 +02:00
vahidrezanezhad
7f08458436 dilation of text regions without opencv 2024-09-21 14:39:54 +02:00
vahidrezanezhad
5d680136a4 updating light version 2024-09-21 01:04:28 +02:00
vahidrezanezhad
b9e8959c4a update of light versions 2024-09-20 16:33:13 +02:00
vahidrezanezhad
2d18739d9b postprocessing of textline contour dilation + skip layout and reading order passed as an argument 2024-09-20 15:08:09 +02:00
Clemens Neudecker
4af0bc079c Merge pull request #132 from qurator-spk/extracting_images_only
Extracting images only
2024-09-20 09:35:40 +02:00
vahidrezanezhad
5a07cd9cfa the most effective version of contours dilation without opencv and all at once 2024-09-19 16:21:55 +02:00
michalbubula
d168edfd77 Update cli.py to block other processing in the case of extract_image_only 2024-09-19 15:20:37 +02:00
michalbubula
723f27bec4 Add -eoi option to README.md 2024-09-19 14:41:17 +02:00
vahidrezanezhad
74a0699f6b extracting images only now works for a single image input 2024-09-19 11:20:13 +02:00
vahidrezanezhad
a1f1f98de3 updating scaling contours 2024-09-18 00:08:54 +02:00
Clemens Neudecker
327b446a16 update Makefile with v0.3.1 models 2024-09-17 21:39:17 +02:00
Clemens Neudecker
351e9a897a update ocrd-tool.json with v0.3.1 models 2024-09-17 21:32:23 +02:00
vahidrezanezhad
21380fc870 scaling contours without dilation 2024-09-17 15:06:41 +02:00
kba
478edc804a Add Dockerfile and make docker target 2024-09-16 18:21:14 +02:00
vahidrezanezhad
1b18ae874b passing number of columns as an argument 2024-09-13 00:52:06 +02:00
vahidrezanezhad
2c93904985 avoiding double binarization 2024-09-12 17:35:28 +02:00
vahidrezanezhad
f0b49073b7 adding option for textline detection in printspace 2024-09-03 23:10:38 +02:00
Clemens Neudecker
c156a1612e Exclude run_image_extraction_over_ppn_lists.py from merge 2024-09-03 20:03:44 +02:00
vahidrezanezhad
6b2e5d110e all tests are passed 2024-09-03 13:55:55 +02:00
vahidrezanezhad
c3a4a1bba7 resolving issue #110 in a better way 2024-09-03 13:14:10 +02:00
cneud
b6d3d2bdbf fix indentation 2024-09-02 20:11:42 +02:00
cneud
de32d86fb6 Merge branch 'refs/heads/main' into extracting_images_only
# Conflicts:
#	src/eynollah/eynollah.py
2024-09-02 19:55:33 +02:00
vahidrezanezhad
0f87974b0c writing drop capitals in xml output + and may resolve issue #110 2024-09-02 16:21:07 +02:00
kba
c6e0e058d0 Merge branch 'main' into v3-api
# Conflicts:
#	pyproject.toml
#	src/eynollah/cli.py
2024-09-02 14:53:37 +02:00
kba
fdedae2406 require ocrd>=3.0.0b4 2024-09-02 11:47:57 +02:00
kba
f9c2d85dd7 Merge branch 'main' into v3-api 2024-09-02 11:46:56 +02:00