vahidrezanezhad
|
5725e4fd1f
|
-Continue processing when num_col is None but textregions exist. -Convert marginal-only to main body if no main body is present. -Reset deskew angle to 0 when text region density (textregion area to page area) < 0.3 and angle > 45°.
|
2025-10-01 15:58:03 +02:00 |
|
Konstantin Baierer
|
a6f0af07d1
|
Merge pull request #185 from bertsky/patch-4
CD: master is now main
|
2025-09-29 10:44:27 +02:00 |
|
Robert Sachunsky
|
92c1e824dc
|
CD: master is now main
|
2025-09-26 23:05:47 +02:00 |
|
kba
|
6ea6a62801
|
📝 v0.5.0
|
2025-09-26 16:23:46 +02:00 |
|
Konstantin Baierer
|
882e242946
|
Merge pull request #178 from qurator-spk/prepare-release-v0.5.0
Prepare release v0.5.0
|
2025-09-26 16:21:09 +02:00 |
|
kba
|
37e64b4e45
|
📝 changelog
|
2025-09-26 16:19:04 +02:00 |
|
kba
|
3123add815
|
📝 update README
|
2025-09-26 15:07:32 +02:00 |
|
kba
|
830cc2c30a
|
comment out the offending test outright
|
2025-09-26 14:37:04 +02:00 |
|
kba
|
eb8d4573a8
|
tests: also disable ...ocr_directory test
|
2025-09-26 13:57:08 +02:00 |
|
kba
|
42fb452a7e
|
disable the -doit OCR test
|
2025-09-26 12:55:29 +02:00 |
|
Robert Sachunsky
|
480daa4c7c
|
test_run: make ocr -doit work (add truetype file)
|
2025-09-25 22:28:15 +02:00 |
|
kba
|
4c6405713a
|
ci: ocr models
|
2025-09-25 22:19:36 +02:00 |
|
kba
|
b4d460ca79
|
makefile forgot the OCR models
|
2025-09-25 22:16:38 +02:00 |
|
kba
|
f3f5426597
|
Merge branch 'adapt-ocrd' of https://github.com/qurator-spk/eynollah into adapt-ocrd
|
2025-09-25 21:47:27 +02:00 |
|
kba
|
0bb1fb1a05
|
tests: adapt to layout/ocr model split
|
2025-09-25 21:47:15 +02:00 |
|
kba
|
2ec773128b
|
Merge branch 'adapt-ocrd' of https://github.com/qurator-spk/eynollah into adapt-ocrd
|
2025-09-25 21:40:48 +02:00 |
|
kba
|
f37d80c188
|
Merge branch 'adapt-ocrd' of https://github.com/qurator-spk/eynollah into adapt-ocrd
|
2025-09-25 21:39:55 +02:00 |
|
kba
|
57ee1cdc72
|
Merge remote-tracking branch 'bertsky/mbro_dead_code-plus-fixes-plus-tests' into adapt-ocrd
|
2025-09-25 21:39:36 +02:00 |
|
kba
|
5c0ab509c4
|
CI: Update model name
|
2025-09-25 21:17:32 +02:00 |
|
kba
|
9303ded11f
|
ocrd-tool.json: use models_layout instead of eynollah_layouts for consistency
|
2025-09-25 21:12:52 +02:00 |
|
Robert Sachunsky
|
7c79902835
|
enhancement/mbreorder: make all path options kwargs to run() instead of attributes
|
2025-09-25 20:51:02 +02:00 |
|
kba
|
e6ee26fde3
|
make models: adapt to zenodo/v0.5.0
|
2025-09-25 20:35:54 +02:00 |
|
kba
|
11de8a025d
|
Adapt ocrd-eynollah-segment for release
|
2025-09-25 20:11:48 +02:00 |
|
kba
|
5e15c4f248
|
Merge remote-tracking branch 'bertsky/mbro_dead_code-plus-fixes-plus-tests' into prepare-release-v0.5.0
|
2025-09-25 20:05:03 +02:00 |
|
Robert Sachunsky
|
5c7e1f21fb
|
test_run: add tests for ocr
|
2025-09-25 19:53:19 +02:00 |
|
Robert Sachunsky
|
2d14d57e4f
|
ocr: minimal debug logging
|
2025-09-25 19:52:50 +02:00 |
|
Robert Sachunsky
|
1dcc7b5795
|
ocr CLI: make --model vs --model_name xor
|
2025-09-25 16:38:43 +02:00 |
|
Robert Sachunsky
|
5b1e0c1327
|
layout/ocr: make all path options kwargs to run() instead of attributes; ocr: drop redundant prediction_with_both_of_rgb_and_bin in favour of just bool(dir_in_bin)
|
2025-09-25 16:26:31 +02:00 |
|
Robert Sachunsky
|
ef1304a764
|
CLIs: reorder options, explain -i vs -di
|
2025-09-25 16:11:39 +02:00 |
|
Robert Sachunsky
|
df5448cdcd
|
CLIs: add required=True where missing
|
2025-09-25 16:08:40 +02:00 |
|
Robert Sachunsky
|
58dd192fad
|
smoke-test: also add enhancement and mbreorder here
|
2025-09-25 16:05:45 +02:00 |
|
b-vr103
|
369ef573f9
|
get textlines sorted in textregions - detection of vertical and horizontal regions improved
|
2025-09-25 12:51:02 +02:00 |
|
Robert Sachunsky
|
f07df080f0
|
add tests for enhancement and mbreorder
|
2025-09-25 01:16:19 +02:00 |
|
Robert Sachunsky
|
9967510327
|
mbreorder: filter by .xml suffix in dir-in mode
|
2025-09-25 01:15:37 +02:00 |
|
Robert Sachunsky
|
b094a6b77f
|
mbreorder: avoid spaces in logger name
|
2025-09-25 01:15:37 +02:00 |
|
Robert Sachunsky
|
d6cdb69acb
|
binarize/enhance/layout/ocr ls_imgs: use the same file name suffix filter for dir-in mode
|
2025-09-25 01:15:37 +02:00 |
|
Robert Sachunsky
|
96a0d22496
|
mbreorder CLI: change options to mimic other commands
|
2025-09-25 01:15:37 +02:00 |
|
Robert Sachunsky
|
93f7588bfa
|
binarizer CLI: add --log-level
|
2025-09-24 23:08:50 +02:00 |
|
Robert Sachunsky
|
8a1e5a8950
|
enhancement / layout CLI: do not override logger name
|
2025-09-24 23:03:11 +02:00 |
|
Robert Sachunsky
|
960b11f51f
|
machine-based-reading-order CLI: no foreign logger, add --log-level
|
2025-09-24 22:58:57 +02:00 |
|
kba
|
45b05c2316
|
Merge branch 'mbro_dead_code' into prepare-release-v0.5.0
|
2025-09-24 17:18:31 +02:00 |
|
vahidrezanezhad
|
80d50d4bf6
|
get textlines sorted in textregion - verticals
|
2025-09-24 17:17:27 +02:00 |
|
b-vr103
|
6d8641a518
|
get textlines sorted in textregion - verticals
|
2025-09-24 17:17:21 +02:00 |
|
vahidrezanezhad
|
6904a98182
|
get textlines inside textregion sorted debugging
|
2025-09-24 17:17:12 +02:00 |
|
vahidrezanezhad
|
ce13d8c5a3
|
get textlines inside textregion sorted
|
2025-09-24 17:16:47 +02:00 |
|
kba
|
8b30bdbae2
|
image_enhancer: use latest page extraction model
|
2025-09-24 16:39:31 +02:00 |
|
kba
|
c8ebe84697
|
image_enhancer: add missing models, remove dead code
|
2025-09-24 16:36:18 +02:00 |
|
kba
|
b75ca0d31f
|
mb_ro_on_layout: remove copy-pasta code not actually used
|
2025-09-24 16:29:05 +02:00 |
|
Konstantin Baierer
|
9c129c7f54
|
Merge pull request #180 from bertsky/prepare-release-v0.5.0-fixlogging
prepare release v0.5.0: fix logging
|
2025-09-24 12:28:10 +02:00 |
|
Robert Sachunsky
|
5bd318e657
|
rm print statement (already log msg)
|
2025-09-24 12:14:32 +02:00 |
|