kba
48d1198d24
move Eynollah_ocr to separate module
2025-10-20 19:15:31 +02:00
kba
b90cfdfcc4
adapt tests to -l being top-level option now
2025-10-20 18:56:24 +02:00
kba
a850ef39ea
factor model loading in Eynollah to EynollahModelZoo
2025-10-20 18:34:44 +02:00
Robert Sachunsky
5a0e4c3b0f
find_number_of_columns_in_document: improve splitter rule
...
extend horizontal separators to full img width if they do not overlap
any other regions
(only with regard to the returned `splitter_y` result,
but without changing returned separators mask)
2025-10-20 17:41:50 +02:00
Robert Sachunsky
542d38ab43
find_number_of_columns_in_document: simplify, rename line→seps
2025-10-20 17:41:49 +02:00
Robert Sachunsky
d3d599b010
order_of_regions: add better plotting (but commented out)
2025-10-20 17:41:47 +02:00
Robert Sachunsky
c43a825d1d
order_of_regions: filter out-of-image peaks
2025-10-20 17:41:47 +02:00
Robert Sachunsky
48761c3e12
find_num_col: simplify, add better plotting (but commented out)
2025-10-20 17:41:45 +02:00
Robert Sachunsky
184927fb54
find_num_cols: re-sort peaks when cutting n-best num_col_classifier
2025-10-20 17:41:44 +02:00
Robert Sachunsky
086c1880ac
binarization: add option --overwrite, skip existing outputs
...
(also, simplify `run` and separate `run_single`)
2025-10-20 17:40:52 +02:00
vahidrezanezhad
c8455370a9
updating heuristics and ocr documentation
2025-10-20 15:13:45 +02:00
vahidrezanezhad
3ec5ceb22e
Update flowchart
2025-10-20 14:55:14 +02:00
vahidrezanezhad
9d2dbb8388
updating model-based reading order detection
2025-10-20 14:47:55 +02:00
cneud
496a0e2ca4
readme and documentation updates
2025-10-17 19:19:26 +02:00
cneud
f212ffa22d
remove unnecessary backslash
2025-10-17 18:27:18 +02:00
cneud
9733d575bf
replace list declaration with list literal (faster)
2025-10-17 18:21:49 +02:00
cneud
20a95365c2
remove redundant parentheses
2025-10-17 18:19:00 +02:00
cneud
2a1f892d72
expand keywords and supported Python versions
2025-10-17 18:17:41 +02:00
kba
6c89888166
Refactor CLI for consistent logging and late imports
2025-10-17 17:47:59 +02:00
kba
557fb227f3
training/gt_gen_utils: fix type errors, comment out dead code
2025-10-17 14:21:05 +02:00
kba
af74890b2e
training/inference.py: add typing info, organize imports
2025-10-17 14:07:43 +02:00
kba
3a73ccca2e
training/models.py: make imports explicit
2025-10-17 13:45:44 +02:00
kba
38c028c6b5
📦 v0.6.0
2025-10-17 10:36:30 +02:00
kba
ca8edb35e3
📝 changelog
2025-10-17 10:35:13 +02:00
kba
50e8b2c266
Merge branch 'integrate-training-from-sbb_pixelwise_segmentation'
2025-10-17 10:33:04 +02:00
kba
46d25647f7
📝 changelog
2025-10-17 10:32:15 +02:00
Robert Sachunsky
2ac01ecacc
join_polygons: try to catch rare case of MultiPolygon
2025-10-17 10:31:51 +02:00
kba
2e0fb64dcb
disable ruff check for training code for now
2025-10-16 21:29:37 +02:00
kba
76c13bcfd7
Merge branch 'integrate-training-from-sbb_pixelwise_segmentation' of https://github.com/qurator-spk/eynollah into integrate-training-from-sbb_pixelwise_segmentation
2025-10-16 20:50:24 +02:00
kba
af5abb77fd
Merge branch 'main' into integrate-training-from-sbb_pixelwise_segmentation
2025-10-16 20:50:16 +02:00
kba
d2f0a43088
📝 changelog
2025-10-16 20:46:49 +02:00
Konstantin Baierer
3bd3faef68
Merge pull request #193 from qurator-spk/training-installation
...
Training installation
2025-10-16 20:39:17 +02:00
kba
1e66c85222
Merge branch 'integrate-training-from-sbb_pixelwise_segmentation' into training-installation
2025-10-16 16:18:02 +02:00
kba
bd8c8bfeac
training: pin numpy to <1.24 as well
2025-10-16 16:15:31 +02:00
Robert Sachunsky
948c8c3441
join_polygons: try to catch rare case of MultiPolygon
2025-10-15 16:58:17 +02:00
kba
f485dd4181
📦 v0.6.0rc2
2025-10-14 16:10:50 +02:00
kba
c1f0158806
📝 changelog
2025-10-14 14:53:15 +02:00
kba
7daa0a1bd5
Merge branch 'fix-196' into prepare-v0.6.0rc2
2025-10-14 14:52:36 +02:00
kba
2febf53479
📝 changelog
2025-10-14 14:52:31 +02:00
Robert Sachunsky
8299e7009a
setup_models: avoid unnecessarily loading region_fl
2025-10-14 14:27:32 +02:00
Robert Sachunsky
e8b7212f36
polygon2contour: avoid uint for coords
...
(introduced in a433c736 to make consistent with
`filter_contours_area_of_image`, but actually
np.uint is prone to create overflows downstream)
2025-10-14 14:27:26 +02:00
kba
745cf3be48
XML encoding should be utf-8 not utf8
...
... and should use OCR-D's generateDS PAGE API consistently
2025-10-10 16:39:17 +02:00
kba
2056a8bdb9
📦 v0.6.0rc1
2025-10-10 16:32:47 +02:00
Robert Sachunsky
4e9a1618c3
layout: refactor model setup, allow loading custom versions
...
- simplify definition of (defaults for) model versions
- unify loading of loadable models (depending on mode)
- use `self.models` dict instead of `self.model_*` attributes
- add `model_versions` kwarg / `--model_version` CLI option
2025-10-10 03:18:09 +02:00
Robert Sachunsky
374818de11
📝 update changelog for 5725e4f
2025-10-09 23:11:05 +02:00
Robert Sachunsky
c4cb16c2a8
simplify
...
(`skip_layout_and_reading_order` is already an attr)
2025-10-09 23:05:50 +02:00
Robert Sachunsky
ecb53056f2
Merge branch 'main' of https://github.com/qurator-spk/eynollah into loky-with-shm-for-175-rebuilt
2025-10-09 22:54:11 +02:00
Robert Sachunsky
d96af425a7
Merge pull request #4 from bertsky/loky-with-shm-for-175-rebuilt-refactored
...
refactoring for 192: speedup and improvements
2025-10-09 22:18:53 +02:00
Robert Sachunsky
cab392601e
📝 update changelog
2025-10-09 20:14:11 +02:00
Robert Sachunsky
e1b56d97da
CI: lint with ruff
2025-10-09 20:14:11 +02:00