Commit graph

  • cf5a0bacd2
    Merge 19b2c3fa42 into 38c028c6b5 Robert Sachunsky 2025-10-25 13:36:48 +02:00
  • 19b2c3fa42 reading order: improve handling of headings and horizontal seps Robert Sachunsky 2025-10-24 22:51:19 +02:00
  • 3367462d18 return_boxes_of_images_by_order_of_reading_new: change arg order Robert Sachunsky 2025-10-24 22:46:46 +02:00
  • a2a9fe5117 delete_separator_around: simplify, eynollah: identifiers Robert Sachunsky 2025-10-24 02:35:04 +02:00
  • 3ebbc2d693 return_boxes_of_images_by_order_of_reading_new: indent Robert Sachunsky 2025-10-24 02:30:39 +02:00
  • 66a0e55e49 return_boxes_of_images_by_order_of_reading_new: avoid oversplits Robert Sachunsky 2025-10-24 02:15:13 +02:00
  • 6fbb5f8a12 return_boxes_of_images_by_order_of_reading_new: simplify Robert Sachunsky 2025-10-24 02:02:39 +02:00
  • 6cc5900943 find_num_col: add better plotting (but commented out) Robert Sachunsky 2025-10-24 01:55:07 +02:00
  • 5d15941b35 contours_in_same_horizon: simplify Robert Sachunsky 2025-10-24 01:51:59 +02:00
  • acee4c1bfe find_number_of_columns_in_document: simplify Robert Sachunsky 2025-10-24 01:43:41 +02:00
  • b2a79cc6ed return_x_start_end_mothers_childs_and_type_of_reading_order: fix+1 Robert Sachunsky 2025-10-24 01:31:52 +02:00
  • e2dfec75fb return_x_start_end_mothers_childs_and_type_of_reading_order: simplify and document Robert Sachunsky 2025-10-24 01:19:20 +02:00
  • 0fc4b2535d return_boxes_of_images_by_order_of_reading_new: fix no-mother case Robert Sachunsky 2025-10-20 16:47:35 +02:00
  • 7c3e418588 return_boxes_of_images_by_order_of_reading_new: simplify Robert Sachunsky 2025-10-20 16:13:51 +02:00
  • cd35241e81 find_number_of_columns_in_document: split headings at top+baseline Robert Sachunsky 2025-10-20 13:41:36 +02:00
  • 6192e5ba5c
    qualitative evaluation of ocr models are added to docs updating_docs vahidrezanezhad 2025-10-23 16:37:24 +02:00
  • d0ad7a98b7 starting qualitative ocr evaluation vahidrezanezhad 2025-10-22 22:45:22 +02:00
  • 7b7714af2e completing ocr evaluations metric vahidrezanezhad 2025-10-22 22:42:37 +02:00
  • b56bb44284 providing ocr model evaluation metrics vahidrezanezhad 2025-10-22 21:30:06 +02:00
  • 59eb4fd3be
    images with ro are added to readme vahidrezanezhad 2025-10-22 19:04:01 +02:00
  • ab9ddd5214
    OCR examples are added to README vahidrezanezhad 2025-10-22 18:41:15 +02:00
  • 2fc723d292 extend README vahidrezanezhad 2025-10-22 18:29:14 +02:00
  • 7cf6ae1d7a
    Merge 883546a6b8 into 38c028c6b5 Konstantin Baierer 2025-10-22 15:05:46 +00:00
  • 883546a6b8 eynollah models package model-zoo kba 2025-10-22 16:38:05 +02:00
  • 04bc4a63d0 reorganize model_zoo kba 2025-10-22 16:04:48 +02:00
  • d94285b3ea rewrite model spec data structure kba 2025-10-22 13:07:35 +02:00
  • 146658f026 eynollah layout: fix trocr_processor model_zoo call kba 2025-10-22 10:47:09 +02:00
  • 4c8abfe19c eynollah_ocr: actually replace the model calls kba 2025-10-22 10:40:49 +02:00
  • 1337461d47 adopt image_enhancer to the zoo kba 2025-10-21 19:24:55 +02:00
  • f0c86672f8 adopt mb_ro_on_layout to the zoo kba 2025-10-21 17:55:08 +02:00
  • bcffa2e503 adopt binarizer to the zoo kba 2025-10-21 17:53:24 +02:00
  • de34a15809 Makefile: fix make models for OCR kba 2025-10-21 17:27:16 +02:00
  • 9d2b18d2af test_run: check log messages starting with eynollah kba 2025-10-21 13:29:55 +02:00
  • a53d5fc452 update docs/makefile to point to v0.6.0 models kba 2025-10-21 13:15:57 +02:00
  • c6b863b13f typing and asserts kba 2025-10-21 12:05:27 +02:00
  • 44b75eb36f cli: model -> model_basedir kba 2025-10-21 10:48:48 +02:00
  • 062f317d2e Introduce model_zoo to Eynollah_ocr kba 2025-10-20 21:14:52 +02:00
  • d609a532bf organize imports mostly kba 2025-10-20 19:46:07 +02:00
  • 48d1198d24 move Eynollah_ocr to separate module kba 2025-10-20 19:15:31 +02:00
  • cca747519d
    Merge b90cfdfcc4 into 38c028c6b5 Konstantin Baierer 2025-10-20 16:56:29 +00:00
  • b90cfdfcc4 adapt tests to -l being top-level option now cli-logging kba 2025-10-20 18:56:24 +02:00
  • a850ef39ea factor model loading in Eynollah to EynollahModelZoo kba 2025-10-20 18:34:44 +02:00
  • 5a0e4c3b0f find_number_of_columns_in_document: improve splitter rule Robert Sachunsky 2025-10-20 13:36:10 +02:00
  • 542d38ab43 find_number_of_columns_in_document: simplify, rename lineseps Robert Sachunsky 2025-10-20 13:34:56 +02:00
  • d3d599b010 order_of_regions: add better plotting (but commented out) Robert Sachunsky 2025-10-20 13:27:23 +02:00
  • c43a825d1d order_of_regions: filter out-of-image peaks Robert Sachunsky 2025-10-20 13:26:01 +02:00
  • 48761c3e12 find_num_col: simplify, add better plotting (but commented out) Robert Sachunsky 2025-10-20 13:20:12 +02:00
  • 184927fb54 find_num_cols: re-sort peaks when cutting n-best num_col_classifier Robert Sachunsky 2025-10-20 13:16:57 +02:00
  • 086c1880ac binarization: add option --overwrite, skip existing outputs Robert Sachunsky 2025-10-15 12:24:21 +02:00
  • c8455370a9 updating heuristics and ocr documentation vahidrezanezhad 2025-10-20 15:13:45 +02:00
  • 3ec5ceb22e
    Update flowchart vahidrezanezhad 2025-10-20 14:55:14 +02:00
  • 9d2dbb8388 updating model based reading orde detection vahidrezanezhad 2025-10-20 14:47:55 +02:00
  • 6c89888166 Refactor CLI for consistent logging and late imports kba 2025-10-17 17:47:59 +02:00
  • 0aebf3a24d
    Merge 557fb227f3 into 38c028c6b5 Konstantin Baierer 2025-10-17 14:22:17 +02:00
  • 557fb227f3 training/gt_gen_utils: fix type errors, comment out dead code ruff-training kba 2025-10-17 14:21:05 +02:00
  • af74890b2e training/inference.py: add typing info, organize imports kba 2025-10-17 14:07:43 +02:00
  • 3a73ccca2e training/models.py: make imports explicit kba 2025-10-17 13:45:14 +02:00
  • 38c028c6b5 📦 v0.6.0 main v0.6.0 kba 2025-10-17 10:36:30 +02:00
  • ca8edb35e3 📝 changelog kba 2025-10-17 10:35:13 +02:00
  • 50e8b2c266 Merge branch 'integrate-training-from-sbb_pixelwise_segmentation' kba 2025-10-17 10:33:04 +02:00
  • 46d25647f7 📝 changelog kba 2025-10-16 20:46:03 +02:00
  • 2ac01ecacc join_polygons: try to catch rare case of MultiPolygon Robert Sachunsky 2025-10-15 16:58:17 +02:00
  • 479ffcc91e
    Merge ad53ea3ae1 into 2e0fb64dcb Konstantin Baierer 2025-10-16 23:13:55 +02:00
  • 56402aeba2
    Merge 2e0c1868e0 into 2e0fb64dcb Konstantin Baierer 2025-10-16 22:34:01 +02:00
  • 2e0fb64dcb disable ruff check for training code for now integrate-training-from-sbb_pixelwise_segmentation kba 2025-10-16 21:29:37 +02:00
  • 76c13bcfd7 Merge branch 'integrate-training-from-sbb_pixelwise_segmentation' of https://github.com/qurator-spk/eynollah into integrate-training-from-sbb_pixelwise_segmentation kba 2025-10-16 20:50:24 +02:00
  • af5abb77fd Merge branch 'main' into integrate-training-from-sbb_pixelwise_segmentation kba 2025-10-16 20:50:16 +02:00
  • d2f0a43088 📝 changelog kba 2025-10-16 20:46:03 +02:00
  • 3bd3faef68
    Merge pull request #193 from qurator-spk/training-installation Konstantin Baierer 2025-10-16 20:39:17 +02:00
  • 2e0c1868e0 move models.py back to src/.../training old-pr-16 kba 2025-10-16 20:36:16 +02:00
  • b67a3c4ed4 tf.keras version that allows any input resolution H.T. Kruitbosch 2024-01-11 19:04:42 +01:00
  • 662aa67dfb move models.py to root to cherry-pick 3098700 kba 2025-10-16 20:31:48 +02:00
  • ad53ea3ae1 move train.py back ReduceLROnPlateau kba 2025-10-16 20:20:41 +02:00
  • 54132a499a Merge remote-tracking branch 'pixelwise_local/ReduceLROnPlateau' into ReduceLROnPlateau kba 2025-10-16 20:20:06 +02:00
  • 30fe51f3ae move src/.../train.py to root to accomodate old PR kba 2025-10-16 20:05:00 +02:00
  • 1e66c85222 Merge branch 'integrate-training-from-sbb_pixelwise_segmentation' into training-installation kba 2025-10-16 16:18:02 +02:00
  • bd8c8bfeac training: pin numpy to <1.24 as well kba 2025-10-16 16:15:31 +02:00
  • 948c8c3441 join_polygons: try to catch rare case of MultiPolygon Robert Sachunsky 2025-10-15 16:58:17 +02:00
  • f485dd4181 📦 v0.6.0rc2 v0.6.0rc2 kba 2025-10-14 16:10:50 +02:00
  • c1f0158806 📝 changelog kba 2025-10-14 14:53:15 +02:00
  • 7daa0a1bd5 Merge branch 'fix-196' into prepare-v0.6.0rc2 kba 2025-10-14 14:52:36 +02:00
  • 2febf53479 📝 changelog kba 2025-10-14 14:52:31 +02:00
  • 8299e7009a setup_models: avoid unnecessarily loading region_fl Robert Sachunsky 2025-10-14 14:23:29 +02:00
  • e8b7212f36 polygon2contour: avoid uint for coords Robert Sachunsky 2025-10-14 14:16:39 +02:00
  • d84272bc4c
    Merge 42a3cc2335 into 2056a8bdb9 Robert Sachunsky 2025-10-13 15:45:26 +02:00
  • 745cf3be48 XML encoding should be utf-8 not utf8 fix-196 kba 2025-10-10 16:39:16 +02:00
  • 2056a8bdb9 📦 v0.6.0rc1 v0.6.0rc1 kba 2025-10-10 16:32:47 +02:00
  • 34f5996194 makefile: update models prepare-v0.6.0 kba 2025-10-10 13:02:14 +02:00
  • 09195aeee9 Merge remote-tracking branch 'bertsky/loky-with-shm-for-175-rebuilt' into prepare-v0.6.0 kba 2025-10-10 12:49:14 +02:00
  • 4e9a1618c3 layout: refactor model setup, allow loading custom versions Robert Sachunsky 2025-10-10 03:18:09 +02:00
  • 374818de11 📝 update changelog for 5725e4f Robert Sachunsky 2025-10-09 23:11:05 +02:00
  • c4cb16c2a8 simplify Robert Sachunsky 2025-10-09 23:05:50 +02:00
  • ecb53056f2 Merge branch 'main' of https://github.com/qurator-spk/eynollah into loky-with-shm-for-175-rebuilt Robert Sachunsky 2025-10-09 22:54:11 +02:00
  • d96af425a7
    Merge pull request #4 from bertsky/loky-with-shm-for-175-rebuilt-refactored Robert Sachunsky 2025-10-09 22:18:53 +02:00
  • cab392601e 📝 update changelog Robert Sachunsky 2025-10-09 20:12:06 +02:00
  • e1b56d97da CI: lint with ruff Robert Sachunsky 2025-10-08 17:54:38 +02:00
  • a144026b27 add rough ruff config Robert Sachunsky 2025-10-08 15:13:57 +02:00
  • b3d29bef89 return_contours_of_interested_region*: rm unused variants Robert Sachunsky 2025-10-08 19:21:07 +02:00
  • 8a2d682e12 fix identifier scope in layout OCR options (w/o full_layout) Robert Sachunsky 2025-10-08 16:52:22 +02:00
  • 096def1e9d mbreorder/enhancment: fix missing imports Robert Sachunsky 2025-10-08 15:13:13 +02:00