Commit graph

  • b094a6b77f mbreorder: avoid spaces in logger name Robert Sachunsky 2025-09-25 00:51:45 +02:00
  • d6cdb69acb binarize/enhance/layout/ocr ls_imgs: use the same file name suffix filter for dir-in mode Robert Sachunsky 2025-09-25 00:11:23 +02:00
  • 96a0d22496 mbreorder CLI: change options to mimic other commands Robert Sachunsky 2025-09-24 23:52:35 +02:00
  • 93f7588bfa binarizer CLI: add --log-level Robert Sachunsky 2025-09-24 23:08:50 +02:00
  • 8a1e5a8950 enhancement / layout CLI: do not override logger name Robert Sachunsky 2025-09-24 23:03:11 +02:00
  • 960b11f51f machine-based-reading-order CLI: no foreign logger, add --log-level Robert Sachunsky 2025-09-24 22:58:57 +02:00
  • 45b05c2316 Merge branch 'mbro_dead_code' into prepare-release-v0.5.0 kba 2025-09-24 17:18:31 +02:00
  • 80d50d4bf6 get textlines sorted in textregion - verticals vahidrezanezhad 2025-09-24 16:36:00 +02:00
  • 6d8641a518 get textlines sorted in textregion - verticals b-vr103 2025-09-24 03:43:36 +02:00
  • 6904a98182 get textlines inside textregion sorted debugging vahidrezanezhad 2025-09-24 01:25:57 +02:00
  • ce13d8c5a3 get textlines inside textregion sorted vahidrezanezhad 2025-09-24 01:22:07 +02:00
  • 8b30bdbae2 image_enhancer: use latest page extraction model kba 2025-09-24 16:39:31 +02:00
  • c8ebe84697 image_enhancer: add missing models, remove dead code kba 2025-09-24 16:36:18 +02:00
  • 0a44a7c2c1 get textlines sorted in textregion - verticals vahidrezanezhad 2025-09-24 16:36:00 +02:00
  • b75ca0d31f mb_ro_on_layout: remove copy-pasta code not actually used kba 2025-09-24 16:29:05 +02:00
  • 9c129c7f54
    Merge pull request #180 from bertsky/prepare-release-v0.5.0-fixlogging Konstantin Baierer 2025-09-24 12:28:10 +02:00
  • 5bd318e657 rm print statement (already log msg) Robert Sachunsky 2025-09-24 12:14:32 +02:00
  • 90f1d7aa47 rm summary msg (info already logged elsewhere) Robert Sachunsky 2025-09-24 12:10:11 +02:00
  • 7933b103f5 log modes only once (in run, not in run_single) Robert Sachunsky 2025-09-24 12:09:30 +02:00
  • d0817f5744 fix typo Robert Sachunsky 2025-09-24 12:08:50 +02:00
  • b7e27e8ccc get textlines sorted in textregion - verticals b-vr103 2025-09-24 03:43:36 +02:00
  • e6d410e53b
    get textlines inside textregion sorted debugging vahidrezanezhad 2025-09-24 01:25:57 +02:00
  • 8438001683
    get textlines inside textregion sorted vahidrezanezhad 2025-09-24 01:22:07 +02:00
  • 9ead58b99a Merge remote-tracking branch 'michalbubula/add-feedback' into prepare-release-v0.5.0 kba 2025-09-23 19:50:27 +02:00
  • 7bde99e866 Merge remote-tracking branch 'origin/updating_readme_for_eynollah_use_cases' into prepare-release-v0.5.0 kba 2025-09-23 19:42:55 +02:00
  • aac2e58b05 Merge remote-tracking branch 'michalbubula/add-feedback' into prepare-release-v0.5.0 prepare-release-v0.5.0-unrebased-merged kba 2025-09-23 19:38:56 +02:00
  • df8d93dbfa Merge branch 'main' into add-feedback kba 2025-09-23 19:20:20 +02:00
  • 89e49f46bb Merge remote-tracking branch 'origin/updating_readme_for_eynollah_use_cases' into prepare-release-v0.5.0 kba 2025-09-23 19:16:54 +02:00
  • a65405bead tables are visulaized within layout vahidrezanezhad 2025-09-22 15:56:14 +02:00
  • 4c2e15aa00 default cnn-rnn and transformer ocr models have changed to model_eynollah_ocr_cnnrnn_20250904 and model_eynollah_ocr_trocr_20250919 respectively vahidrezanezhad 2025-09-21 16:33:14 +02:00
  • 554f3988c9 default cnn-rnn and transformer ocr models have changed to model_eynollah_ocr_cnnrnn_20250904 and model_eynollah_ocr_trocr_20250919 respectively v0.5.0.pre1 prepare-release-v0.5.0-rebased vahidrezanezhad 2025-09-21 16:33:14 +02:00
  • c7ed7a30e6 extending image types vahidrezanezhad 2025-09-21 02:32:40 +02:00
  • 6bbdfe1074 extending image types vahidrezanezhad 2025-09-21 02:32:40 +02:00
  • eb0b44b25c Merge text of textlines and handle hyphenated words by joining them correctly vahidrezanezhad 2025-09-19 23:23:30 +02:00
  • e97e3ab192 Merge text of textlines and handle hyphenated words by joining them correctly vahidrezanezhad 2025-09-19 23:23:30 +02:00
  • eb322d5182 writing page contour correctly in xml output + ignore unsupported file types when loading images vahidrezanezhad 2025-09-19 18:06:18 +02:00
  • b38331b4ab writing page contour correctly in xml output + ignore unsupported file types when loading images vahidrezanezhad 2025-09-19 18:06:18 +02:00
  • ee040c7767 debug new page extraction in the case of ignoring page extraction vahidrezanezhad 2025-09-19 15:24:34 +02:00
  • 994bc8a1c0 debug new page extraction in the case of ignoring page extraction vahidrezanezhad 2025-09-19 15:24:34 +02:00
  • 530897c6c2 renaming argument names vahidrezanezhad 2025-09-19 13:20:26 +02:00
  • 5c9cf8472b remove redundant/brittle interval logging kba 2025-09-18 13:19:57 +02:00
  • 146102842a convert all print stmts to logger.info calls kba 2025-09-18 13:15:18 +02:00
  • c64d102613 move logging to CLI and make initialization optional kba 2025-09-18 13:07:41 +02:00
  • 54d9916f3b page extraction model name is changed vahidrezanezhad 2025-09-16 14:27:15 +02:00
  • 310679eeb8 page extraction model name is changed vahidrezanezhad 2025-09-16 14:27:15 +02:00
  • 542646791d For TrOCR, the cropped text lines will no longer be added to a list before prediction. Instead, for each batch size, the text line images will be collected and predictions will be made directly on them. vahidrezanezhad 2025-09-03 19:18:11 +02:00
  • 52cb0d9fac new page extraction model integration vahidrezanezhad 2025-09-15 13:38:23 +02:00
  • 6e008345a0 new page extraction model integration vahidrezanezhad 2025-09-15 13:36:58 +02:00
  • 68a71be8bc Running inference on files in a directory vahidrezanezhad 2025-09-13 22:40:11 +02:00
  • 8c949cec71 PR #173 has been reverted. Additionally, for TrOCR, the cropped text lines will no longer be added to a list before prediction. Instead, for each batch size, the text line images will be collected and predictions will be made directly on them. vahidrezanezhad 2025-09-03 19:18:11 +02:00
  • d9ae7bd12c merged pr #173 in #175 vahidrezanezhad 2025-09-02 15:27:19 +02:00
  • b84d945b5a
    Merge pull request #3 from bertsky/polygon-dilate-buffer-refactor2 Robert Sachunsky 2025-09-02 13:26:52 +02:00
  • 0711166524 changed the drop capitals bonding box to contour ratio threshold vahidrezanezhad 2025-09-01 11:37:22 +02:00
  • e15640aa8a new page extraction model integration vahidrezanezhad 2025-09-15 13:36:58 +02:00
  • 92a7c7cfea changed the drop capitals bonding box to contour ratio threshold vahidrezanezhad 2025-09-01 11:37:22 +02:00
  • 6a735daa60
    Update README.md updating_readme_for_eynollah_use_cases vahidrezanezhad 2025-08-31 23:30:54 +02:00
  • 090341241e writer: use @type='heading' instead of 'header' Robert Sachunsky 2025-08-26 21:07:50 +02:00
  • bb9cba1fd9 writer: SeparatorRegion needs SeparatorRegionType (not ImageRegionType) Robert Sachunsky 2025-08-26 21:07:18 +02:00
  • eae1303ebb contours: rename 'pixel' → 'label' for clarity Robert Sachunsky 2025-08-26 21:06:36 +02:00
  • dbbf1073df avoid pulling unused 'image_page_rotated' through functions Robert Sachunsky 2025-08-26 21:05:40 +02:00
  • 142ac8825e use box2rect instead of crop_image_inside_box when no image needed Robert Sachunsky 2025-08-26 21:02:43 +02:00
  • 892ff41e38 utils: introduce box2rect and box2slice Robert Sachunsky 2025-08-26 21:00:33 +02:00
  • d3566e55ef polygon2contour: fix 698f38e4 (deprecated dtype) Robert Sachunsky 2025-08-29 17:21:08 +02:00
  • 741aa7867c get_marginals: exit early if no peaks found to avoid spurious overlap mask Robert Sachunsky 2025-08-29 12:37:44 +02:00
  • 57821662b9 filter_contours_without_textline_inside: avoid removing from identical lists twice Robert Sachunsky 2025-08-29 12:19:37 +02:00
  • 698f38e461 polygon2contour: avoid overflow Robert Sachunsky 2025-08-29 12:16:56 +02:00
  • 9b9d21d8ac eynollah ocr: support using either a specific model name or a models directory (default model) vahidrezanezhad 2025-08-28 11:30:59 +02:00
  • fdcae8dd6e eynollah ocr: support using either a specific model name or a models directory (default model) vahidrezanezhad 2025-08-28 11:30:59 +02:00
  • 41365645ef Marginals are divided into left and right, and written from top to bottom. vahidrezanezhad 2025-08-26 22:38:03 +02:00
  • 7dd281267d Marginals are divided into left and right, and written from top to bottom. vahidrezanezhad 2025-08-26 22:38:03 +02:00
  • fd6a6495a2 increase dilatation: textregions/lines (5→6), seplines (0→1) Robert Sachunsky 2025-08-21 13:00:31 +02:00
  • 8be52fb143 refactor shapely converisons into contour2polygon / polygon2contour, also handle heterogeneous geometries Robert Sachunsky 2025-08-21 12:59:03 +02:00
  • 8b5f90e243 move dilate_*_contours to .utils.contour, rename dilate_textregions_contours_textline_version → dilate_textline_contours Robert Sachunsky 2025-08-21 01:42:46 +02:00
  • 244772f086 filter_contours_area_of_image*: also ensure validity here Robert Sachunsky 2025-08-21 01:33:16 +02:00
  • 42474afa4b rename *lines_xml → *seplines for clarity Robert Sachunsky 2025-08-21 01:32:32 +02:00
  • b610fe07a6 check_any_text_region_in_model_one_is_main_or_header_light: return original instead of resampled contours Robert Sachunsky 2025-08-21 01:05:15 +02:00
  • 3d53070b90 avoid creating invalid polygons via rounding Robert Sachunsky 2025-08-21 01:03:46 +02:00
  • 277d00579e get_textregion_contours_in_org_image_light: no back rotation, drop slope_first (always 0) Robert Sachunsky 2025-08-20 14:28:14 +02:00
  • b6d1c43a85 dilate_textregions_contours_textline_version: simplify (via shapely's Polygon.buffer()), ensure validity Robert Sachunsky 2025-08-20 14:26:14 +02:00
  • 6c442c9ae9 separate_lines/do_work_of_slopes: skip if crop is empty Robert Sachunsky 2025-08-19 22:56:36 +02:00
  • e9a6ff5d81 return_boxes_of_images_by_order_of_reading_new: simplify, avoid changing dtype during np.append Robert Sachunsky 2025-08-19 20:09:09 +02:00
  • f994ea5f0b dilate_textregions_contours: simplify (via shapely's Polygon.buffer()), ensure validity Robert Sachunsky 2025-08-19 11:58:45 +02:00
  • 7741502876 reading order on given layout vahidrezanezhad 2025-08-18 02:31:13 +02:00
  • 8dc2fab9fa reading order on given layout vahidrezanezhad 2025-08-18 02:31:13 +02:00
  • 8ebba5ac04 add feedback to command line interface michalbubula 2025-08-12 16:21:15 +02:00
  • 268aa141d7 avoiding float in range vahidrezanezhad 2025-08-12 12:50:15 +02:00
  • 20614d1678 avoiding float in range vahidrezanezhad 2025-08-12 12:50:15 +02:00
  • cf4983da54 visualize vertical ocr text vertically vahidrezanezhad 2025-08-08 16:12:55 +02:00
  • 52d9cc9baf deskewing with faster multiprocessing vahidrezanezhad 2025-08-08 11:32:02 +02:00
  • 5db3e9fa64 deskewing with faster multiprocessing vahidrezanezhad 2025-08-08 11:32:02 +02:00
  • 263da755ef loading xmls with UTF-8 encoding vahidrezanezhad 2025-08-07 10:32:49 +02:00
  • 6462ea5b33 adding visualization of ocr text of xml file vahidrezanezhad 2025-08-06 22:33:42 +02:00
  • 322b04145f use the latest ocr model with balanced fraktur-antiqua training dataset vahidrezanezhad 2025-08-05 14:22:22 +02:00
  • a0c19c57be use the latest ocr model with balanced fraktur-antiqua training dataset vahidrezanezhad 2025-08-05 14:22:22 +02:00
  • 1b95f8f38d threshold for textline ocr + new ocr model vahidrezanezhad 2025-07-25 13:18:38 +02:00
  • 0803881f36 threshold for textline ocr + new ocr model vahidrezanezhad 2025-07-25 13:18:38 +02:00
  • 2996fc8b30
    Merge pull request #166 from qurator-spk/updating_readme_for_eynollah_use_cases-cli Clemens Neudecker 2025-07-24 15:30:57 +02:00
  • fd0595f920
    Update Makefile updating_readme_for_eynollah_use_cases-cli vahidrezanezhad 2025-07-24 13:52:38 +02:00
  • da141bb42e resolving tests error vahidrezanezhad 2025-07-23 16:44:17 +02:00
  • daa597dbaa should merged text for the whole page be written in xml? vahidrezanezhad 2025-07-21 14:50:05 +02:00