Commit graph

  • 027b87d321 fixup c0137c2 (missing arguments for utils_ocr) Robert Sachunsky 2025-10-08 14:56:57 +02:00
  • 1d4815b48f utils_ocr: forgot to pass coordinate offsets Robert Sachunsky 2025-10-08 14:56:14 +02:00
  • 839b7c4d84 make models: avoid re-download Robert Sachunsky 2025-10-08 12:33:14 +02:00
  • e5b5264568 CI: add diagnostic message for model symlink Robert Sachunsky 2025-10-08 12:17:53 +02:00
  • ca72a095ca tests: cover table detection in various modes Robert Sachunsky 2025-10-08 00:44:32 +02:00
  • 5e11a68a3e writer/run_single: consistent kwarg naming conf_contours_textregion(s) Robert Sachunsky 2025-10-08 01:03:48 +02:00
  • 75823f9bed run_single: call writer.build_pagexml_no_full_layout w/ kwargs Robert Sachunsky 2025-10-08 00:54:53 +02:00
  • cbbb3248c7 writer: simplify Robert Sachunsky 2025-10-08 00:43:29 +02:00
  • e32479765c writer: simplify Robert Sachunsky 2025-10-07 23:03:27 +02:00
  • d88ca18eec get/do_work_of_slopes etc.: reduce call/return signatures Robert Sachunsky 2025-10-07 22:53:30 +02:00
  • 02a347a48a no more need to rm from contours_only_text_parent_d_ordered now Robert Sachunsky 2025-10-07 22:47:34 +02:00
  • fd43e78442 filter_contours_without_textline_inside: simplify Robert Sachunsky 2025-10-07 22:42:36 +02:00
  • 0a80cd5dff avoid unnecessary 3-channel conversions: for tables, too Robert Sachunsky 2025-10-07 22:37:05 +02:00
  • dfdc705375 do_work_of_slopes: rm unused old variant Robert Sachunsky 2025-10-07 22:33:06 +02:00
  • 2e907875c1 get_text_region_boxes_by_given_contours: simplify Robert Sachunsky 2025-10-07 22:32:06 +02:00
  • d53f829dfd filter_contours_inside_a_bigger_one: fix edge case in 81827c29 Robert Sachunsky 2025-10-07 22:06:57 +02:00
  • 18bbdb7c48 CI: run deps-test with OCR extra so symlink rule fires Robert Sachunsky 2025-10-07 00:54:25 +02:00
  • 23535998f7 tests: symlink OCR models into layout model directory Robert Sachunsky 2025-10-06 21:27:21 +02:00
  • a1904fa660 tests: cover layout with OCR in various modes Robert Sachunsky 2025-10-06 17:44:12 +02:00
  • 595ed02743 run_single: simplify; allow running TrOCR in non-fl mode, too Robert Sachunsky 2025-10-06 17:24:50 +02:00
  • 6e57ab3741 textline_contours_postprocessing: do not catch arbitrary exceptions Robert Sachunsky 2025-10-06 16:53:59 +02:00
  • fe603188f4 avoid unnecessary 3-channel conversions Robert Sachunsky 2025-10-06 13:11:03 +02:00
  • 155b8f68b8 matching deskewed text region contours with predicted: improve Robert Sachunsky 2025-10-06 12:58:24 +02:00
  • 0e00d7868b matching deskewed text region contours with predicted: improve Robert Sachunsky 2025-10-06 12:55:10 +02:00
  • 0f33c21eb3 matching deskewed text region contours with predicted: improve Robert Sachunsky 2025-10-05 02:45:01 +02:00
  • 73e5a1def8 matching deskewed text region contours with predicted: simplify Robert Sachunsky 2025-10-05 02:33:03 +02:00
  • d774a23daa matching deskewed text region contours with predicted: simplify Robert Sachunsky 2025-10-05 02:18:17 +02:00
  • 29b4527bde do_order_of_regions: simplify Robert Sachunsky 2025-10-03 02:06:08 +02:00
  • e674ea08f3 do_order_of_regions: drop redundant no/full_layout Robert Sachunsky 2025-10-03 00:59:25 +02:00
  • e9bb62bd86 do_order_of_regions: simplify Robert Sachunsky 2025-10-02 23:44:00 +02:00
  • 7387f5a929 do_order_of_regions: improve box matching, simplify Robert Sachunsky 2025-10-02 22:35:40 +02:00
  • 4950e6bd78 order_of_regions: simplify Robert Sachunsky 2025-10-02 22:28:52 +02:00
  • a1c8fd4467 do_order_of_regions / order_of_regions: simplify Robert Sachunsky 2025-10-02 21:41:37 +02:00
  • 415b2cbad8 eynollah, drop_capitals: simplify Robert Sachunsky 2025-10-02 21:36:22 +02:00
  • 3f3353ec3a do_order_of_regions: simplify Robert Sachunsky 2025-10-02 21:28:04 +02:00
  • 8c3d5eb0eb separate_marginals_to_left_and_right_and_order_from_top_to_down: simplify Robert Sachunsky 2025-10-02 21:07:35 +02:00
  • 6fc6e274d3 Merge remote-tracking branch 'bertsky/loky-with-shm-for-175-rebuilt-refactored' into prepare-v0.6.0 v0.6.0-with-pr-bertsky-4 kba 2025-10-09 15:44:19 +02:00
  • ff40f06bca Merge branch 'main' into prepare-v0.6.0 kba 2025-10-09 14:05:29 +02:00
  • 8215814a3f Merge branch 'changelog-v0.5.0' kba 2025-10-09 14:03:45 +02:00
  • 4ffe6190d2 📝 changelog changelog-v0.5.0 kba 2025-10-09 14:03:26 +02:00
  • 8869c20c33 updating CHANGELOG for v0.5.0 vahidrezanezhad 2025-10-06 14:53:47 +02:00
  • 3a4870f186 return_contours_of_interested_region*: rm unused variants Robert Sachunsky 2025-10-08 19:21:07 +02:00
  • bb8f235585 fix identifier scope in layout OCR options (w/o full_layout) Robert Sachunsky 2025-10-08 19:19:10 +02:00
  • a7de13e900 CI: lint with ruff Robert Sachunsky 2025-10-08 17:54:38 +02:00
  • eaae262f65 fix identifier scope in layout OCR options (w/o full_layout) Robert Sachunsky 2025-10-08 16:52:22 +02:00
  • 15693d07c3 add rough ruff config Robert Sachunsky 2025-10-08 15:13:57 +02:00
  • 26886dc052 mbreorder/enhancment: fix missing imports Robert Sachunsky 2025-10-08 15:13:13 +02:00
  • eee9c881ed fixup c0137c2 (missing arguments for utils_ocr) Robert Sachunsky 2025-10-08 14:56:57 +02:00
  • 9b4e835578 utils_ocr: forgot to pass coordinate offsets Robert Sachunsky 2025-10-08 14:56:14 +02:00
  • 303a3d484b fixup f700aaf3 Robert Sachunsky 2025-10-08 12:58:59 +02:00
  • ffe7a2de6b make models: avoid re-download Robert Sachunsky 2025-10-08 12:33:14 +02:00
  • ee91caee4a fixup 70344c13 Robert Sachunsky 2025-10-08 12:23:10 +02:00
  • 7ed2da966f CI: add diagnostic message for model symlink Robert Sachunsky 2025-10-08 12:17:53 +02:00
  • 05fb64676a fixup a388de1 Robert Sachunsky 2025-10-08 12:06:40 +02:00
  • e4ce4c593b fixup for e451ccd0 Robert Sachunsky 2025-10-08 02:05:02 +02:00
  • 26266dd13b writer/run_single: consistent kwarg naming conf_contours_textregion(s) Robert Sachunsky 2025-10-08 01:03:48 +02:00
  • 7dd51d1b10 run_single: call writer.build_pagexml_no_full_layout w/ kwargs Robert Sachunsky 2025-10-08 00:54:53 +02:00
  • cc4f263d88 tests: cover table detection in various modes Robert Sachunsky 2025-10-08 00:44:32 +02:00
  • 4ec7999803 writer: simplify Robert Sachunsky 2025-10-08 00:43:29 +02:00
  • 0d3d476f0a writer: simplify Robert Sachunsky 2025-10-07 23:03:27 +02:00
  • a388de147c get/do_work_of_slopes etc.: reduce call/return signatures Robert Sachunsky 2025-10-07 22:53:30 +02:00
  • e451ccd0a6 no more need to rm from contours_only_text_parent_d_ordered now Robert Sachunsky 2025-10-07 22:47:34 +02:00
  • c770108941 filter_contours_without_textline_inside: simplify Robert Sachunsky 2025-10-07 22:42:36 +02:00
  • a39a9c5cc4 avoid unnecessary 3-channel conversions: for tables, too Robert Sachunsky 2025-10-07 22:37:05 +02:00
  • 634d2b059f do_work_of_slopes: rm unused old variant Robert Sachunsky 2025-10-07 22:33:06 +02:00
  • 3e7628b5cd get_text_region_boxes_b_given_contours: simplify Robert Sachunsky 2025-10-07 22:32:06 +02:00
  • 316d813db9 filter_contours_inside_a_bigger_one: fix edge case in 81827c29 Robert Sachunsky 2025-10-07 22:06:57 +02:00
  • f700aaf371 CI: run deps-test with OCR extra so symlink rule fires Robert Sachunsky 2025-10-07 00:54:25 +02:00
  • 59a19a169d tests: symlink OCR models into layout model directory Robert Sachunsky 2025-10-06 21:27:21 +02:00
  • cd8e6b81eb tests: cover layout with OCR in various modes Robert Sachunsky 2025-10-06 17:44:12 +02:00
  • 4bb93b8f46 run_single: simplify; allow running TrOCR in non-fl mode, too Robert Sachunsky 2025-10-06 17:24:50 +02:00
  • 4a18a486a0 textline_contours_postprocessing: do not catch arbitrary exceptions Robert Sachunsky 2025-10-06 16:53:59 +02:00
  • 70344c137c avoid unnecessary 3-channel conversions: missing cases Robert Sachunsky 2025-10-06 16:53:06 +02:00
  • 584cde7eb8 updating CHANGELOG for v0.5.0 updating_CHANGELOG_v0.5.0 vahidrezanezhad 2025-10-06 14:53:47 +02:00
  • 51995c9e46 avoid unnecessary 3-channel conversions Robert Sachunsky 2025-10-06 13:11:03 +02:00
  • 1fa46303c0 matching deskewed text region contours with predicted: improve Robert Sachunsky 2025-10-06 12:58:24 +02:00
  • 2850fc6f8d matching deskewed text region contours with predicted: improve Robert Sachunsky 2025-10-06 12:55:10 +02:00
  • 29fcc75c0b matching deskewed text region contours with predicted: improve Robert Sachunsky 2025-10-05 02:45:01 +02:00
  • 56f2d4131e matching deskewed text region contours with predicted: simplify Robert Sachunsky 2025-10-05 02:33:03 +02:00
  • 04766df3d3 matching deskewed text region contours with predicted: simplify Robert Sachunsky 2025-10-05 02:18:17 +02:00
  • fa58653ec2 do_order_of_regions: simplify Robert Sachunsky 2025-10-03 02:06:08 +02:00
  • 0b1ecc02c8 do_order_of_regions: drop redundant no/full_layout Robert Sachunsky 2025-10-03 00:59:25 +02:00
  • b52ce118b8 do_order_of_regions: simplify Robert Sachunsky 2025-10-02 23:44:00 +02:00
  • f5e15ed6f9 do_order_of_regions: improve box matching, simplify Robert Sachunsky 2025-10-02 22:35:40 +02:00
  • 94599b9b12 order_of_regions: simplify Robert Sachunsky 2025-10-02 22:28:52 +02:00
  • 8897dbe8dd do_order_of_regions / order_of_regions: simplify Robert Sachunsky 2025-10-02 21:41:37 +02:00
  • 9a7bfd6409 eynollah, drop_capitals: simplify Robert Sachunsky 2025-10-02 21:36:22 +02:00
  • a06b7da306 do_order_of_regions: simplify Robert Sachunsky 2025-10-02 21:28:04 +02:00
  • 9bcad6f4c4 separate_marginals_to_left_and_right_and_order_from_top_to_down: simplify Robert Sachunsky 2025-10-02 21:07:35 +02:00
  • 81827c2942 filter_contours_inside_a_bigger_one: simplify Robert Sachunsky 2025-10-02 21:03:07 +02:00
  • 0b9d4901a6 contour features: avoid unused calculations, simplify, add shortcuts Robert Sachunsky 2025-10-02 20:51:03 +02:00
  • 8a9b4f8f55 remove commented-out requirement for tf == 2.12.1, rely on same version as in eynollah proper kba 2025-10-02 12:16:26 +02:00
  • 96eb1c11e6 Merge remote-tracking branch 'bertsky/loky-with-shm-for-175-rebuilt' into prepare-v0.6.0 kba 2025-10-01 20:27:56 +02:00
  • f60e0543ab training: update docs kba 2025-10-01 19:16:58 +02:00
  • 1c043c586a eynollah-training: all training CLI into single click group kba 2025-10-01 18:52:11 +02:00
  • 690d47444c make relative wildcard imports explicit kba 2025-10-01 18:36:28 +02:00
  • 2baf42e878 organize imports, use relative imports kba 2025-10-01 18:15:54 +02:00
  • 4f5cdf3140 move training scripts to src/eynollah/training kba 2025-10-01 18:12:45 +02:00
  • f0ef2b5db2 remove unused imports kba 2025-10-01 18:10:13 +02:00
  • 95bb5908bb Merge branch 'integrate-training-from-sbb_pixelwise_segmentation' of https://github.com/qurator-spk/eynollah into integrate-training-from-sbb_pixelwise_segmentation kba 2025-10-01 18:02:09 +02:00