Commit graph

  • bcec0c4a55 Merge 9801129aa6 into 1df32eba87 Robert Sachunsky 2026-05-22 10:38:11 +00:00
  • 9801129aa6 estimate_skew_contours: ensure retval is always float Robert Sachunsky 2026-05-22 12:37:07 +02:00
  • 26afc5ddab ModelZoo: ensure exported TensorShape is converted to plain tuple Robert Sachunsky 2026-05-22 12:35:44 +02:00
  • 0836230c6b utils_ocr: avoid module-level import of TF Robert Sachunsky 2026-05-21 22:50:53 +02:00
  • f3a93983c0 ModelZoo: add ocr key for memory_limit Robert Sachunsky 2026-05-21 22:50:13 +02:00
  • ea41dcae1d trocr: use beam search instead of greedy decoding Robert Sachunsky 2026-05-21 17:52:27 +02:00
  • 074753a98e ModelZoo: fix Torch device selection Robert Sachunsky 2026-05-21 17:25:53 +02:00
  • 000e4ac8d8 trocr: extract confidence, too Robert Sachunsky 2026-05-21 17:25:39 +02:00
  • f3649adbf2 trocr: apply do_not_mask_with_textline_contour here, too Robert Sachunsky 2026-05-21 17:23:11 +02:00
  • 1d67e65f11 trocr: simplify, batch over entire page… Robert Sachunsky 2026-05-21 15:48:21 +02:00
  • d50bd7c650 trocr: avoid warnings by passing clean_up_tokenization_spaces=False Robert Sachunsky 2026-05-21 14:20:51 +02:00
  • f9f9130dbb do_order_of_regions: remove redundant+overcautious assertion Robert Sachunsky 2026-05-21 03:21:36 +02:00
  • bf7ec0233d ModelZoo.load_model: use memory_limit instead of memory_growth Robert Sachunsky 2026-05-21 02:43:34 +02:00
  • 94a5e9da14 ModelZoo.load_model: avoid attempting to load exported models as Keras models (which causes a warning), but switch to TF-Serving import right away Robert Sachunsky 2026-05-21 02:41:19 +02:00
  • 7f2bf715df ModelZoo.load_model: fix loading exported vs saved models Robert Sachunsky 2026-05-21 02:39:59 +02:00
  • 3de1407d18 drop unnecessary TF / Torch imports Robert Sachunsky 2026-05-21 02:38:20 +02:00
  • bdfebd2c70 reload_weights: save()export() w/ serve() inference Robert Sachunsky 2026-05-19 03:20:24 +02:00
  • 86adaf299a training.models.transformer_block: tf.reshape → Keras Reshape layer Robert Sachunsky 2026-05-19 03:17:31 +02:00
  • 9efce5e9f2 Predictor.shutdown: use join() instead of terminate() Robert Sachunsky 2026-05-19 03:16:15 +02:00
  • ffe5cdc519 ModelZoo.shutdown: drop extra del (already done by shutdown()) Robert Sachunsky 2026-05-19 02:09:49 +02:00
  • 481c286da9 ModelZoo.load_model: no XLA compilation Robert Sachunsky 2026-05-19 02:08:14 +02:00
  • f329e10a80 test_layout: rm ignored --allow_scaling option Robert Sachunsky 2026-05-12 20:04:41 +02:00
  • 17b311441a model_zoo: also parse comma/colon syntax for device in Torch case Robert Sachunsky 2026-05-12 20:02:40 +02:00
  • be4fe8c263 contour: drop unused functions depending on rotation_image_new() Robert Sachunsky 2026-05-12 19:04:37 +02:00
  • 87cce6c963 CLI tests: add opt-in envvar EYNOLLAH_OPTIONS for device selection, model directory etc. Robert Sachunsky 2026-05-12 19:03:32 +02:00
  • 1ed633bc25 test_model_zoo: adapt (load_models instead of load_model) Robert Sachunsky 2026-05-12 19:02:43 +02:00
  • 21ecb043f7 CLIs: move --device option to group level Robert Sachunsky 2026-05-12 18:41:21 +02:00
  • 7ed1a1ebac CLIs: allow -h and show defaults uniformly, harmonise help, drop remaining redundant negative options Robert Sachunsky 2026-05-12 18:34:56 +02:00
  • cd62f13872 eynollah_ocr: make work again, re-use Eynollah base class… Robert Sachunsky 2026-05-12 18:31:18 +02:00
  • ded668a256 model_zoo: fix clash between Predictor and direct (OCR) use-cases… Robert Sachunsky 2026-05-12 18:17:43 +02:00
  • 98e6fbbcbb mbreorder: make work again, re-use Eynollah base class Robert Sachunsky 2026-05-11 11:30:39 +02:00
  • 7e8b9311d3 Revert "test_model_zoo: fix calls" Robert Sachunsky 2026-05-19 03:32:37 +02:00
  • a1449da1d1 Revert "fix model loading in mb_ro and ocr" Robert Sachunsky 2026-05-19 03:32:19 +02:00
  • 1df32eba87 CD: base docker image: typo {,v}3.13.0 main kba 2026-05-11 13:41:30 +02:00
  • d7337a3080 CD: base docker image on versioned ocrd/core-cuda-tf2:v3.13.0 kba 2026-05-11 13:38:36 +02:00
  • e612db2bb1 📦 v0.8.0 v0.8.0 kba 2026-05-11 13:16:30 +02:00
  • 6cfbd93ac7 📝 changelog kba 2026-05-11 13:14:56 +02:00
  • c7104c2852 Merge branch 'prepare-release-v0.8.0' kba 2026-05-11 13:12:19 +02:00
  • 5a98f55be3 test_model_zoo: fix calls prepare-release-v0.8.0 kba 2026-05-11 12:22:24 +02:00
  • 218a95e6a0 fix model loading in mb_ro and ocr kba 2026-05-11 12:19:20 +02:00
  • 2035b07b55 Merge remote-tracking branch 'bertsky/ro-fixes-final' into prepare-release-v0.8.0 kba 2026-05-11 09:46:17 +02:00
  • db87aa995d reqs for OCR: relax ad5f2272 (depending on Python version) Robert Sachunsky 2026-05-11 03:15:54 +02:00
  • e183937c5d separate_lines_new2: fix coord overflow by clipping, simplify… Robert Sachunsky 2026-05-11 03:09:02 +02:00
  • 130f0aee42 do_work_of_slopes_curved: improve on d257869d Robert Sachunsky 2026-05-11 03:03:04 +02:00
  • ce5d6bc43c try to accommodate outdated Python versions unsupported by current transformers kba 2026-05-09 18:03:40 +02:00
  • 03f3f9af17 update model zoo and docs to link to v0_8_0 model release on zenodo kba 2026-05-09 17:58:59 +02:00
  • a61fb09ec5 CI: drop py3.8 (u/a for new req transformers >= 5) Robert Sachunsky 2026-05-09 04:14:49 +02:00
  • 4406a0299e update CLI test for binarization… Robert Sachunsky 2026-05-09 04:12:19 +02:00
  • 4cd398bd0d standalone binarization: update, simplify… Robert Sachunsky 2026-05-09 04:12:02 +02:00
  • 29abae0144 update CLI test for enhancer… Robert Sachunsky 2026-05-09 02:59:52 +02:00
  • c1b6a61301 standalone enhancer: make this work (at all)… Robert Sachunsky 2026-05-09 02:55:01 +02:00
  • d63ce5538c resize_image_with_column_classifier(): apply num_col bounds here too Robert Sachunsky 2026-05-09 02:53:04 +02:00
  • 6df2144c0f fix 2 typos in previous commits… Robert Sachunsky 2026-05-09 02:31:22 +02:00
  • daf0c90d6e Merge pull request #8 from bertsky/ro-fixes-training-reload Robert Sachunsky 2026-05-08 18:46:43 +02:00
  • 395decd6d6 Merge pull request #7 from qurator-spk/ro-fixes-training-reload-additions Robert Sachunsky 2026-05-08 18:45:28 +02:00
  • 3a9d72d3fc Merge pull request #6 from qurator-spk/update-cd Robert Sachunsky 2026-05-08 18:44:49 +02:00
  • ea8f985ff1 apply cropping only after textline and early layout… Robert Sachunsky 2026-05-08 18:41:47 +02:00
  • 58afdf5e87 do_prediction*(): ensure always returns dtype=uint8 Robert Sachunsky 2026-05-08 17:36:31 +02:00
  • 68a26a5c3f do_prediction*(): smooth window transitions with sigmoid… Robert Sachunsky 2026-05-08 05:18:00 +02:00
  • cefe596f8b do_prediction*(): avoid unnecessary tiles, simplify… Robert Sachunsky 2026-05-08 00:55:18 +02:00
  • a0bf1b51f4 makefile to reload models kba 2026-05-07 19:30:29 +02:00
  • 34a9d458ce training deps: use sacred fork w/o pkg_resources, pin tf/tf_keras, protobuf packages to work with tensorflow_addons kba 2026-05-07 18:09:27 +02:00
  • 2747385f89 remove unused deprecation-warning-causing biopython dependency kba 2026-05-07 17:15:15 +02:00
  • d8c83d6137 make_valid(): avoid oversimplification, improve parameter search Robert Sachunsky 2026-05-05 15:00:16 +02:00
  • 45868e99cd get_slopes_and_deskew_new_light2: ignore tiny contour areas Robert Sachunsky 2026-05-04 15:55:00 +02:00
  • 934ac90e92 get_slopes_and_deskew_new_light2: avoid +/- 90° cancellation… Robert Sachunsky 2026-05-04 15:52:07 +02:00
  • 29bb55ceff return_deskew_slop: no >90° search unless for full page, simplify Robert Sachunsky 2026-05-01 00:27:00 +02:00
  • d7a3f4cec6 training: add cfg param reload_weights for building but loading… Robert Sachunsky 2026-04-30 16:54:26 +02:00
  • cbb3be0e01 add diagnostic plotting for prediction masking (commented) Robert Sachunsky 2026-04-30 00:49:18 +02:00
  • 33c055389d bold run_single refactoring (predict segmentation on cropped img)… Robert Sachunsky 2026-04-29 01:52:29 +02:00
  • 7e7cc6a801 do_order_of_regions(): use region mask instead of textline mask… Robert Sachunsky 2026-04-29 14:05:06 +02:00
  • 63df9be4db find_number_of_columns_in_document(): pass in (reuse) masks Robert Sachunsky 2026-04-29 14:03:26 +02:00
  • da9e00cfe5 consistently handle textline mask with respect to drop-capital mask… Robert Sachunsky 2026-04-27 13:04:09 +02:00
  • 2641171fb1 return_boxes_...order_of_reading...: avoid negative slices… Robert Sachunsky 2026-04-27 00:42:31 +02:00
  • 6a92f0d49c make get_deskewed_masks() unconditional, call only when needed Robert Sachunsky 2026-04-27 00:38:22 +02:00
  • 52eb4c9a0a move label definition and deskewing cancellation up Robert Sachunsky 2026-04-27 00:33:07 +02:00
  • fa882e1dbe move run_boxes_order() call to RO section of run_single() Robert Sachunsky 2026-04-27 00:22:15 +02:00
  • d88bd485ff get_slopes*(): does not need passing boxes separately Robert Sachunsky 2026-04-27 00:14:28 +02:00
  • 869646cbf5 get_full_layout() does not need the textline mask Robert Sachunsky 2026-04-27 00:07:27 +02:00
  • b5bc161a4c extract_page(): get external contours instead of indiscriminate tree Robert Sachunsky 2026-04-27 00:02:48 +02:00
  • 287bebde0d get_marginals(): fix height factor for mask resizing Robert Sachunsky 2026-04-30 15:44:46 +02:00
  • a031d590b8 get_marginals(): do allow both left and right point (f/u 4bdea39)… Robert Sachunsky 2026-04-30 15:32:15 +02:00
  • 9571ce3474 get_marginals(): reduce indentation Robert Sachunsky 2026-04-27 00:02:03 +02:00
  • c18deb0722 drop relabelling all marginalia to main if no main (now unnecessary) Robert Sachunsky 2026-04-27 00:19:33 +02:00
  • 1f6db34adf run/get_marginals(): simplify and speed up… Robert Sachunsky 2026-04-26 23:51:05 +02:00
  • 45a43f7e5e get_marginals(): fixup point_right fallback Robert Sachunsky 2026-04-26 21:52:11 +02:00
  • 0b8d8a7330 docker: core to 3.12.3 kba 2026-04-29 17:20:36 +02:00
  • ad5f22726e 🔥 require transformers >= 5 kba 2026-04-29 17:06:13 +02:00
  • f58189d5f4 ci: tag eynollah docker image with git tag version if possible, else latest kba 2026-04-29 16:28:09 +02:00
  • 4e9c0010a3 . ocrd-wrappers kba 2026-04-28 15:48:03 +02:00
  • 243cde804e . kba 2026-04-28 15:45:47 +02:00
  • 7f8bfc9945 . kba 2026-04-28 15:45:03 +02:00
  • d2aae35446 . kba 2026-04-28 15:39:53 +02:00
  • d705f855f1 . kba 2026-04-28 15:36:50 +02:00
  • abdcb1a1f9 . kba 2026-04-28 15:33:57 +02:00
  • 69280187c5 . kba 2026-04-28 15:29:48 +02:00
  • 1ba82ede88 . kba 2026-04-28 15:25:36 +02:00
  • be1296150c . kba 2026-04-28 15:07:33 +02:00
  • 4899a8fa17 . kba 2026-04-28 14:59:01 +02:00
  • 29ef9f09dc . kba 2026-04-28 14:53:13 +02:00