Commit graph

  • 01d0b03356
    Merge dfa651ef8a into 2e3f45c300 Robert Sachunsky 2026-06-12 20:22:29 +00:00
  • dfa651ef8a predictor: show full stacktrace before passing the exception over Robert Sachunsky 2026-06-12 22:21:06 +02:00
  • 89ce9de6fa musicregion is added to pagexml to label integrating_trocr_and_torch_ensembling_and_updating_characters_list-refactor vahidrezanezhad 2026-02-24 15:46:15 +01:00
  • 303bdfe0e7 Amiri font which works for both arabic and latin vahidrezanezhad 2026-02-24 13:55:45 +01:00
  • d2123a2746 FIXME: get label for decoration without type attribute vahidrezanezhad 2026-02-24 01:39:12 +01:00
  • 499e3d0715 trocr inference is integrated - works on CPU cause seg fault on GPU vahidrezanezhad 2026-02-18 15:04:54 +01:00
  • a11c833fc1 bug fix: layout visualization vahidrezanezhad 2026-02-16 11:50:39 +01:00
  • d0b3bb419f extracting ocr textline images and text: vertical lines threshold has changed to 1.4 vahidrezanezhad 2026-02-10 14:32:23 +01:00
  • 4776ea9fc4 torch model ensembling is integrated vahidrezanezhad 2026-02-04 21:16:08 +01:00
  • aba0138216 generate or update list of characters in the case of cnn-rnn ocr training vahidrezanezhad 2026-02-03 20:20:20 +01:00
  • 7f86a55ccb integrating transformer ocr vahidrezanezhad 2026-02-03 19:45:50 +01:00
  • e9839a8b54 makefile to reload models: avoid ONNX conversion for cnn-rnn-ocr too Robert Sachunsky 2026-06-12 15:00:36 +02:00
  • 19504cb932 makefile to reload models: add target for SavedModel Keras format Robert Sachunsky 2026-06-12 14:59:38 +02:00
  • 60c9f4786c ModelZoo device selection: warn if model category still unmatched… Robert Sachunsky 2026-06-12 14:58:32 +02:00
  • 94082bc64a ModelZoo TF-Serving backend: deal with buggy .inputs signature… Robert Sachunsky 2026-06-12 14:56:44 +02:00
  • 45c92eada2 models w/ multiple inputs yield a tuple for .input_shape Robert Sachunsky 2026-06-12 14:55:46 +02:00
  • 08946067ac ModelZoo ONNX backend: handle multiple inputs, too Robert Sachunsky 2026-06-12 14:54:51 +02:00
  • 9d2412080f training.models for cnn-rnn-ocr: fix config names for height/width… Robert Sachunsky 2026-06-12 14:52:23 +02:00
  • 4181e03bc9 training convert --rebuild for cnn-rnn-ocr: override charset file… Robert Sachunsky 2026-06-12 14:48:47 +02:00
  • 33734d3eeb
    Merge 28a559c710 into 2e3f45c300 Konstantin Baierer 2026-06-11 19:07:25 +02:00
  • 28a559c710 Merge remote-tracking branch 'bertsky/fix-0.8-modelzoo-and-predictor' into integrating_trocr_and_torch_ensembling_and_updating_characters_list kba 2026-06-11 19:07:19 +02:00
  • 6df11d92d8 Merge branch 'integrating_trocr_and_torch_ensembling_and_updating_characters_list' kba 2026-06-11 18:59:33 +02:00
  • bed7fe526b remove (half-implemented) page_alto functionality kba 2026-06-11 17:04:56 +02:00
  • 348ac95ad3 Eynollah_ocr: drop fixed input sizes… Robert Sachunsky 2026-06-03 20:59:00 +02:00
  • 24c7d4c277 update trocr smoke test, add cnnrnn ocr smoke test Robert Sachunsky 2026-06-03 20:58:05 +02:00
  • 27ca9733db ModelZoo ONNX backend for inference: support multi-input or -output Robert Sachunsky 2026-06-03 20:57:02 +02:00
  • 38fe4d33ad Predictor for multi-input models: present as list instead of tuple… Robert Sachunsky 2026-06-03 20:56:00 +02:00
  • 4e7e1c06b9 trocr viarant for Predictor runtime: no model size for input_shape… Robert Sachunsky 2026-06-03 20:51:56 +02:00
  • f447a9f248 trocr: move preprocessor and decoder into model object, too… Robert Sachunsky 2026-06-03 03:41:44 +02:00
  • d2f2a1e06b Eynollah_ocr: correctly handle min_conf, improve writer… Robert Sachunsky 2026-06-03 00:43:46 +02:00
  • 8ffc4ed8d3 Eynollah_ocr: adapt to inference model, improve and simplify… Robert Sachunsky 2026-06-02 21:20:06 +02:00
  • a391ee24e6 Predictor: handle multi-input and/or multi-output cases Robert Sachunsky 2026-06-02 21:18:22 +02:00
  • c79b73dcc8 cnn-rnn-ocr: move CTC decoder and string decoder to inference model… Robert Sachunsky 2026-06-02 20:26:42 +02:00
  • 13f2f81c45 ModelZoo: support inference with ONNX/TensorRT… Robert Sachunsky 2026-05-28 18:08:08 +02:00
  • f833a516e7 training: add CLI command convert Robert Sachunsky 2026-05-28 17:48:21 +02:00
  • 62b55a3809 train params: drop reload_weights, re-use dir_of_start_model Robert Sachunsky 2026-05-28 17:42:55 +02:00
  • 093030f503 train/models: move all model builders to models.get_model() Robert Sachunsky 2026-05-28 17:37:45 +02:00
  • faef1967f8 models.cnn_rnn_ocr_model: add inference option, drop model name Robert Sachunsky 2026-05-28 17:32:02 +02:00
  • 2e3f45c300
    Merge pull request #220 from dokempf/remove-vim-swap-file main Clemens Neudecker 2026-05-28 11:41:21 +02:00
  • e17712eacd Remove accidentally committed vim swap file Dominic Kempf 2026-05-28 11:27:41 +02:00
  • c4a7eec5b3 models: cosmetics Robert Sachunsky 2026-05-27 01:58:21 +02:00
  • 9801129aa6 estimate_skew_contours: ensure retval is always float Robert Sachunsky 2026-05-22 12:37:07 +02:00
  • 26afc5ddab ModelZoo: ensure exported TensorShape is converted to plain tuple Robert Sachunsky 2026-05-22 12:35:44 +02:00
  • 0836230c6b utils_ocr: avoid module-level import of TF Robert Sachunsky 2026-05-21 22:50:53 +02:00
  • f3a93983c0 ModelZoo: add ocr key for memory_limit Robert Sachunsky 2026-05-21 22:50:13 +02:00
  • ea41dcae1d trocr: use beam search instead of greedy decoding Robert Sachunsky 2026-05-21 17:52:27 +02:00
  • 074753a98e ModelZoo: fix Torch device selection Robert Sachunsky 2026-05-21 17:25:53 +02:00
  • 000e4ac8d8 trocr: extract confidence, too Robert Sachunsky 2026-05-21 17:25:39 +02:00
  • f3649adbf2 trocr: apply do_not_mask_with_textline_contour here, too Robert Sachunsky 2026-05-21 17:23:11 +02:00
  • 1d67e65f11 trocr: simplify, batch over entire page… Robert Sachunsky 2026-05-21 15:48:21 +02:00
  • d50bd7c650 trocr: avoid warnings by passing clean_up_tokenization_spaces=False Robert Sachunsky 2026-05-21 14:20:51 +02:00
  • f9f9130dbb do_order_of_regions: remove redundant+overcautious assertion Robert Sachunsky 2026-05-21 03:21:36 +02:00
  • bf7ec0233d ModelZoo.load_model: use memory_limit instead of memory_growth Robert Sachunsky 2026-05-21 02:43:34 +02:00
  • 94a5e9da14 ModelZoo.load_model: avoid attempting to load exported models as Keras models (which causes a warning), but switch to TF-Serving import right away Robert Sachunsky 2026-05-21 02:41:19 +02:00
  • 7f2bf715df ModelZoo.load_model: fix loading exported vs saved models Robert Sachunsky 2026-05-21 02:39:59 +02:00
  • 3de1407d18 drop unnecessary TF / Torch imports Robert Sachunsky 2026-05-21 02:38:20 +02:00
  • bdfebd2c70 reload_weights: save()export() w/ serve() inference Robert Sachunsky 2026-05-19 03:20:24 +02:00
  • 86adaf299a training.models.transformer_block: tf.reshape → Keras Reshape layer Robert Sachunsky 2026-05-19 03:17:31 +02:00
  • 9efce5e9f2 Predictor.shutdown: use join() instead of terminate() Robert Sachunsky 2026-05-19 03:16:15 +02:00
  • ffe5cdc519 ModelZoo.shutdown: drop extra del (already done by shutdown()) Robert Sachunsky 2026-05-19 02:09:49 +02:00
  • 481c286da9 ModelZoo.load_model: no XLA compilation Robert Sachunsky 2026-05-19 02:08:14 +02:00
  • f329e10a80 test_layout: rm ignored --allow_scaling option Robert Sachunsky 2026-05-12 20:04:41 +02:00
  • 17b311441a model_zoo: also parse comma/colon syntax for device in Torch case Robert Sachunsky 2026-05-12 20:02:40 +02:00
  • be4fe8c263 contour: drop unused functions depending on rotation_image_new() Robert Sachunsky 2026-05-12 19:04:37 +02:00
  • 87cce6c963 CLI tests: add opt-in envvar EYNOLLAH_OPTIONS for device selection, model directory etc. Robert Sachunsky 2026-05-12 19:03:32 +02:00
  • 1ed633bc25 test_model_zoo: adapt (load_models instead of load_model) Robert Sachunsky 2026-05-12 19:02:43 +02:00
  • 21ecb043f7 CLIs: move --device option to group level Robert Sachunsky 2026-05-12 18:41:21 +02:00
  • 7ed1a1ebac CLIs: allow -h and show defaults uniformly, harmonise help, drop remaining redundant negative options Robert Sachunsky 2026-05-12 18:34:56 +02:00
  • cd62f13872 eynollah_ocr: make work again, re-use Eynollah base class… Robert Sachunsky 2026-05-12 18:31:18 +02:00
  • ded668a256 model_zoo: fix clash between Predictor and direct (OCR) use-cases… Robert Sachunsky 2026-05-12 18:17:43 +02:00
  • 98e6fbbcbb mbreorder: make work again, re-use Eynollah base class Robert Sachunsky 2026-05-11 11:30:39 +02:00
  • 7e8b9311d3 Revert "test_model_zoo: fix calls" Robert Sachunsky 2026-05-19 03:32:37 +02:00
  • a1449da1d1 Revert "fix model loading in mb_ro and ocr" Robert Sachunsky 2026-05-19 03:32:19 +02:00
  • 1df32eba87 CD: base docker image: typo {,v}3.13.0 kba 2026-05-11 13:41:30 +02:00
  • d7337a3080 CD: base docker image on versioned ocrd/core-cuda-tf2:v3.13.0 kba 2026-05-11 13:38:36 +02:00
  • e612db2bb1 📦 v0.8.0 v0.8.0 kba 2026-05-11 13:16:30 +02:00
  • 6cfbd93ac7 📝 changelog kba 2026-05-11 13:14:56 +02:00
  • c7104c2852 Merge branch 'prepare-release-v0.8.0' kba 2026-05-11 13:12:19 +02:00
  • 5a98f55be3 test_model_zoo: fix calls prepare-release-v0.8.0 kba 2026-05-11 12:22:24 +02:00
  • 218a95e6a0 fix model loading in mb_ro and ocr kba 2026-05-11 12:19:20 +02:00
  • 2035b07b55 Merge remote-tracking branch 'bertsky/ro-fixes-final' into prepare-release-v0.8.0 kba 2026-05-11 09:46:17 +02:00
  • db87aa995d reqs for OCR: relax ad5f2272 (depending on Python version) Robert Sachunsky 2026-05-11 03:15:54 +02:00
  • e183937c5d separate_lines_new2: fix coord overflow by clipping, simplify… Robert Sachunsky 2026-05-11 03:09:02 +02:00
  • 130f0aee42 do_work_of_slopes_curved: improve on d257869d Robert Sachunsky 2026-05-11 03:03:04 +02:00
  • ce5d6bc43c try to accomodate outdaten Python versions unsupported by current transformers kba 2026-05-09 18:03:40 +02:00
  • 03f3f9af17 update model zoo and docs to link to v0_8_0 model release on zenodo kba 2026-05-09 17:58:59 +02:00
  • a61fb09ec5 CI: drop py3.8 (u/a for new req transformers >= 5) Robert Sachunsky 2026-05-09 04:14:49 +02:00
  • 4406a0299e update CLI test for binarization… Robert Sachunsky 2026-05-09 04:12:19 +02:00
  • 4cd398bd0d standalone binarization: update, simplify… Robert Sachunsky 2026-05-09 04:12:02 +02:00
  • 29abae0144 update CLI test for enhancer… Robert Sachunsky 2026-05-09 02:59:52 +02:00
  • c1b6a61301 standalone enhancer: make this work (at all)… Robert Sachunsky 2026-05-09 02:55:01 +02:00
  • d63ce5538c resize_image_with_column_classifier(): apply num_col bounds here too Robert Sachunsky 2026-05-09 02:53:04 +02:00
  • 6df2144c0f fix 2 typos in previous commits… Robert Sachunsky 2026-05-09 02:31:22 +02:00
  • daf0c90d6e
    Merge pull request #8 from bertsky/ro-fixes-training-reload Robert Sachunsky 2026-05-08 18:46:43 +02:00
  • 395decd6d6
    Merge pull request #7 from qurator-spk/ro-fixes-training-reload-additions Robert Sachunsky 2026-05-08 18:45:28 +02:00
  • 3a9d72d3fc
    Merge pull request #6 from qurator-spk/update-cd Robert Sachunsky 2026-05-08 18:44:49 +02:00
  • ea8f985ff1 apply cropping only after textline and early layout… Robert Sachunsky 2026-05-08 18:41:47 +02:00
  • 58afdf5e87 do_prediction*(): ensure always returns dtype=uint8 Robert Sachunsky 2026-05-08 17:36:31 +02:00
  • 68a26a5c3f do_prediction*(): smooth window transitions with sigmoid… Robert Sachunsky 2026-05-08 05:18:00 +02:00
  • cefe596f8b do_prediction*(): avoid unnecessary tiles, simplify… Robert Sachunsky 2026-05-08 00:55:18 +02:00