Commit Graph

  • 2c93904985 avoiding double binarization vahidrezanezhad 2024-09-12 17:35:28 +0200
  • f0b49073b7 adding option for textline detection in printspace vahidrezanezhad 2024-09-03 23:10:38 +0200
  • c156a1612e
    Exclude `run_image_extraction_over_ppn_lists.py` from merge Clemens Neudecker 2024-09-03 20:03:44 +0200
  • 6b2e5d110e all tests are passed vahidrezanezhad 2024-09-03 13:55:55 +0200
  • c3a4a1bba7 resolving issue #110 in a better way vahidrezanezhad 2024-09-03 13:14:10 +0200
  • b6d3d2bdbf fix indentation cneud 2024-09-02 20:11:42 +0200
  • de32d86fb6 Merge branch 'refs/heads/main' into extracting_images_only cneud 2024-09-02 19:55:33 +0200
  • 0f87974b0c writing drop capitals in xml output + and may resolve issue #110 vahidrezanezhad 2024-09-02 16:21:07 +0200
  • a49fcb1754
    Merge c6e0e058d0 into 256a7c347f Konstantin Baierer 2024-09-02 12:53:44 +0000
  • c6e0e058d0 Merge branch 'main' into v3-api v3-api kba 2024-09-02 14:53:37 +0200
  • fdedae2406 require ocrd>=3.0.0b4 kba 2024-09-02 11:47:57 +0200
  • f9c2d85dd7 Merge branch 'main' into v3-api kba 2024-09-02 11:46:56 +0200
  • 9b274dcc20
    Merge pull request #134 from bertsky/v3-api Konstantin Baierer 2024-09-02 11:46:33 +0200
  • 17eafc1ccb adapt tool json to v3 Robert Sachunsky 2024-09-01 10:15:31 +0200
  • 1e902571ea undo customizing metadata_filename (not correct with namespace pkg support in core) Robert Sachunsky 2024-09-01 10:15:11 +0200
  • dfc4ac2538 setuptools: fix (packages.find.where prevented finding namespace qurator) Robert Sachunsky 2024-08-30 22:46:51 +0200
  • 256a7c347f
    Merge pull request #133 from qurator-spk/src-layout Clemens Neudecker 2024-08-29 23:13:37 +0200
  • 84b844203d switch from qurator namespace to src-layout src-layout kba 2024-08-29 17:11:29 +0200
  • 9367f86483 remove setup.py stub completely pyproject-setuptools-ocrd kba 2024-08-29 17:06:39 +0200
  • 93005959e5 inference batch size debugged vahidrezanezhad 2024-08-27 18:13:46 +0200
  • 62314c453c fully transition to pyproject kba 2024-08-27 15:04:57 +0200
  • a5c7f223d1 📦 v0.3.1 v0.3.1 kba 2024-08-27 14:54:59 +0200
  • 9ae0575436 📝 changelog kba 2024-08-27 14:52:01 +0200
  • 7ae6a8776f ignoring dpi check by light version vahidrezanezhad 2024-08-26 16:02:10 +0200
  • aef46a4669 require ocrd >= 3.0.0b1 kba 2024-08-26 11:31:13 +0200
  • 7b92620a10
    processor: no more DPI info lost Konstantin Baierer 2024-08-26 10:45:53 +0200
  • d26079db85 procesor.py: simplify imports further kba 2024-08-26 10:40:15 +0200
  • ecd202ea4c
    processor.py: Simplify import Konstantin Baierer 2024-08-26 10:39:22 +0200
  • d98fa2a85b check_dpi: fix Pillow type detection Robert Sachunsky 2024-05-28 14:07:45 +0200
  • 61bcb435ae processor: reuse loaded models across pages, use derived images Robert Sachunsky 2023-06-11 22:14:41 +0200
  • c37d95dedf non-legacy namespace package # Conflicts: # setup.py Robert Sachunsky 2024-05-23 21:19:33 +0200
  • 49c1a8f384 fix namespace pkg setup Robert Sachunsky 2024-05-24 14:29:57 +0000
  • 3381e5a015 adapt to OcrdFile.local_filename now :Path # Conflicts: # qurator/eynollah/processor.py Robert Sachunsky 2024-01-24 19:33:49 +0100
  • 8dfecb70d4 adapt to ocrd>=2.54 url vs local_filename Robert Sachunsky 2024-01-19 16:17:02 +0000
  • 8ec9fc6da2 Merge remote-tracking branch 'origin/refactor' into refactoring-2024-08 refactoring-2024-08-merged kba 2024-08-24 18:51:44 +0200
  • d7caeb2b05 ocrd interface: add ignore_page_extraction kba 2024-08-24 18:11:15 +0200
  • ddcc0198bd ocrd interface: add right_to_left kba 2024-08-24 18:05:21 +0200
  • 39b16e5978 ocrd interface: add textline_light kba 2024-08-24 18:00:45 +0200
  • 87adc4b0c6 ocrd interface: add light_mode parameter kba 2024-08-24 16:51:52 +0200
  • 0d83db7bc4 update processor to the latest change in bertsky/core#14 kba 2024-08-24 16:46:25 +0200
  • b954a55d26 move self.model_* to EynollaDirs kba 2024-08-24 16:15:21 +0200
  • 04e79002b3 making light version faster for 1 and 2 columns images vahidrezanezhad 2024-08-24 12:54:19 +0200
  • 59dbffea59 remove commented out code kba 2024-08-23 21:35:43 +0200
  • ac2958edb1 separate_lines.separate_lines_vertical_cont: remove unused args kba 2024-08-23 21:25:37 +0200
  • 9109e88d50 wip typing kba 2024-08-23 21:22:29 +0200
  • a5b178e1d1 remove dead code (found with vulture) kba 2024-08-23 21:11:48 +0200
  • 9ee9c4403b introduce self.batch_processing_mode to clarify when data is read from dir_in kba 2024-08-23 21:04:23 +0200
  • 532ee6fe41 rfct: introduce EynollahDirs to reduce self.dir_* proliferation kba 2024-08-23 20:55:14 +0200
  • 762a7a058e adapt to one-arg start_new_session_and_model and rename load_model kba 2024-08-23 20:29:44 +0200
  • 8c4bfa229f rfct: move all tensorflow/keras imports and hacks to utils.tf kba 2024-08-23 20:10:49 +0200
  • b15b1bdcd5 rfct: remove unused _old method kba 2024-08-23 19:59:12 +0200
  • 2c9727f9c9 move keras-specific classes to utils.keras, clean up imports kba 2024-08-23 19:53:04 +0200
  • d7a774ebd2 test_run: require EYNOLLAH_MODELS to be defined in environ kba 2024-08-23 19:52:02 +0200
  • d6a72709a1 remove unused image_filename_stem kwarg kba 2024-08-23 18:53:28 +0200
  • 9ce02a569e ocrd-tool: add "allow_enhancement" parameter kba 2024-08-23 18:32:59 +0200
  • 4a13781ef4 class Eynollah: add typing, consistent interface in CLI and OCR-D CLI kba 2024-08-23 18:32:29 +0200
  • 0a3f525f0a port processor to core v3 kba 2024-08-23 18:19:28 +0200
  • 78bfa97c06
    Merge pull request #129 from qurator-spk/resolving_issue_106 Clemens Neudecker 2024-08-23 14:10:26 +0200
  • 84d05bd0ae s,url,local_filename, kba 2024-08-23 14:01:20 +0200
  • c10a525675 inference with batch size bigger than 1 vahidrezanezhad 2024-08-23 02:18:16 +0200
  • 7f99526b9d update Makefile model location cneud 2024-08-15 23:59:18 +0200
  • 4f8210de71 update Makefile model location cneud 2024-08-15 23:23:48 +0200
  • 6f4205ba49 update pyproject.toml vahidrezanezhad 2024-08-15 16:08:45 +0200
  • 74eac4dacc dtype = object in the case of length 1 arise error vahidrezanezhad 2024-08-15 13:50:36 +0200
  • 8f76966394 update pyproject.toml for v0.3.1 cneud 2024-08-14 19:51:48 +0200
  • 28ee1e527e update pyproject.toml for v0.3.1 cneud 2024-08-14 19:50:57 +0200
  • 4c50479cb8 pyproject.toml may work for ocrd vahidrezanezhad 2024-08-14 15:28:36 +0200
  • 53fd5fb2a5 resolving #106 for pyproject.toml test vahidrezanezhad 2024-08-14 14:42:37 +0200
  • e976778796 testing pyproject.toml vahidrezanezhad 2024-08-14 14:33:01 +0200
  • 23ac58405c
    update pyproject.toml Clemens Neudecker 2024-08-13 21:47:32 +0200
  • e3edb0ec30 update vahidrezanezhad 2024-08-09 02:23:17 +0200
  • 8e2cdad1be extracting images only - avoid artifacts with heuristics vahidrezanezhad 2024-08-07 23:22:27 +0200
  • 00bf2b64d0 1&2 column images only printspace vahidrezanezhad 2024-08-07 19:07:54 +0200
  • be144db9f8 updating 1&2 columns images + full layout vahidrezanezhad 2024-08-07 18:13:10 +0200
  • a62ae370c3 new full layout model and early layout for 1&2 column images are integrated - light version vahidrezanezhad 2024-08-07 02:21:01 +0200
  • 9170a9f21c only images extraction - update inference parameters vahidrezanezhad 2024-08-06 16:11:32 +0200
  • f0e7f75499 Update README.md cneud 2024-08-01 00:30:25 +0200
  • 7ded54a8d2 rename GH action cneud 2024-08-01 00:25:31 +0200
  • c9f63826c0 create draft pyproject.toml cneud 2024-08-01 00:13:42 +0200
  • 8862df9156 format options table cneud 2024-07-31 22:53:36 +0200
  • 38698c6609 Update README.md cneud 2024-07-31 21:16:02 +0200
  • 40f5408b1e improve huggingface url cneud 2024-07-31 20:02:56 +0200
  • 3cfa447e84 remove CircleCI cneud 2024-07-31 20:01:36 +0200
  • b6ce6890f8
    Merge 60cf0bddfd into ad133e3425 Robert Sachunsky 2024-07-31 20:00:11 +0200
  • ad133e3425 Update model download url cneud 2024-07-31 19:49:43 +0200
  • 5144668834 ocr engine first integration vahidrezanezhad 2024-07-17 10:01:37 +0200
  • 60cf0bddfd check_dpi: fix Pillow type detection Robert Sachunsky 2024-05-28 14:07:45 +0200
  • c52ef98821 processor: reuse loaded models across pages, use derived images Robert Sachunsky 2023-06-11 22:14:41 +0200
  • 721d3f70a0
    Merge pull request #127 from bertsky/new-namespace-pkg vahidrezanezhad 2024-05-24 16:47:48 +0200
  • 9644c8fcb7 fix namespace pkg setup Robert Sachunsky 2024-05-24 14:29:57 +0000
  • 45bd76f5e8 fix namespace pkg setup Robert Sachunsky 2024-05-24 14:27:56 +0000
  • a367620b02
    non-legacy namespace package Robert Sachunsky 2024-05-23 21:19:33 +0200
  • f88ee99f3c
    non-legacy namespace package Robert Sachunsky 2024-05-23 21:17:38 +0200
  • 899bb9f00c
    update GitHub actions Clemens Neudecker 2024-04-10 15:27:29 +0200
  • ba64282118
    Update README.md Clemens Neudecker 2024-04-03 19:58:24 +0200
  • 1469dd505f remove Python 3.12 from tests again for now cneud 2024-03-20 00:37:12 +0100
  • 533736a3e3
    update supported Python+Tensorflow version combinations Clemens Neudecker 2024-03-20 00:28:22 +0100
  • c3d9315b9e pin `scikit-learn` version supporting Python 3.8-3.12 cneud 2024-03-19 23:33:57 +0100
  • 6524b0b850 Update pyproject.toml.draft cneud 2024-03-19 23:21:29 +0100
  • 8f6dd0649e update `scikit-learn` to version supporting Python 3.12 cneud 2024-03-19 23:19:21 +0100