Commit graph

  • 78bfa97c06
    Merge pull request #129 from qurator-spk/resolving_issue_106 Clemens Neudecker 2024-08-23 14:10:26 +02:00
  • 84d05bd0ae s,url,local_filename, kba 2024-08-23 14:01:20 +02:00
  • c10a525675 inference with batch size bigger than 1 vahidrezanezhad 2024-08-23 02:18:16 +02:00
  • 9904846776 using prepared binarized images in the case of augmentation vahidrezanezhad 2024-08-22 21:58:09 +02:00
  • 61cdd2acb8 using prepared binarized images in the case of augmentation vahidrezanezhad 2024-08-22 21:58:09 +02:00
  • f31219b1c9 scaling, channels shuffling, rgb background and red content added to no patch augmentation vahidrezanezhad 2024-08-21 19:33:23 +02:00
  • aeb2ee4e3e scaling, channels shuffling, rgb background and red content added to no patch augmentation vahidrezanezhad 2024-08-21 19:33:23 +02:00
  • 95bbdf8040 updating augmentations vahidrezanezhad 2024-08-21 16:17:59 +02:00
  • 445c45cb87 updating augmentations vahidrezanezhad 2024-08-21 16:17:59 +02:00
  • 7be326d689 augmentation function for red textlines, rgb background and scaling for no patch case vahidrezanezhad 2024-08-21 00:48:30 +02:00
  • 5e1821a741 augmentation function for red textlines, rgb background and scaling for no patch case vahidrezanezhad 2024-08-21 00:48:30 +02:00
  • 7f99526b9d update Makefile model location cneud 2024-08-15 23:59:18 +02:00
  • 4f8210de71 update Makefile model location cneud 2024-08-15 23:23:48 +02:00
  • 6f4205ba49 update pyproject.toml vahidrezanezhad 2024-08-15 16:08:45 +02:00
  • 74eac4dacc dtype = object in the case of length 1 arise error vahidrezanezhad 2024-08-15 13:50:36 +02:00
  • 8f76966394 update pyproject.toml for v0.3.1 cneud 2024-08-14 19:51:48 +02:00
  • 28ee1e527e update pyproject.toml for v0.3.1 cneud 2024-08-14 19:50:57 +02:00
  • 4c50479cb8 pyproject.toml may work for ocrd vahidrezanezhad 2024-08-14 15:28:36 +02:00
  • 53fd5fb2a5 resolving #106 for pyproject.toml test vahidrezanezhad 2024-08-14 14:42:37 +02:00
  • e976778796 testing pyproject.toml vahidrezanezhad 2024-08-14 14:33:01 +02:00
  • 23ac58405c
    update pyproject.toml Clemens Neudecker 2024-08-13 21:47:32 +02:00
  • 85dd59f23e update vahidrezanezhad 2024-08-09 13:20:09 +02:00
  • bf5837bf6e update vahidrezanezhad 2024-08-09 13:20:09 +02:00
  • f4bad09083 save only layout output. different from overlayed layout on image vahidrezanezhad 2024-08-09 12:46:18 +02:00
  • 3b90347a94 save only layout output. different from overlayed layout on image vahidrezanezhad 2024-08-09 12:46:18 +02:00
  • e3edb0ec30 update vahidrezanezhad 2024-08-09 02:23:17 +02:00
  • b6bdf942fd
    add documentation from wiki as markdown file to the codebase Clemens Neudecker 2024-08-08 16:35:06 +02:00
  • 2d83b8faad add documentation from wiki as markdown file to the codebase Clemens Neudecker 2024-08-08 16:35:06 +02:00
  • 8e2cdad1be extracting images only - avoid artifacts with heuristics vahidrezanezhad 2024-08-07 23:22:27 +02:00
  • 00bf2b64d0 1&2 column images only printspace vahidrezanezhad 2024-08-07 19:07:54 +02:00
  • be144db9f8 updating 1&2 columns images + full layout vahidrezanezhad 2024-08-07 18:13:10 +02:00
  • a62ae370c3 new full layout model and early layout for 1&2 column images are integrated - light version vahidrezanezhad 2024-08-07 02:21:01 +02:00
  • 9170a9f21c only images extraction - update inference parameters vahidrezanezhad 2024-08-06 16:11:32 +02:00
  • 59e5892f25 erosion rate changed vahidrezanezhad 2024-08-01 14:30:51 +02:00
  • 6fb28d6ce8 erosion rate changed vahidrezanezhad 2024-08-01 14:30:51 +02:00
  • f0e7f75499 Update README.md cneud 2024-08-01 00:30:25 +02:00
  • 7ded54a8d2 rename GH action cneud 2024-08-01 00:25:31 +02:00
  • c9f63826c0 create draft pyproject.toml cneud 2024-08-01 00:13:42 +02:00
  • 8862df9156 format options table cneud 2024-07-31 22:53:36 +02:00
  • 38698c6609 Update README.md cneud 2024-07-31 21:16:02 +02:00
  • 40f5408b1e improve huggingface url cneud 2024-07-31 20:02:56 +02:00
  • 3cfa447e84 remove CircleCI cneud 2024-07-31 20:01:36 +02:00
  • ad133e3425 Update model download url cneud 2024-07-31 19:49:43 +02:00
  • 5fbe941f53 inference updated vahidrezanezhad 2024-07-24 18:00:39 +02:00
  • 381976099f inference updated vahidrezanezhad 2024-07-24 18:00:39 +02:00
  • 30894ddc75 erosion and dilation parameters are changed & separators are written in label images after artificial label vahidrezanezhad 2024-07-24 16:52:05 +02:00
  • 2c822dae4e erosion and dilation parameters are changed & separators are written in label images after artificial label vahidrezanezhad 2024-07-24 16:52:05 +02:00
  • c340fbb721 increasing margin in the case of pixelwise inference b-vr103 2024-07-23 11:29:05 +02:00
  • 840d7c2283 increasing margin in the case of pixelwise inference b-vr103 2024-07-23 11:29:05 +02:00
  • f2692cf8dd brightness augmentation modified b-vr103 2024-07-17 18:20:24 +02:00
  • 861f0b1ebd brightness augmentation modified b-vr103 2024-07-17 18:20:24 +02:00
  • 9521768774 adding degrading and brightness augmentation to no patches case training vahidrezanezhad 2024-07-17 17:14:20 +02:00
  • 453d0fbf92 adding degrading and brightness augmentation to no patches case training vahidrezanezhad 2024-07-17 17:14:20 +02:00
  • 5144668834 ocr engine first integration vahidrezanezhad 2024-07-17 10:01:37 +02:00
  • 55f3cb9a84 printspace_as_class_in_layout is integrated. Printspace can be defined as a class for layout segmentation vahidrezanezhad 2024-07-16 18:29:27 +02:00
  • 3bceec9c19 printspace_as_class_in_layout is integrated. Printspace can be defined as a class for layout segmentation vahidrezanezhad 2024-07-16 18:29:27 +02:00
  • 647a3f8cc4 resolving typo vahidrezanezhad 2024-07-09 03:04:29 +02:00
  • 9260d2962a resolving typo vahidrezanezhad 2024-07-09 03:04:29 +02:00
  • c0faecec2c update inference vahidrezanezhad 2024-06-21 23:42:25 +02:00
  • fe69b9c4a8 update inference vahidrezanezhad 2024-06-21 23:42:25 +02:00
  • 033cf6734b update reading order machine based vahidrezanezhad 2024-06-21 13:06:26 +02:00
  • b3cd01de37 update reading order machine based vahidrezanezhad 2024-06-21 13:06:26 +02:00
  • 9358657a0d update config vahidrezanezhad 2024-06-12 17:40:40 +02:00
  • 66022cf771 update config vahidrezanezhad 2024-06-12 17:40:40 +02:00
  • 743f2e97d6 Transformer+CNN structure is added to vision transformer type vahidrezanezhad 2024-06-12 17:39:57 +02:00
  • 22d7359db2 Transformer+CNN structure is added to vision transformer type vahidrezanezhad 2024-06-12 17:39:57 +02:00
  • f1fd74c7eb transformer patch size is dynamic now. vahidrezanezhad 2024-06-12 13:26:27 +02:00
  • 95faf1a4c8 transformer patch size is dynamic now. vahidrezanezhad 2024-06-12 13:26:27 +02:00
  • 2aa216e388 binarization as a separate task of segmentation vahidrezanezhad 2024-06-11 17:48:30 +02:00
  • 29da23da76 binarization as a separate task of segmentation vahidrezanezhad 2024-06-11 17:48:30 +02:00
  • 41a0e15e79 updating train.py nontransformer backend vahidrezanezhad 2024-06-10 22:15:30 +02:00
  • 1921e6754f updating train.py nontransformer backend vahidrezanezhad 2024-06-10 22:15:30 +02:00
  • 815e5a1d35 updating train.py vahidrezanezhad 2024-06-07 16:24:31 +02:00
  • cc91e4b12c updating train.py vahidrezanezhad 2024-06-07 16:24:31 +02:00
  • dc356a5f42 just defined graphic region types can be extracted as label vahidrezanezhad 2024-06-06 18:55:22 +02:00
  • 4c376289e9 just defined graphic region types can be extracted as label vahidrezanezhad 2024-06-06 18:55:22 +02:00
  • b1d971a200 just defined textregion types can be extracted as label vahidrezanezhad 2024-06-06 18:47:30 +02:00
  • 0e4dd0b9ef just defined textregion types can be extracted as label vahidrezanezhad 2024-06-06 18:47:30 +02:00
  • 1c8873ffa3 just defined textregion types can be extracted as label vahidrezanezhad 2024-06-06 18:45:47 +02:00
  • 5a5914e06c just defined textregion types can be extracted as label vahidrezanezhad 2024-06-06 18:45:47 +02:00
  • e25a925169
    Update README.md vahidrezanezhad 2024-06-06 14:46:06 +02:00
  • 742e3c2aa2 Update README.md vahidrezanezhad 2024-06-06 14:46:06 +02:00
  • b9cbc0edb7 replacement in a list done correctly vahidrezanezhad 2024-06-06 14:38:29 +02:00
  • 13ebe71d13 replacement in a list done correctly vahidrezanezhad 2024-06-06 14:38:29 +02:00
  • 821290c464 scaling and cropping of labels and org images vahidrezanezhad 2024-05-30 16:59:50 +02:00
  • 3ef0dbdd42 scaling and cropping of labels and org images vahidrezanezhad 2024-05-30 16:59:50 +02:00
  • 4640d9f2dc modifying xml parsing vahidrezanezhad 2024-05-30 12:56:56 +02:00
  • 47a1646451 modifying xml parsing vahidrezanezhad 2024-05-30 12:56:56 +02:00
  • 785033536a min_area size of regions considered for reading order detection passed as an argument for inference vahidrezanezhad 2024-05-29 13:07:06 +02:00
  • 09789619a8 min_area size of regions considered for reading order detection passed as an argument for inference vahidrezanezhad 2024-05-29 13:07:06 +02:00
  • f6abefb0a8 reading order detection on xml with layout + result will be written in an output directory with the same file name vahidrezanezhad 2024-05-29 11:18:35 +02:00
  • 06ed006193 reading order detection on xml with layout + result will be written in an output directory with the same file name vahidrezanezhad 2024-05-29 11:18:35 +02:00
  • 2e7c69f2ac inference for reading order vahidrezanezhad 2024-05-28 16:48:51 +02:00
  • 4fb45a6711 inference for reading order vahidrezanezhad 2024-05-28 16:48:51 +02:00
  • 60cf0bddfd check_dpi: fix Pillow type detection Robert Sachunsky 2024-05-28 14:07:45 +02:00
  • c52ef98821 processor: reuse loaded models across pages, use derived images Robert Sachunsky 2023-06-11 22:14:41 +02:00
  • 356da4cc53 min area size of text region passes as an argument for machine based reading order vahidrezanezhad 2024-05-28 10:14:16 +02:00
  • cc7577d2c1 min area size of text region passes as an argument for machine based reading order vahidrezanezhad 2024-05-28 10:14:16 +02:00
  • 29ddd4d909 pass degrading scales for image enhancement as a json file vahidrezanezhad 2024-05-28 10:01:17 +02:00
  • 467bbb2884 pass degrading scales for image enhancement as a json file vahidrezanezhad 2024-05-28 10:01:17 +02:00