1
0
Fork 0
mirror of https://github.com/mikegerber/ocrd_calamari.git synced 2025-06-09 11:49:53 +02:00
Commit graph

234 commits

Author SHA1 Message Date
23821cba57 ✔ Do not delete test workspace when DEBUG env variable is set 2023-03-13 19:35:04 +01:00
2997a471f5 ✔ CircleCI: Test on Python 3.11, too 2023-03-13 19:34:33 +01:00
a858144d4d ✔ CircleCI: Install binary OpenCV for Python 3.6 2023-03-13 19:27:34 +01:00
acfdd20696 Revert "✔ CircleCI: Do not test on Python 3.6 anymore"
This reverts commit 6d022f3c9e.
2023-03-13 17:51:31 +01:00
6d022f3c9e ✔ CircleCI: Do not test on Python 3.6 anymore 2023-03-13 17:49:57 +01:00
2c54a265a6 ✔ Fix tests to use the new filenames
OCR-D seems to create the files in the output file group with a new
naming scheme (based on file group name and page id); Adjust the tests
accordingly.

It's probably not the best approach to expect certain filenames, and
it would be better to use METS/the OCR-D API to ask for a certain page
and it's file, but for now we use this.
2023-03-13 17:14:50 +01:00
1dcec06c2b 🐛 Fix NumPy dependency (hopefully...) 2023-03-09 18:20:29 +01:00
24c7762353 🐛 Fix syntax error in setup.py 2023-03-09 18:18:46 +01:00
699554dc8a 🐛 Require NumPy < 1.24 due to np.str deprecation/error
NumPy has deprecated np.str (etc.) since NumPy 1.20, and since Numpy
1.24 throws an error. We can't currently update to Calamari 2 (and fix
the problem there if necessary), so stick to NumPy 1.23.x for now.

https://github.com/OCR-D/ocrd_calamari/issues/87
2023-03-09 18:11:58 +01:00
c7ad6eb724 📦 v1.0.5 2022-09-16 16:47:25 +02:00
eb48dcbd84
Merge pull request #76 from bertsky/skip-empty
recognize: skip tiny or bin-empty lines, too
2022-09-16 14:36:05 +02:00
91cced4e0d
Merge pull request #80 from mikegerber/fix/readme-no-checkpoint
🧹 README: Do not mention obsolete + unsupported checkpoint parameter
2022-09-16 11:33:36 +02:00
33f5d63373 🧹 README: Do not mention obsolete + unsupported checkpoint parameter
Fixes #79.
2022-09-16 11:25:38 +02:00
3767faeeaf 📝 README-DEV: Update wrt ocrd-tool.json and CI 2022-09-15 18:43:09 +02:00
77ff6df5a4 📦 v1.0.4 2022-09-15 18:19:56 +02:00
9866b06e87 🔐 Use a safe temporary directory for the test workspace 2022-09-15 17:00:22 +02:00
Robert Sachunsky
8c2e4ca76d recognize: skip tiny or bin-empty lines, too 2022-08-19 12:14:36 +02:00
395e43c074
Merge pull request #75 from bertsky/recurse-regions
descend to all available TextRegions recursively
2022-07-12 14:02:17 +02:00
Robert Sachunsky
36e513604e descend to all available TextRegions recursively 2022-07-11 00:14:16 +02:00
ecb0d67567 ✔️ Handle PAGE XML namespace dynamically and without hardcoding 2022-02-24 18:41:21 +01:00
ef63291b5a ✔️ CircleCI: Use a test matrix to test on different Python versions 2022-02-24 17:05:50 +01:00
50877f87be ✔️ Move installing deps to Makefile 2022-02-24 16:54:56 +01:00
97e28a9f67 ✔️ CircleCI: Test using the python:3.6.12 image 2022-02-24 16:50:48 +01:00
85f5b685eb
Merge pull request #74 from bertsky/fix-tests-again
test: workaround for core#809
2022-02-24 15:30:53 +01:00
Robert Sachunsky
0f80198892 test: workaround for core#809 2022-02-24 13:36:52 +01:00
37ac602d72
git push 2022-02-23 14:33:37 +01:00
1eb342ef65
Merge pull request #70 from bertsky/patch-2
add checkpoint_dir content-type, remove checkpoint variant
2022-02-23 13:12:03 +01:00
Robert Sachunsky
5fddd32929 test: use other fileGrp to avoid assets#87 2022-02-11 07:56:01 +01:00
Robert Sachunsky
1f0252d0d7 test: workspace download instead of urllib 2022-02-11 07:55:33 +01:00
Robert Sachunsky
59089fbe89 test: use resmgr for downloading model 2022-02-11 07:52:50 +01:00
Robert Sachunsky
7661662e3e test: fix initLogging 2022-02-11 07:51:24 +01:00
Robert Sachunsky
01312c6369 recognize: delegate to core functions 2022-02-11 07:50:58 +01:00
Robert Sachunsky
13031d585f fix deps 2022-02-11 07:50:04 +01:00
Robert Sachunsky
332d02b13e
adapt to checkpoint_dir only 2022-02-10 18:06:31 +01:00
Robert Sachunsky
5f23c03cd9
recognize: remove checkpoint param in favour of checkpoint_dir alone 2022-02-10 17:41:51 +01:00
Robert Sachunsky
11615be6b2
ocrd-tool.json: add model content-type, remove glob variant 2022-02-10 17:38:37 +01:00
76b34c50cb 📦 v1.0.3 2021-07-21 11:56:37 +02:00
bacd6747d8 🐛 Fix TensorFlow (and h3py) dependency
TF 2.5 works for Python 3.6 + 3.9. Fixes #66.
2021-07-21 11:54:10 +02:00
34013ddb02 📝 Reduce process() docstring again 2021-07-02 20:24:20 +02:00
b4c3b026db
Merge pull request #65 from bertsky/setup-init
init from constructor not during process
2021-07-02 20:20:34 +02:00
fcd4aba0ff
Merge pull request #64 from bertsky/patch-1
requirements: prevent TF 2.5
2021-07-01 19:57:14 +02:00
Robert Sachunsky
4c6d6655e1 improve process() docstring 2021-06-26 13:50:20 +02:00
Robert Sachunsky
3bde7cb37f init from constructor not process(), use conventional name setup() 2021-06-26 13:50:00 +02:00
Robert Sachunsky
2fd6e536f6
requirements: prevent TF 2.5 2021-06-19 23:18:43 +02:00
da98713e73 📦 v1.0.2 2021-02-09 20:03:49 +01:00
0869386ec4 🐛 Fix word and glyph coordinates
Fixes GH-57.
2021-02-09 19:57:03 +01:00
41a48455f8 ✔️ Explicitly check for None
Explicitly check for None to remove this warning:

  test/test_recognize.py::test_word_segmentation
    /home/mike/devel/ocrd_calamari/test/test_recognize.py:133: FutureWarning: The behavior of this method will change in future versions. Use specific 'len(elem)' or 'elem is not None' test instead.
      assert line
2021-02-09 18:57:47 +01:00
3a4013572c
Merge pull request #10 from bertsky/image-features
Image features
2021-02-09 18:43:17 +01:00
f4c0fe8570 🐛 Fix small merge merge error (text not checked in test_recognize_should_warn_if_given_rgb_image_and_single_channel_model) 2021-02-09 18:29:49 +01:00
4cf25b8119 🎨 Rename input_channels variable to network_input_channels 2021-02-09 18:20:46 +01:00