1
0
Fork 0
mirror of https://github.com/mikegerber/ocrd_calamari.git synced 2025-06-09 19:59:53 +02:00
Commit graph

118 commits

Author SHA1 Message Date
71096493ac 📝 README-DEV: Improve info about releasing 2020-02-06 13:04:29 +01:00
b26194179c 📝 README-DEV: Improve markdown 2020-02-06 13:03:06 +01:00
cf7a788854 📝 README-DEV: Mention cleaning up the dict/ directory 2020-02-06 13:02:02 +01:00
4508e3ec47 📦 v0.0.4 2020-02-05 17:55:51 +01:00
73beab1770 📝 README: Add a missing cd 2020-02-05 17:49:31 +01:00
3416a155ec 📝 README: Provide a complete example using real data and other processors
See #33.
2020-02-05 17:39:49 +01:00
f2001a79f1 Merge branch 'master' of https://github.com/OCR-D/ocrd_calamari 2020-02-05 16:19:12 +01:00
3e426b2a0a 📝 README: Use gt4histocr-calamari from the Makefile in the example
See #33.
2020-02-05 16:18:30 +01:00
46fe34400f
📝 README: Link to the correct ocrd-tool.json 2020-02-05 13:33:52 +01:00
0c7cd69526
📝 README: Update intro that we're mostly on par with Calamari's functionality 2020-02-05 13:33:02 +01:00
b802b4deaf Allow configuring a cut off confidence value for glyph alternatives 2020-02-05 13:29:44 +01:00
e39a2bce01 📝 Fix example parameters JSON 2020-02-05 13:07:56 +01:00
ef3fb44fb5 Allow controlling of output hierarchy level, e.g. only line, not words+glyphs 2020-02-05 13:02:10 +01:00
0f0bae18ba Remove GT text to not accidently check it instead of OCR text 2020-02-04 19:29:56 +01:00
82fe0333f1 Test word segmentation (Fixes #30) 2020-02-04 18:40:06 +01:00
9010250911 ♻ test: Move binarization into the workspace fixture 2020-02-04 13:54:45 +01:00
6f4736f8e4 Do word segmentation as expected by OCR-D PAGE specs 2020-02-03 19:10:16 +01:00
0f9c94e7dc 🐛 Start with TextEquiv index=1 to adhere to OCR-D PAGE conventions
https://ocr-d.github.io/page#multiple-textequivs
2020-02-03 17:40:45 +01:00
909632493b 🚧 Add future TODOs 2020-02-03 17:37:19 +01:00
3149e1d9e0 📝 unwanted() 2020-02-03 15:33:38 +01:00
91cca1e1b8 📝 Document why we are using Unicode text segmentation to produce word results 2020-02-03 15:33:11 +01:00
0a572df0ba 📝 README: Add information about the new glyph and word segmentation 2020-02-03 15:32:26 +01:00
2650189910 🧹 Add whitespace 2020-02-03 14:03:54 +01:00
f75426060e 🧹 Remove debugging print 2020-02-03 14:03:28 +01:00
decaa7b69f 🎨 Use polygon_from_x0y0x1y1 to build word/glyph polygon 2020-02-03 14:03:01 +01:00
2ccfc7b195 🎨 Set vim textwidth 2020-02-03 12:22:22 +01:00
507bc1ce5e Include proper word + glyph segmentation 2020-02-03 12:22:01 +01:00
24532f693a 🚧 Use character positions as word segmentation 2020-01-31 17:45:00 +01:00
17dbeb2480 🔧 Loosen tensorflow-gpu dependency a bit to 1.15.* 2020-01-31 10:57:30 +01:00
c416e0c253 Revert "🐛 Use the documented package name for TensorFlow 1.15.x"
This reverts commit 739f43e9da.
2020-01-31 10:56:16 +01:00
5dfd809fbc 🐛 CircleCI: Try upgrading pip 2020-01-31 10:46:50 +01:00
7d02c8dff0 📝 README-DEV: Document installing test requirements 2020-01-30 19:06:42 +01:00
739f43e9da 🐛 Use the documented package name for TensorFlow 1.15.x 2020-01-30 19:04:20 +01:00
dependabot[bot]
c09fe169f2
Bump tensorflow-gpu from 1.14.0 to 1.15.2
Bumps [tensorflow-gpu](https://github.com/tensorflow/tensorflow) from 1.14.0 to 1.15.2.
- [Release notes](https://github.com/tensorflow/tensorflow/releases)
- [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md)
- [Commits](https://github.com/tensorflow/tensorflow/compare/v1.14.0...v1.15.2)

Signed-off-by: dependabot[bot] <support@github.com>
2020-01-28 22:17:13 +00:00
c9cf782dbf
Merge pull request #26 from OCR-D/fix-circle
circle: set locale to a UTF-8 variant so python doesn't fall back to ascii
2020-01-16 16:14:13 +01:00
Konstantin Baierer
60aa158341 circle: set locale to a UTF-8 variant so python doesn't fall back to ascii 2020-01-15 18:12:36 +01:00
1c36265599 ⬆ Update ocrd 2020-01-15 17:39:01 +01:00
2797b0e806 CircleCI: Try to fix encoding problem 2020-01-15 16:15:25 +01:00
e8f60f9bf4 CircleCI: Try to fix encoding problem 2020-01-15 16:08:11 +01:00
7bdd15648f CircleCI: Try to fix encoding problem 2020-01-15 15:59:30 +01:00
d2ca24bf1e CircleCI: Try to fix encoding problem 2020-01-15 15:54:09 +01:00
357a2a970a ⬆ Update model download URL 2020-01-15 15:05:33 +01:00
49b6dfe735 🧹 Clean up trailing whitespace 2019-12-19 16:30:11 +01:00
95281f3d29 Add metadata about the recognition operation w/ parameter info 2019-12-19 16:24:34 +01:00
dc38f0ee51 🎨 Use TOOL constant convention from the other OCR-D processors 2019-12-19 16:23:16 +01:00
b8129c6425 🧹 Do not advertise and support untested models 2019-12-11 17:53:26 +01:00
5273247ab3 Remove broken __main__ handling (stick to pytest) 2019-12-05 13:47:31 +01:00
e1b9d381a0 Actually binarize the image (not grayscale!) 2019-12-05 13:40:48 +01:00
e07b333db1 Convert to a pytest style test 2019-12-05 13:38:45 +01:00
2393edc645 CircleCI: Install imagemagick 2019-12-05 13:06:59 +01:00