ocrd_calamari

mirror of https://github.com/mikegerber/ocrd_calamari.git synced 2025-07-22 08:29:55 +02:00

Author	SHA1	Message	Date
Mike Gerber	3156121ff7	📝 Let intro mention ocrd_calamari + PAGE XML, link to OCR-D	2020-10-01 13:21:14 +02:00
Gerber, Mike	bb9b1ab41b	🐛 CircleCI: Ignore screenshots branch (second try)	2020-09-03 11:55:00 +02:00
Gerber, Mike	7705374cfc	🐛 CircleCI: Ignore screenshots branch	2020-09-03 11:44:47 +02:00
Gerber, Mike	c417a0ab77	📝 README: Add a screenshot of example output	2020-09-03 11:31:11 +02:00
Mike Gerber	210c126003	Merge pull request #42 from OCR-D/file-ids-and-such use make_file_id and assert_file_grp_cardinality	2020-08-06 17:37:06 +02:00
Konstantin Baierer	f746b73fd0	use make_file_id and assert_file_grp_cardinality	2020-08-06 15:24:53 +02:00
Gerber, Mike	f6dfedf837	🗒️ README-DEV: Also release on GitHub	2020-08-06 14:04:17 +02:00
Gerber, Mike	86410110bc	📦 v0.0.7	2020-08-06 12:39:45 +02:00
Mike Gerber	4c85b8311f	Merge pull request #41 from OCR-D/fix/set-pcgtsid Set pcGtsId	2020-08-06 12:37:17 +02:00
Gerber, Mike	7da45a0ec1	Set pcGtsId Newest OCR-D validation checks PAGE-XML pcGtsId against METS file/@ID. Set the pcGtsId here correctly. Fixes #40.	2020-08-06 12:31:47 +02:00
Gerber, Mike	046e3e8ee3	🚧 Tests: Add some TODOs re data + namespace version changes	2020-08-06 11:27:59 +02:00
Gerber, Mike	027fcd7d75	🐛 Fix test file path	2020-07-21 20:10:36 +02:00
Mike Gerber	c6ced9b3e9	Merge pull request #39 from OCR-D/dont-install-test setup.py: exclude "test", not "tests", from installation	2020-06-04 12:07:03 +02:00
kba	e03ff4064b	setup.py: exclude "test", not "tests", from installation	2020-05-31 20:31:06 +02:00
Mike Gerber	fb538845d8	📄 Update license (Fixes #35 ) Set copyright owner name. Also, going along the lines of "update the year when substantial revision of the work happenend", set the copyright years. The latter may be not be necessary, because "life of author + 70 years" or something.	2020-02-13 16:49:11 +01:00
Gerber, Mike	123ee61a8b	v0.0.6	2020-02-13 16:04:17 +01:00
Gerber, Mike	69df78bce1	🐛 setup.py: Fix GitHub url by s/kba/OCR-D	2020-02-13 16:02:02 +01:00
Gerber, Mike	62e5e0c295	🐛 ocrd-tool.json: Fix GitHub url by s/kba/OCR-D	2020-02-13 16:00:58 +01:00
Gerber, Mike	0334a35870	🐛 Sort predictions in exactly the same way, also when building the text	2020-02-12 17:18:37 +01:00
Gerber, Mike	0c9e1f13c7	🐛 Sort predictions in exactly the same way to make sure we are correctly removing spaces	2020-02-12 16:38:45 +01:00
Gerber, Mike	d2c843aa3f	📦 v0.0.5	2020-02-12 13:33:38 +01:00
Gerber, Mike	cd8f6a5fcb	🐛 Use line id for debug message	2020-02-12 13:32:10 +01:00
Gerber, Mike	5b6d8b3f41	🐛 Build line text on our own Calamari does whitespace post-processing on prediction.sentence, while it does not do the same on prediction.positions. Do it on our own to have consistency. Fixes GH-37.	2020-02-12 12:25:25 +01:00
Gerber, Mike	30f7e1b246	🐳 Docker: Run pip3 check for good measure	2020-02-06 14:01:36 +01:00
Gerber, Mike	303172b279	📝 Document make targets	2020-02-06 13:53:55 +01:00
Gerber, Mike	a2d1d76dbd	🐳 Docker: Do not use the make target to install calamari-ocr, stick to pip	2020-02-06 13:52:05 +01:00
Gerber, Mike	41f5c8a8fa	🐳 Docker: Upgrade pip to silence warning and fix potential other problems	2020-02-06 13:44:43 +01:00
Gerber, Mike	7c18b1d391	🐳 Docker: Use ocrd/core:master instead of outdated :edge	2020-02-06 13:43:59 +01:00
Gerber, Mike	1fda419f25	🐳 Fix Docker build	2020-02-06 13:43:36 +01:00
Gerber, Mike	71096493ac	📝 README-DEV: Improve info about releasing	2020-02-06 13:04:29 +01:00
Gerber, Mike	b26194179c	📝 README-DEV: Improve markdown	2020-02-06 13:03:06 +01:00
Gerber, Mike	cf7a788854	📝 README-DEV: Mention cleaning up the dict/ directory	2020-02-06 13:02:02 +01:00
Gerber, Mike	4508e3ec47	📦 v0.0.4	2020-02-05 17:55:51 +01:00
Gerber, Mike	73beab1770	📝 README: Add a missing `cd`	2020-02-05 17:49:31 +01:00
Gerber, Mike	3416a155ec	📝 README: Provide a complete example using real data and other processors See #33.	2020-02-05 17:39:49 +01:00
Gerber, Mike	f2001a79f1	Merge branch 'master' of https://github.com/OCR-D/ocrd_calamari	2020-02-05 16:19:12 +01:00
Gerber, Mike	3e426b2a0a	📝 README: Use gt4histocr-calamari from the Makefile in the example See #33.	2020-02-05 16:18:30 +01:00
Mike Gerber	46fe34400f	📝 README: Link to the correct ocrd-tool.json	2020-02-05 13:33:52 +01:00
Mike Gerber	0c7cd69526	📝 README: Update intro that we're mostly on par with Calamari's functionality	2020-02-05 13:33:02 +01:00
Gerber, Mike	b802b4deaf	✨ Allow configuring a cut off confidence value for glyph alternatives	2020-02-05 13:29:44 +01:00
Gerber, Mike	e39a2bce01	📝 Fix example parameters JSON	2020-02-05 13:07:56 +01:00
Gerber, Mike	ef3fb44fb5	✨ Allow controlling of output hierarchy level, e.g. only line, not words+glyphs	2020-02-05 13:02:10 +01:00
Gerber, Mike	0f0bae18ba	✅ Remove GT text to not accidently check it instead of OCR text	2020-02-04 19:29:56 +01:00
Gerber, Mike	82fe0333f1	✅ Test word segmentation (Fixes #30 )	2020-02-04 18:40:06 +01:00
Gerber, Mike	9010250911	♻ test: Move binarization into the workspace fixture	2020-02-04 13:54:45 +01:00
Gerber, Mike	6f4736f8e4	✨ Do word segmentation as expected by OCR-D PAGE specs	2020-02-03 19:10:16 +01:00
Gerber, Mike	0f9c94e7dc	🐛 Start with TextEquiv index=1 to adhere to OCR-D PAGE conventions https://ocr-d.github.io/page#multiple-textequivs	2020-02-03 17:40:45 +01:00
Gerber, Mike	909632493b	🚧 Add future TODOs	2020-02-03 17:37:19 +01:00
Gerber, Mike	3149e1d9e0	📝 unwanted()	2020-02-03 15:33:38 +01:00
Gerber, Mike	91cca1e1b8	📝 Document why we are using Unicode text segmentation to produce word results	2020-02-03 15:33:11 +01:00

1 2 3

147 commits