mirror of https://github.com/qurator-spk/dinglehopper.git synced 2025-06-08 19:30:01 +02:00
Commit graph

397 commits

Author SHA1 Message Date
ced6504ad0 🎨 dinglehopper: Expose clearing the Levenshtein cache as a function 2019-11-20 13:24:45 +01:00
5cf4eddaeb dinglehopper: Clear Levenshtein cache between OCR-D files 2019-11-20 13:05:45 +01:00
1206c9bb4b 📝 dinglehopper: Document installation + testing 2019-11-18 16:32:42 +01:00
58ff140bc0 dinglehopper: Improve performance by caching the Levenshtein matrix
Motivated by [a pull
request](https://github.com/qurator-spk/dinglehopper/pull/7) by
@JKamlah, implement a cache of the Levenshtein matrix calculation.

We calculated the Levenshtein matrixes for characters and words twice:
Once for the error rates, once for the alignment.
2019-11-18 15:33:17 +01:00
11a6341641 🧹 dinglehopper: Remove broken implementation of the unordered word error rate 2019-11-18 15:03:17 +01:00
f22228840e 🧹 dinglehopper: Use exclusively relative imports in tests 2019-11-18 14:31:43 +01:00
d61c076aad 🧹 dinglehopper: Remove debug print()s 2019-11-18 13:15:43 +01:00
12a48f3bfe dinglehopper: Test aligning lists of lines 2019-11-18 13:00:40 +01:00
9c8ea6ab00
Merge pull request #6 from wrznr/master
Fix typo in README
2019-11-06 18:17:21 +01:00
Kay-Michael Würzner
165f763cef
Fix typo in README
Since *u* is pronounced `/ju/` in this case, the *n* is not added to prevent a hiatus here. Cf. https://en.wiktionary.org/wiki/an#Usage_notes.
2019-11-06 14:54:25 +01:00
680c2a2661 🐛 dinglehopper: Fix test_ocrd_cli for Python 3.5, again, and again 2019-10-28 15:05:08 +01:00
7cf1a540f4 🐛 dinglehopper: Fix test_ocrd_cli for Python 3.5, again 2019-10-28 14:58:24 +01:00
49e2065ad6 🐛 dinglehopper: Fix test_ocrd_cli for Python 3.5 2019-10-28 14:51:41 +01:00
86178271df dinglehopper: Fix repeated tests for the OCR-D interface 2019-10-28 11:47:42 +01:00
b6f50ef853 dinglehopper: Add a test for the OCR-D interface 2019-10-25 19:00:39 +02:00
29a2c8218f
Merge pull request #3 from kba/ocrd-tool-in-root
symlink ocrd-tool.json to repo root
2019-10-18 17:45:24 +02:00
Konstantin Baierer
2ca44af31d ocrd-tool: add category 2019-10-18 16:43:03 +02:00
Konstantin Baierer
9c7848706b symlink ocrd-tool.json to repo root 2019-10-18 16:38:44 +02:00
c30553985f dinglehopper: Substitute more characters 2019-10-01 13:18:12 +02:00
493541fddf 🐛 dinglehopper: Always work with NFC text 2019-10-01 12:35:44 +02:00
df93c80e5d 🐛 dinglehopper: Always work with NFC text 2019-10-01 11:28:14 +02:00
715b813bbc dinglehopper: Add two more eMOP ligatures 2019-10-01 10:53:20 +02:00
8d055e7b6e 🐛 dinglehopper: Work on NFC'ed grapheme clusters when aligning text 2019-09-30 18:17:13 +02:00
534958be1d 🐛 dinglehopper: Fix sorting the reading order
Regions were sorted wrongly when there are more than 9 regions in an
OrderedGroup because the index was sorted alphabetically, not
numerically. Fix this by converting the index to integers.
2019-09-30 16:06:59 +02:00
10f010eaa8 🐛 dinglehopper: Do not throw error if a region ID is not found
The ReadingOrder might contain regions of types other than text regions,
so not finding a TextRegion with the referenced ID is not an error.
Downgrade to a warning for now.
2019-09-26 15:19:30 +02:00
8237b3edaf dinglehopper: Substitute more characters 2019-08-22 15:35:19 +02:00
d18583348a dinglehopper: Make Travis CI badge link to build status 2019-08-16 14:01:53 +02:00
fcc4dadb7a 📝 dinglehopper: Add a LICENSE file 2019-08-16 13:57:49 +02:00
0c179c8fb0 📝 dinglehopper: Update author info 2019-08-16 13:56:51 +02:00
0f056b9672 🐛 dinglehopper: Really install templates 2019-08-16 13:46:33 +02:00
5da4f57e77 🐛 dinglehopper: Install templates 2019-08-16 13:30:21 +02:00
4525dd2a9c 🐛 dinglehopper: Install package data, i.e. ocrd-tool.json 2019-08-16 13:03:45 +02:00
d65eb076e1 📝 dinglehopper: Remove the double Travis badge, AGAIN 2019-08-15 17:57:56 +02:00
c6daa328c8 Merge branch 'master' of https://github.com/qurator-spk/dinglehopper 2019-08-15 17:55:49 +02:00
e6b1882af0
📝 dinglehopper: Remove double Travis badge 2019-08-15 17:55:41 +02:00
43db7f5853 Merge branch 'master' of https://github.com/qurator-spk/dinglehopper 2019-08-15 17:53:34 +02:00
5dda1a8be1 📝 dinglehopper: Document basic CLI usage 2019-08-15 17:52:47 +02:00
a1007fad15
dinglehopper: Merge pull request #1 from cneud/cneud-xml-links
add links to supported XML formats
2019-08-15 17:51:50 +02:00
02a0e093bf dinglehopper: Add OCR-D interface 2019-08-15 17:42:56 +02:00
Clemens Neudecker
fc93173efa
add links to supported XML formats 2019-08-15 14:39:35 +02:00
e6e2db79da dinglehopper: Add Travis CI badge 2019-08-15 12:33:07 +02:00
dd74699bd6 dinglehopper: Add Travis configuration 2019-08-15 11:13:29 +02:00
743b5b5acf 📝 dinglehopper: Document Unicode support 2019-08-15 10:38:05 +02:00
3454a1ff3e 📝 dinglehopper: Document goals 2019-08-15 10:35:26 +02:00
510dfd6c6c 📝 dinglehopper: Add screenshot 2019-08-14 18:29:49 +02:00
495919c06d 🧹 dinglehopper: Move pytest.ini 2019-08-14 18:12:02 +02:00
89048bf55d ➡ Move dinglehopper into its own directory 2019-08-14 15:32:50 +02:00
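
The reading-order fix described in commit 534958be1d can be illustrated with a minimal Python sketch. The helper name `sort_region_refs` and the `(index, region_id)` tuple layout are assumptions made for illustration and are not taken from dinglehopper's code; the sketch only shows why string indices sort wrongly once there are more than 9 regions, and how converting them to integers avoids that.

```python
def sort_region_refs(refs):
    """Sort (index, region_id) pairs numerically by index.

    Hypothetical helper: the indices are assumed to arrive as strings,
    as they would when read from XML attributes.
    """
    return sorted(refs, key=lambda ref: int(ref[0]))


refs = [("10", "r10"), ("2", "r2"), ("1", "r1")]

# Sorting the raw tuples compares the string indices, so "10" precedes "2".
alphabetical = [region for _, region in sorted(refs)]

# Converting the index to an integer restores the intended order.
numerical = [region for _, region in sort_region_refs(refs)]
```

With the three example pairs above, the alphabetical order is `r1, r10, r2`, while the numerical order is `r1, r2, r10`.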