1
0
Fork 0
mirror of https://github.com/qurator-spk/dinglehopper.git synced 2025-06-09 20:00:01 +02:00
Commit graph

287 commits

Author SHA1 Message Date
1b9497dfb0 🚧 dinglehopper: Test aligning by character while retaining segment id info 2020-06-18 13:08:05 +02:00
98f6c68df7 🚧 dinglehopper: Test aligning by character while retaining segment id info 2020-06-18 13:08:05 +02:00
1d553bb4e3 🚧 dinglehopper: Test aligning by character while retaining segment id info 2020-06-18 13:08:05 +02:00
354afdc0b2 🚧 dinglehopper: WIP data structure for extracted text 2020-06-18 13:08:05 +02:00
5a5e3c824b 🚧 dinglehopper: WIP data structure for extracted text 2020-06-18 13:08:05 +02:00
96273b026d 🚧 dinglehopper: WIP data structure for extracted text 2020-06-18 13:08:05 +02:00
4e9b0aeef1 🚧 dinglehopper: WIP data structure for extracted text 2020-06-18 13:08:05 +02:00
ac1e1ec79a 🚧 dinglehopper: WIP data structure for extracted text 2020-06-18 13:08:05 +02:00
f6a880860f 🚧 dinglehopper: WIP data structure for extracted text 2020-06-18 13:08:05 +02:00
a02e7dcbce 🚧 dinglehopper: WIP data structure for extracted text 2020-06-18 13:08:05 +02:00
9354efaf28 💄 Set maximum line length to 90 2020-06-18 13:07:09 +02:00
cdfd4d321d 🐛 dinglehopper: Add missing requirement MarkupSafe 2020-06-12 20:46:51 +02:00
eca8cbc81e 🚧 dinglehopper: WIP data structure for extracted text 2020-06-10 20:31:59 +02:00
91371971eb 🚧 dinglehopper: WIP data structure for extracted text 2020-06-10 20:31:59 +02:00
8e3a19d7e9 🚧 dinglehopper: WIP data structure for extracted text 2020-06-10 20:31:59 +02:00
93608ba697 🚧 dinglehopper: WIP data structure for extracted text 2020-06-10 20:31:59 +02:00
7f5789567f 🚧 dinglehopper: WIP data structure for extracted text 2020-06-10 20:31:59 +02:00
475aa65e98 🚧 dinglehopper: WIP data structure for extracted text 2020-06-10 20:31:59 +02:00
c3eefbb1e8 🚧 dinglehopper: WIP data structure for extracted text 2020-06-10 20:31:59 +02:00
32848eb2c6 🎨 dinglehopper: Set code width for flake8 2020-06-10 19:45:05 +02:00
833c02729b 🗒️ dinglehopper: Remove superfluous -m mets.xml in the README OCR-D example 2020-06-09 18:30:36 +02:00
668de758a0 dinglehopper: Support disabling metrics in the OCR-D interface 2020-06-09 18:29:59 +02:00
f699697eb3 🐛 dinglehopper: Fix reading OCR-D workspace files when only URLs are provided 2020-06-09 17:13:22 +02:00
ea1cc32b91 🧹 dinglehopper: .gitignore Python stuff 2020-06-09 13:25:05 +02:00
22765f02a2 🐛 dinglehopper: Fix tests by making metrics a keyword argument 2020-06-09 13:07:44 +02:00
1af062e1a9 🗒️ dinglehopper: Describe what dinglehopper does in the README 2020-06-08 18:27:20 +02:00
5cbeb7b0dd dinglehopper: Support disabling the metrics using CLI option --no-metrics 2020-06-08 18:26:21 +02:00
779472575c dinglehopper: Include number of characters and words in JSON report 2020-02-21 14:53:16 +01:00
745095e52c dinglehopper: Include number of characters and words in JSON report 2020-02-21 14:53:16 +01:00
be251a391e 🔧 dinglehopper: Add PyCharm config 2020-01-14 13:22:42 +01:00
6987a8e1e2 🔧 dinglehopper: Add PyCharm config 2020-01-14 13:22:42 +01:00
f94e8b9b1c Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector"
This reverts commit a3c1eee8f31349edcfb1e36920763bcecceb1129, reversing
changes made to dc76213ffc1fbabc2c45f0e52ced55449bdf2e83.
2019-12-09 12:44:05 +01:00
48a31ce672 Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector"
This reverts commit 2c89bf3b35ee290d7b830ef270df3a96aa48245e, reversing
changes made to 9f7e413148ca5dbac9b555d7b0d0a5fa3a0f5340.
2019-12-09 12:44:05 +01:00
b-vr103
1303a7d92f Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector 2019-12-09 11:57:16 +01:00
41e00eb900 Travis: Additionally test with Python 3.8 2019-12-06 16:11:18 +01:00
f32eb9eb69 🐛 dinglehopper: Escape text inserted into HTML (Fixes #8) 2019-12-06 15:59:09 +01:00
82e863fac2 📝 dinglehopper: Document seq_editops() 2019-12-04 13:22:38 +01:00
5ccdace1dd 🎨 dinglehopper: Move working_directory() context manager into tests/util 2019-12-02 15:18:08 +01:00
f98c527c93 🐛 dinglehopper: Fix working_directory() context manager 2019-12-02 15:14:16 +01:00
5273d10bac 🐛 dinglehopper: Generate a loadable JSON report even if CER=∞ 2019-12-02 15:00:07 +01:00
ced6504ad0 🎨 dinglehopper: Expose clearing the Levenshtein cache as a function 2019-11-20 13:24:45 +01:00
5cf4eddaeb dinglehopper: Clear Levenshtein cache between OCR-D files 2019-11-20 13:05:45 +01:00
1206c9bb4b 📝 dinglehopper: Document installation + testing 2019-11-18 16:32:42 +01:00
58ff140bc0 ️ dinglehopper: Improve performance by caching the Levensthein matrix
Motivated by [a pull
request](https://github.com/qurator-spk/dinglehopper/pull/7) by
@JKamlah, implement a cache of the Levensthein matrix calculation.

We calculated the Levenshtein matrixes for characters and words twice:
Once for the error rates, once for the alignment.
2019-11-18 15:33:17 +01:00
11a6341641 🧹 dinglehopper: Remove broken implementation of the unordered word error rate 2019-11-18 15:03:17 +01:00
f22228840e 🧹 dinglehopper: Use exclusively relative imports in tests 2019-11-18 14:31:43 +01:00
d61c076aad 🧹 dinglehopper: Remove debug print()s 2019-11-18 13:15:43 +01:00
12a48f3bfe dinglehopper: Test aligning lists of lines 2019-11-18 13:00:40 +01:00
9c8ea6ab00
Merge pull request #6 from wrznr/master
Fix typo in README
2019-11-06 18:17:21 +01:00
Kay-Michael Würzner
165f763cef
Fix typo in README
Since *u* is pronounced `/ju/` in this case, the *n* is not added to prevent a hiatus here. Cf. https://en.wiktionary.org/wiki/an#Usage_notes.
2019-11-06 14:54:25 +01:00