Commit Graph

243 Commits (091f069b3ccc534ab4ce5bd6a951c95aed1d4a08)

Author SHA1 Message Date
Gerber, Mike bc1002b1e6 🚧 dinglehopper: Extract text while retaining segment id info 4 years ago
Gerber, Mike 6d0db229fa 🚧 dinglehopper: Extract text while retaining segment id info 4 years ago
Gerber, Mike a09c1eae7e 🚧 dinglehopper: Extract text while retaining segment id info 4 years ago
Gerber, Mike 5dbf563d6a 🚧 dinglehopper: Extract text while retaining segment id info 4 years ago
Gerber, Mike 5b353a2232 🚧 dinglehopper: Test aligning by character while retaining segment id info 4 years ago
Gerber, Mike 278e52868f 🚧 dinglehopper: Test aligning by character while retaining segment id info 4 years ago
Gerber, Mike 1b9497dfb0 🚧 dinglehopper: Test aligning by character while retaining segment id info 4 years ago
Gerber, Mike 98f6c68df7 🚧 dinglehopper: Test aligning by character while retaining segment id info 4 years ago
Gerber, Mike 1d553bb4e3 🚧 dinglehopper: Test aligning by character while retaining segment id info 4 years ago
Gerber, Mike 354afdc0b2 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike 5a5e3c824b 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike 96273b026d 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike 4e9b0aeef1 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike ac1e1ec79a 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike f6a880860f 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike a02e7dcbce 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike 9354efaf28 💄 Set maximum line length to 90 4 years ago
Gerber, Mike cdfd4d321d 🐛 dinglehopper: Add missing requirement MarkupSafe 4 years ago
Gerber, Mike eca8cbc81e 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike 91371971eb 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike 8e3a19d7e9 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike 93608ba697 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike 7f5789567f 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike 475aa65e98 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike c3eefbb1e8 🚧 dinglehopper: WIP data structure for extracted text 4 years ago
Gerber, Mike 32848eb2c6 🎨 dinglehopper: Set code width for flake8 4 years ago
Gerber, Mike 833c02729b 🗒️ dinglehopper: Remove superfluous `-m mets.xml` in the README OCR-D example 4 years ago
Gerber, Mike 668de758a0 dinglehopper: Support disabling metrics in the OCR-D interface 4 years ago
Gerber, Mike f699697eb3 🐛 dinglehopper: Fix reading OCR-D workspace files when only URLs are provided 4 years ago
Gerber, Mike ea1cc32b91 🧹 dinglehopper: .gitignore Python stuff 4 years ago
Gerber, Mike 22765f02a2 🐛 dinglehopper: Fix tests by making metrics a keyword argument 4 years ago
Gerber, Mike 1af062e1a9 🗒️ dinglehopper: Describe what dinglehopper does in the README 5 years ago
Gerber, Mike 5cbeb7b0dd dinglehopper: Support disabling the metrics using CLI option --no-metrics 5 years ago
Gerber, Mike 779472575c dinglehopper: Include number of characters and words in JSON report 5 years ago
Gerber, Mike 745095e52c dinglehopper: Include number of characters and words in JSON report 5 years ago
Gerber, Mike be251a391e 🔧 dinglehopper: Add PyCharm config 5 years ago
Gerber, Mike 6987a8e1e2 🔧 dinglehopper: Add PyCharm config 5 years ago
Gerber, Mike f94e8b9b1c Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector"
This reverts commit a3c1eee8f31349edcfb1e36920763bcecceb1129, reversing
changes made to dc76213ffc1fbabc2c45f0e52ced55449bdf2e83.
5 years ago
Gerber, Mike 48a31ce672 Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector"
This reverts commit 2c89bf3b35ee290d7b830ef270df3a96aa48245e, reversing
changes made to 9f7e413148ca5dbac9b555d7b0d0a5fa3a0f5340.
5 years ago
b-vr103 1303a7d92f Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector 5 years ago
Gerber, Mike 41e00eb900 Travis: Additionally test with Python 3.8 5 years ago
Gerber, Mike f32eb9eb69 🐛 dinglehopper: Escape text inserted into HTML (Fixes #8) 5 years ago
Gerber, Mike 82e863fac2 📝 dinglehopper: Document seq_editops() 5 years ago
Gerber, Mike 5ccdace1dd 🎨 dinglehopper: Move working_directory() context manager into tests/util 5 years ago
Gerber, Mike f98c527c93 🐛 dinglehopper: Fix working_directory() context manager 5 years ago
Gerber, Mike 5273d10bac 🐛 dinglehopper: Generate a loadable JSON report even if CER=∞ 5 years ago
Gerber, Mike ced6504ad0 🎨 dinglehopper: Expose clearing the Levenshtein cache as a function 5 years ago
Gerber, Mike 5cf4eddaeb dinglehopper: Clear Levenshtein cache between OCR-D files 5 years ago
Gerber, Mike 1206c9bb4b 📝 dinglehopper: Document installation + testing 5 years ago
Gerber, Mike 58ff140bc0 dinglehopper: Improve performance by caching the Levenshtein matrix
Motivated by [a pull
request](https://github.com/qurator-spk/dinglehopper/pull/7) by
@JKamlah, implement a cache of the Levenshtein matrix calculation.

We calculated the Levenshtein matrixes for characters and words twice:
Once for the error rates, once for the alignment.
5 years ago