Gerber, Mike
bc1002b1e6
🚧 dinglehopper: Extract text while retaining segment id info
4 years ago
Gerber, Mike
6d0db229fa
🚧 dinglehopper: Extract text while retaining segment id info
4 years ago
Gerber, Mike
a09c1eae7e
🚧 dinglehopper: Extract text while retaining segment id info
4 years ago
Gerber, Mike
5dbf563d6a
🚧 dinglehopper: Extract text while retaining segment id info
4 years ago
Gerber, Mike
5b353a2232
🚧 dinglehopper: Test aligning by character while retaining segment id info
4 years ago
Gerber, Mike
278e52868f
🚧 dinglehopper: Test aligning by character while retaining segment id info
4 years ago
Gerber, Mike
1b9497dfb0
🚧 dinglehopper: Test aligning by character while retaining segment id info
4 years ago
Gerber, Mike
98f6c68df7
🚧 dinglehopper: Test aligning by character while retaining segment id info
4 years ago
Gerber, Mike
1d553bb4e3
🚧 dinglehopper: Test aligning by character while retaining segment id info
4 years ago
Gerber, Mike
354afdc0b2
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
5a5e3c824b
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
96273b026d
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
4e9b0aeef1
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
ac1e1ec79a
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
f6a880860f
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
a02e7dcbce
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
9354efaf28
💄 Set maximum line length to 90
4 years ago
Gerber, Mike
cdfd4d321d
🐛 dinglehopper: Add missing requirement MarkupSafe
4 years ago
Gerber, Mike
eca8cbc81e
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
91371971eb
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
8e3a19d7e9
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
93608ba697
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
7f5789567f
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
475aa65e98
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
c3eefbb1e8
🚧 dinglehopper: WIP data structure for extracted text
4 years ago
Gerber, Mike
32848eb2c6
🎨 dinglehopper: Set code width for flake8
4 years ago
Gerber, Mike
833c02729b
🗒️ dinglehopper: Remove superfluous `-m mets.xml` in the README OCR-D example
4 years ago
Gerber, Mike
668de758a0
✨ dinglehopper: Support disabling metrics in the OCR-D interface
4 years ago
Gerber, Mike
f699697eb3
🐛 dinglehopper: Fix reading OCR-D workspace files when only URLs are provided
4 years ago
Gerber, Mike
ea1cc32b91
🧹 dinglehopper: .gitignore Python stuff
4 years ago
Gerber, Mike
22765f02a2
🐛 dinglehopper: Fix tests by making metrics a keyword argument
4 years ago
Gerber, Mike
1af062e1a9
🗒️ dinglehopper: Describe what dinglehopper does in the README
5 years ago
Gerber, Mike
5cbeb7b0dd
✨ dinglehopper: Support disabling the metrics using CLI option --no-metrics
5 years ago
Gerber, Mike
779472575c
✨ dinglehopper: Include number of characters and words in JSON report
5 years ago
Gerber, Mike
745095e52c
✨ dinglehopper: Include number of characters and words in JSON report
5 years ago
Gerber, Mike
be251a391e
🔧 dinglehopper: Add PyCharm config
5 years ago
Gerber, Mike
6987a8e1e2
🔧 dinglehopper: Add PyCharm config
5 years ago
Gerber, Mike
f94e8b9b1c
Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector "
...
This reverts commit a3c1eee8f31349edcfb1e36920763bcecceb1129, reversing
changes made to dc76213ffc1fbabc2c45f0e52ced55449bdf2e83.
5 years ago
Gerber, Mike
48a31ce672
Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector "
...
This reverts commit 2c89bf3b35ee290d7b830ef270df3a96aa48245e, reversing
changes made to 9f7e413148ca5dbac9b555d7b0d0a5fa3a0f5340.
5 years ago
b-vr103
1303a7d92f
Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector
5 years ago
Gerber, Mike
41e00eb900
✅ Travis: Additionally test with Python 3.8
5 years ago
Gerber, Mike
f32eb9eb69
🐛 dinglehopper: Escape text inserted into HTML ( Fixes #8 )
5 years ago
Gerber, Mike
82e863fac2
📝 dinglehopper: Document seq_editops()
5 years ago
Gerber, Mike
5ccdace1dd
🎨 dinglehopper: Move working_directory() context manager into tests/util
5 years ago
Gerber, Mike
f98c527c93
🐛 dinglehopper: Fix working_directory() context manager
5 years ago
Gerber, Mike
5273d10bac
🐛 dinglehopper: Generate a loadable JSON report even if CER=∞
5 years ago
Gerber, Mike
ced6504ad0
🎨 dinglehopper: Expose clearing the Levenshtein cache as a function
5 years ago
Gerber, Mike
5cf4eddaeb
⚡ dinglehopper: Clear Levenshtein cache between OCR-D files
5 years ago
Gerber, Mike
1206c9bb4b
📝 dinglehopper: Document installation + testing
5 years ago
Gerber, Mike
58ff140bc0
⚡ ️ dinglehopper: Improve performance by caching the Levensthein matrix
...
Motivated by [a pull
request](https://github.com/qurator-spk/dinglehopper/pull/7 ) by
@JKamlah, implement a cache of the Levensthein matrix calculation.
We calculated the Levenshtein matrixes for characters and words twice:
Once for the error rates, once for the alignment.
5 years ago