This website works better with JavaScript.
167dad18f4
🚧 dinglehopper: Test aligning by character while retaining segment id info
Gerber, Mike
2020-06-11 13:54:46 +0200
4cd835ae51
🚧 dinglehopper: Test aligning by character while retaining segment id info
Gerber, Mike
2020-06-11 13:04:36 +0200
8435d88419
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 20:31:54 +0200
534e042f9e
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 20:29:01 +0200
89852314dc
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 19:49:12 +0200
4bd30e6686
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 19:40:57 +0200
bc630233d0
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 19:36:49 +0200
2c69e077fe
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 18:30:34 +0200
84c9e6a9c7
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 18:29:11 +0200
c3709e2ec0
🧹 dinglehopper: Remove .vimrc again (security)
Gerber, Mike
2020-06-18 13:27:24 +0200
5aa74e8383
🎨 dinglehopper: Make PyCharm happier with the type hinting, newlines etc.
Gerber, Mike
2020-06-12 20:59:37 +0200
e972328e51
✨ dinglehopper: Validate read segment ids
Gerber, Mike
2020-06-12 20:43:25 +0200
c9109999db
🧹 dinglehopper: Remove obsolete normalization-related FIXME
Gerber, Mike
2020-06-12 20:29:50 +0200
bc006746dd
🧹 dinglehopper: Replace XXX with an actual comment
Gerber, Mike
2020-06-12 20:24:58 +0200
507ad6b6a4
🧹 dinglehopper: Remove obsolete XXX that has a GitHub issue
Gerber, Mike
2020-06-12 20:21:18 +0200
e0aa9bc3f4
🧹 dinglehopper: Remove obsolete XXX about None ids
Gerber, Mike
2020-06-12 20:19:38 +0200
6eb0a9350c
🎨 dinglehopper: Unfuck substitutions a bit
Gerber, Mike
2020-06-12 20:05:33 +0200
e3e7938162
🐛 dinglehopper: Fix tests to deal with new normalization logic
Gerber, Mike
2020-06-12 20:04:24 +0200
c3ae73d576
🧹 dinglehopper: Calculate segment ids once, on the first call
Gerber, Mike
2020-06-12 18:06:42 +0200
bc05f83088
🧹 dinglehopper: Remove obsolete XXX about the PAGE hierarchy
Gerber, Mike
2020-06-12 17:04:07 +0200
453247c2f3
🧹 dinglehopper: Clean up test_lines_similar()
Gerber, Mike
2020-06-12 17:01:56 +0200
dc85294380
📓 dinglehopper: Document editops()
Gerber, Mike
2020-06-12 17:01:28 +0200
e1c8546336
🧹 dinglehopper: Move Python 3.5 XXXs to a GitHub issue
Gerber, Mike
2020-06-12 16:08:56 +0200
4b86f01b15
🚧 dinglehopper: Use a Bootstrap tooltip for the segment id
Gerber, Mike
2020-06-12 15:56:01 +0200
a1c1b9c5ca
🚧 dinglehopper: Re-introduce "substitute_equivalences" as Normalization.NFC_SBB
Gerber, Mike
2020-06-12 15:53:15 +0200
28849c701b
🚧 dinglehopper: Remove debug output
Gerber, Mike
2020-06-12 14:25:11 +0200
25191b24f6
🚧 dinglehopper: Display segment id in the corresponding column
Gerber, Mike
2020-06-12 13:46:28 +0200
a448133394
🚧 dinglehopper: Display segment id when hovering over a character difference
Gerber, Mike
2020-06-12 13:25:35 +0200
bc1002b1e6
🚧 dinglehopper: Extract text while retaining segment id info
Gerber, Mike
2020-06-11 17:43:30 +0200
6d0db229fa
🚧 dinglehopper: Extract text while retaining segment id info
Gerber, Mike
2020-06-11 16:54:48 +0200
a09c1eae7e
🚧 dinglehopper: Extract text while retaining segment id info
Gerber, Mike
2020-06-11 15:37:34 +0200
5dbf563d6a
🚧 dinglehopper: Extract text while retaining segment id info
Gerber, Mike
2020-06-11 15:35:52 +0200
5b353a2232
🚧 dinglehopper: Test aligning by character while retaining segment id info
Gerber, Mike
2020-06-11 14:56:23 +0200
278e52868f
🚧 dinglehopper: Test aligning by character while retaining segment id info
Gerber, Mike
2020-06-11 14:54:50 +0200
1b9497dfb0
🚧 dinglehopper: Test aligning by character while retaining segment id info
Gerber, Mike
2020-06-11 14:50:32 +0200
98f6c68df7
🚧 dinglehopper: Test aligning by character while retaining segment id info
Gerber, Mike
2020-06-11 13:54:46 +0200
1d553bb4e3
🚧 dinglehopper: Test aligning by character while retaining segment id info
Gerber, Mike
2020-06-11 13:04:36 +0200
354afdc0b2
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 20:31:54 +0200
5a5e3c824b
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 20:29:01 +0200
96273b026d
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 19:49:12 +0200
4e9b0aeef1
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 19:40:57 +0200
ac1e1ec79a
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 19:36:49 +0200
f6a880860f
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 18:30:34 +0200
a02e7dcbce
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 18:29:11 +0200
9354efaf28
💄 Set maximum line length to 90
Gerber, Mike
2020-06-18 13:07:09 +0200
cdfd4d321d
🐛 dinglehopper: Add missing requirement MarkupSafe
Gerber, Mike
2020-06-12 20:46:51 +0200
eca8cbc81e
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 20:31:54 +0200
91371971eb
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 20:29:01 +0200
8e3a19d7e9
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 19:49:12 +0200
93608ba697
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 19:40:57 +0200
7f5789567f
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 19:36:49 +0200
475aa65e98
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 18:30:34 +0200
c3eefbb1e8
🚧 dinglehopper: WIP data structure for extracted text
Gerber, Mike
2020-06-10 18:29:11 +0200
32848eb2c6
🎨 dinglehopper: Set code width for flake8
Gerber, Mike
2020-06-10 19:45:05 +0200
833c02729b
🗒️ dinglehopper: Remove superfluous `-m mets.xml` in the README OCR-D example
Gerber, Mike
2020-06-09 18:30:36 +0200
668de758a0
✨ dinglehopper: Support disabling metrics in the OCR-D interface
Gerber, Mike
2020-06-09 18:29:59 +0200
f699697eb3
🐛 dinglehopper: Fix reading OCR-D workspace files when only URLs are provided
Gerber, Mike
2020-06-09 17:13:22 +0200
ea1cc32b91
🧹 dinglehopper: .gitignore Python stuff
Gerber, Mike
2020-06-09 13:25:05 +0200
22765f02a2
🐛 dinglehopper: Fix tests by making metrics a keyword argument
Gerber, Mike
2020-06-09 13:07:44 +0200
1af062e1a9
🗒️ dinglehopper: Describe what dinglehopper does in the README
Gerber, Mike
2020-06-08 18:27:20 +0200
5cbeb7b0dd
✨ dinglehopper: Support disabling the metrics using CLI option --no-metrics
Gerber, Mike
2020-06-08 18:26:21 +0200
779472575c
✨ dinglehopper: Include number of characters and words in JSON report
Gerber, Mike
2020-02-21 14:53:12 +0100
745095e52c
✨ dinglehopper: Include number of characters and words in JSON report
Gerber, Mike
2020-02-21 14:53:12 +0100
be251a391e
🔧 dinglehopper: Add PyCharm config
Gerber, Mike
2020-01-14 13:22:42 +0100
6987a8e1e2
🔧 dinglehopper: Add PyCharm config
Gerber, Mike
2020-01-14 13:22:42 +0100
f94e8b9b1c
Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector "
Gerber, Mike
2019-12-09 12:44:05 +0100
48a31ce672
Revert "Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector "
Gerber, Mike
2019-12-09 12:44:05 +0100
1303a7d92f
Merge branch 'master' of https://github.com/qurator-spk/sbb_textline_detector
b-vr103
2019-12-09 11:57:16 +0100
41e00eb900
✅ Travis: Additionally test with Python 3.8
Gerber, Mike
2019-12-06 16:11:18 +0100
f32eb9eb69
🐛 dinglehopper: Escape text inserted into HTML (Fixes #8 )
Gerber, Mike
2019-12-06 15:59:09 +0100
82e863fac2
📝 dinglehopper: Document seq_editops()
Gerber, Mike
2019-12-04 13:22:38 +0100
5ccdace1dd
🎨 dinglehopper: Move working_directory() context manager into tests/util
Gerber, Mike
2019-12-02 15:18:08 +0100
f98c527c93
🐛 dinglehopper: Fix working_directory() context manager
Gerber, Mike
2019-12-02 15:14:16 +0100
5273d10bac
🐛 dinglehopper: Generate a loadable JSON report even if CER=∞
Gerber, Mike
2019-12-02 14:58:35 +0100
ced6504ad0
🎨 dinglehopper: Expose clearing the Levenshtein cache as a function
Gerber, Mike
2019-11-20 13:24:45 +0100
5cf4eddaeb
⚡ dinglehopper: Clear Levenshtein cache between OCR-D files
Gerber, Mike
2019-11-20 13:05:45 +0100
1206c9bb4b
📝 dinglehopper: Document installation + testing
Gerber, Mike
2019-11-18 16:32:42 +0100
58ff140bc0
⚡ ️ dinglehopper: Improve performance by caching the Levensthein matrix
Gerber, Mike
2019-11-18 15:33:17 +0100
11a6341641
🧹 dinglehopper: Remove broken implementation of the unordered word error rate
Gerber, Mike
2019-11-18 15:03:17 +0100
f22228840e
🧹 dinglehopper: Use exclusively relative imports in tests
Gerber, Mike
2019-11-18 14:31:43 +0100
d61c076aad
🧹 dinglehopper: Remove debug print()s
Gerber, Mike
2019-11-18 13:15:43 +0100
12a48f3bfe
✅ dinglehopper: Test aligning lists of lines
Gerber, Mike
2019-11-18 13:00:40 +0100
a64099fac6
Squash merge.
JKamlah
2019-11-15 19:21:07 +0100
4ae2425e5a
ADD os path join and unique hashname
JKamlah
2019-11-15 19:17:32 +0100
057af6d192
Join strings with unique symbol for the hash.
JKamlah
2019-11-15 18:15:12 +0100
c36396302d
FIX problem with creation of hash, instead of merging the strings each string get's an own hash. Adding os.path.join.
JKamlah
2019-11-15 17:59:15 +0100
eb70510271
Merge branch 'master' of https://github.com/qurator-spk/dinglehopper
JKamlah
2019-11-15 15:22:19 +0100
9c8ea6ab00
Merge pull request #6 from wrznr/master
Mike Gerber
2019-11-06 18:17:21 +0100
165f763cef
Fix typo in README
Kay-Michael Würzner
2019-11-06 14:54:25 +0100
d458568ebd
FIX naming, spacing and deletion of tempcachfiles.
JKamlah
2019-11-04 11:22:06 +0100
c9cfdc59ae
Merge branch 'master' of https://github.com/qurator-spk/dinglehopper
JKamlah
2019-10-31 12:16:22 +0100
077396bb56
ADD tempcache for levensthein matrix and reformat code.
JKamlah
2019-10-31 12:14:05 +0100
680c2a2661
🐛 dinglehopper: Fix test_ocrd_cli for Python 3.5, again, and again
Gerber, Mike
2019-10-28 15:05:08 +0100
7cf1a540f4
🐛 dinglehopper: Fix test_ocrd_cli for Python 3.5, again
Gerber, Mike
2019-10-28 14:58:24 +0100
49e2065ad6
🐛 dinglehopper: Fix test_ocrd_cli for Python 3.5
Gerber, Mike
2019-10-28 14:51:41 +0100
fb89c8f571
Merge branch 'master' of https://github.com/qurator-spk/dinglehopper
JKamlah
2019-10-28 14:25:51 +0100
3e515933e6
Rearrange new algo and set a limit, when to use it.
JKamlah
2019-10-28 14:18:23 +0100
86178271df
✅ dinglehopper: Fix repeated tests for the OCR-D interface
Gerber, Mike
2019-10-28 11:47:42 +0100
b6f50ef853
✅ dinglehopper: Add a test for the OCR-D interface
Gerber, Mike
2019-10-25 19:00:39 +0200
6ad003b015
ADD a new levenshtein matrix calculation.
JKamlah
2019-10-25 10:49:51 +0200