Benjamin Rosemann
9f8f88df1f
Reintroduce tooltips in report.
4 years ago
Benjamin Rosemann
12dcdb81da
Add metrics parameter to integration test
4 years ago
Benjamin Rosemann
7642a53091
Allow disabling the html report.
4 years ago
Benjamin Rosemann
e8ccffb275
Updated reports and dependencies.
4 years ago
Benjamin Rosemann
40f23b8482
Added comments
4 years ago
Benjamin Rosemann
cee7b6891b
Fix CI Build
4 years ago
Benjamin Rosemann
714b569195
Fixed some flake8 and mypy issues.
4 years ago
Benjamin Rosemann
a44a3d4bf2
Error handling
4 years ago
Benjamin Rosemann
06468a436e
Implemented new metrics behaviour
4 years ago
Benjamin Rosemann
9f5112f8f6
Remove support for ExtractedText for bag metrics.
4 years ago
Benjamin Rosemann
381fe7cb6b
Switch to result tuple instead of multiple return parameters
4 years ago
Benjamin Rosemann
974ca3e5c0
Split html and json report generation
4 years ago
Benjamin Rosemann
8cd624f795
Add BoC and BoW metric
...
Also some refactoring for helper methods on normalization and word
splitting.
4 years ago
Benjamin Rosemann
4ccae9432d
Move metrics into separate package
4 years ago
Benjamin Rosemann
45465f8d13
Remove restriction on Python 3.5
4 years ago
Gerber, Mike
249787686f
Merge branch 'master' of github.com:qurator-spk/dinglehopper
continuous-integration/drone/push Build is failing
Details
4 years ago
Gerber, Mike
2a6cc5823e
🐛 dinglehopper: Call initLogging before logging
...
When using ocrd_utils' getLogger(), we need to call initLogging() before doing any
logging.
Fixes #55 .
4 years ago
Mike Gerber
0b9af3a21e
Merge pull request #58 from kba/unorderedgroupindexed
...
continuous-integration/drone/push Build is passing
Details
ReadingOrder may also contain UnorderedGroupIndexed
4 years ago
Konstantin Baierer
7fde00d911
ReadingOrder may also contain UnorderedGroupIndexed
4 years ago
Gerber, Mike
1778b36a9a
🚧 dinglehopper: Read PAGE UnorderedGroup in XML order
4 years ago
Gerber, Mike
bd324331e6
🚧 dinglehopper: Try out Drone CI
continuous-integration/drone/push Build is passing
Details
4 years ago
Gerber, Mike
a59ecb795c
🚧 dinglehopper: Try out Drone CI
continuous-integration/drone/push Build is failing
Details
4 years ago
Gerber, Mike
14230e073a
🚧 dinglehopper: Try out Drone CI
4 years ago
Gerber, Mike
985666a71c
🚧 dinglehopper: Try out Drone CI
4 years ago
Gerber, Mike
4a73053cfc
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
e3d4493c82
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
27f4c3bdf8
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
8533e6d421
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
e8da8b63f8
🚧 Replace Travis with CircleCI
4 years ago
Gerber, Mike
3b7a1a5631
🚧 Replace Travis with CircleCI
4 years ago
Mike Gerber
691ce371ca
Merge pull request #50 from b2m/fix-table-extraction
...
Fix the extraction of text from Page with TableRegion
4 years ago
Benjamin Rosemann
a68fc269d9
Fix the extraction of text from Page with TableRegion
...
Dinglehopper did not consider `OrderedGroupIndex` in the `ReadingOrder`
element when extracting text regions. As a consequence a `TableRegion`
was not considered for text extraction.
4 years ago
Gerber, Mike
8cd8314c8a
🐛 dinglehopper: Bump up ocrd req for zip_input_files
...
See also GH-49.
4 years ago
Mike Gerber
62670dd0c7
Merge pull request #49 from kba/zip_input_files
...
ocrd cli: use core-provided zip_input_files method
4 years ago
Konstantin Baierer
74e0ac18ed
ocrd cli: use core-provided zip_input_files method
4 years ago
Gerber, Mike
389e253c11
🐛 dinglehopper: Fix alto_extract_lines()'s type annotation
4 years ago
Gerber, Mike
fe3923a8af
🐛 dinglehopper: Fix alto_extract()'s type annotation
4 years ago
Gerber, Mike
132f91d500
✔️ dinglehopper: Add missing integration test markers
4 years ago
Gerber, Mike
c48d7646df
📝 dinglehopper: README-DEV: Massage markdown a bit
4 years ago
Mike Gerber
fed021090d
Merge pull request #46 from b2m/tool-changes
...
Tool changes
4 years ago
Benjamin Rosemann
cb1ac9d260
Add black to developer requirements.
5 years ago
Benjamin Rosemann
03ad413f4a
Added some helpful tools and configurations
5 years ago
Benjamin Rosemann
5cbd4f3d95
Preparation for black code formatter
5 years ago
Benjamin Rosemann
ce752e1912
Remove .idea folder and modify .gitignore
...
Sharing even parts of the .idea folder in worldwide setting is bound to
generate more problems than solutions. Therefore it should be removed
and consequently ignore in .gitignore.
Also adds some Python specific stuff to the .gitignore file.
5 years ago
Benjamin Rosemann
5270737c1f
Skip test on windows because it is unix specific.
5 years ago
Gerber, Mike
32a4b95a99
🐛 dinglehopper: Normalize in plain_extract()
5 years ago
Gerber, Mike
14421c8e53
🎨 dinglehopper: Reformat using black
5 years ago
Gerber, Mike
31c63f9e4c
🎨 dinglehopper: s/LOG/log
5 years ago
Mike Gerber
0804b029c4
Merge pull request #43 from bertsky/patch-1
...
1 more update for core's getLogger context
5 years ago
Robert Sachunsky
a60c14351e
1 more update for core's getLogger context
5 years ago