Kai Labusch
|
ef8244a466
|
add tsv2tsv tool;make easy re-processing of tsv files possible
|
1 month ago |
Kai Labusch
|
438b10e407
|
add tsv2tsv tool;make easy re-processing of tsv files possible
|
1 month ago |
Kai Labusch
|
24ecc16b2d
|
Merge pull request #17 from r0man-ist/patch-1
Fix small error to prevent recursion on tsv2page
|
3 months ago |
r0man-ist
|
772e6d1a42
|
Fix small error to prevent recursion on tsv2page
|
3 months ago |
Kai Labusch
|
06c8b382db
|
character normalization based on aletheia mapping
|
9 months ago |
Kai Labusch
|
eac71b3e40
|
Merge pull request #13 from qurator-spk/fix-ppn-xpath
make xpath for PPN number more specific to avoid catching the PPN of containing work
|
9 months ago |
Konstantin Baierer
|
3a8bfa74cc
|
fix namespace typo: s/mets/mods/
Co-authored-by: Stefan Weil <sw@weilnetz.de>
|
9 months ago |
Kai Labusch
|
2f7d01c7cd
|
fix alto2tsv bug
|
1 year ago |
Kai Labusch
|
eb750752c6
|
Merge pull request #15 from stweil/typo
Fix typo (found by codespell)
|
1 year ago |
Stefan Weil
|
3f35554a70
|
Fix typo (found by codespell)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
|
1 year ago |
Kai Labusch
|
fa1c6b5aa4
|
Merge pull request #14 from stweil/gitignore
.gitignore: Ignore build directory
|
1 year ago |
Stefan Weil
|
175694d25d
|
.gitignore: Ignore build directory
That directory is created by `make all` from ocrd_all and should be ignored
to get a clean `git status`.
Signed-off-by: Stefan Weil <sw@weilnetz.de>
|
1 year ago |
Kai Labusch
|
0ec6f83c4c
|
add alto2tsv
|
1 year ago |
Konstantin Baierer
|
82769077df
|
make xpath for PPN number more specific to avoid catching the PPN of containing work
|
2 years ago |
Konstantin Baierer
|
0f64f07635
|
📦 v0.0.1
|
2 years ago |
Konstantin Baierer
|
3b10dcb05b
|
Merge branch 'ocrd-processors' of https://github.com/kba/page2tsv into ocrd-processors
# Conflicts:
# setup.py
|
2 years ago |
Konstantin Baierer
|
1c0c1cd525
|
ocrd processors: use snake_case for add_file
|
2 years ago |
Konstantin Baierer
|
e1a440b91c
|
install into qurator namespace
|
2 years ago |
Konstantin Baierer
|
abeca0df16
|
drop requirement for matplotlib (not used)
|
2 years ago |
Konstantin Baierer
|
db25239075
|
Merge branch 'master' into ocrd-processors
# Conflicts:
# setup.py
|
2 years ago |
Kai Labusch
|
a0e5c82929
|
Merge branch 'master' into ocrd-processors
|
3 years ago |
Kai
|
75796b5c0c
|
refactor
|
3 years ago |
Konstantin Baierer
|
81ba7cff82
|
tests
|
3 years ago |
Konstantin Baierer
|
60a07c6310
|
drop support for scaling, not necessary for SBB use case anymore
|
3 years ago |
Konstantin Baierer
|
fe4a1eabb1
|
setup.py: use ocrd-tool.json for version
|
3 years ago |
Konstantin Baierer
|
aabcc4866d
|
remove obsolete tsv.py (now in qurator-sbb-tools
|
3 years ago |
Konstantin Baierer
|
f813c45ba2
|
Merge remote-tracking branch 'origin/master' into ocrd-processors
|
3 years ago |
Konstantin Baierer
|
aeb67e445f
|
implement page2tsv/tsv2page as ocrd-neat-{ex,im}port
|
3 years ago |
Konstantin Baierer
|
0aee20a7f6
|
cli: separate tsv2page and tsv2page_cli
|
3 years ago |
Konstantin Baierer
|
fe0c355e5a
|
cli: produce TSV if no words are transcribed
|
3 years ago |
Konstantin Baierer
|
93ee53c8e2
|
cli: split page2tsv from page2tsv_cli
|
3 years ago |
Kai
|
9d2d5fcd31
|
add missing imports
|
3 years ago |
Kai
|
568e1cd104
|
remove ner/ned code from page2tsv package
|
3 years ago |
Kai
|
ed90193c45
|
support segmentation only Page-XML
|
3 years ago |
Kai
|
ee5f03ce07
|
change default scale factor to 1.0
|
4 years ago |
Kai
|
5e60fabe4a
|
revert changes
|
4 years ago |
Kai
|
e5b635ec2d
|
try other coordinate computation
|
4 years ago |
Kai
|
f320904503
|
try other coordinate computation
|
4 years ago |
Kai
|
1eb05d0d62
|
xlrd does not support xsls files anymore
|
4 years ago |
Kai
|
ae93668bac
|
xlrd does not support xsls files anymore
|
4 years ago |
Kai
|
2bd4ae8d5a
|
add ned-priority option to page2tsv
|
4 years ago |
Kai
|
d4eb95b64b
|
make code more robust
|
4 years ago |
Kai
|
49861b1652
|
support confidences in find-entities
|
4 years ago |
Kai
|
0da38d6ec6
|
support confidences in find-entities
|
4 years ago |
Kai
|
9b3198e401
|
add priority option for find-entities
|
4 years ago |
Kai
|
7b53cc5539
|
add priority option for find-entities
|
4 years ago |
Kai
|
318d9bd122
|
fix #7
|
4 years ago |
Kai Labusch
|
abcdb67e9e
|
Merge pull request #6 from kba/lineid-ocr-tsv
Retain line_id, tsv2page CLI to propagate results back to PAGE-XML
|
4 years ago |
Konstantin Baierer
|
f03acbf54d
|
tsv2page CLI to propagate TSV results back to PAGE-XML
|
4 years ago |
Konstantin Baierer
|
ad379aea2b
|
store pc:TextLine ID in TSV, fix #5
|
4 years ago |