93 Commits (438b10e4074e9f8fcd0e4cd4001b71d962413c97)
 

Author SHA1 Message Date
Kai Labusch 438b10e407 add tsv2tsv tool; make easy re-processing of tsv files possible 1 month ago
Kai Labusch 24ecc16b2d
Merge pull request #17 from r0man-ist/patch-1
Fix small error to prevent recursion on tsv2page
3 months ago
r0man-ist 772e6d1a42
Fix small error to prevent recursion on tsv2page 3 months ago
Kai Labusch 06c8b382db character normalization based on aletheia mapping 9 months ago
Kai Labusch eac71b3e40
Merge pull request #13 from qurator-spk/fix-ppn-xpath
make xpath for PPN number more specific to avoid catching the PPN of containing work
9 months ago
Konstantin Baierer 3a8bfa74cc
fix namespace typo: s/mets/mods/
Co-authored-by: Stefan Weil <sw@weilnetz.de>
9 months ago
Kai Labusch 2f7d01c7cd fix alto2tsv bug 1 year ago
Kai Labusch eb750752c6
Merge pull request #15 from stweil/typo
Fix typo (found by codespell)
1 year ago
Stefan Weil 3f35554a70 Fix typo (found by codespell)
Signed-off-by: Stefan Weil <sw@weilnetz.de>
1 year ago
Kai Labusch fa1c6b5aa4
Merge pull request #14 from stweil/gitignore
.gitignore: Ignore build directory
1 year ago
Stefan Weil 175694d25d .gitignore: Ignore build directory
That directory is created by `make all` from ocrd_all and should be ignored
to get a clean `git status`.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
1 year ago
Kai Labusch 0ec6f83c4c add alto2tsv 1 year ago
Konstantin Baierer 82769077df make xpath for PPN number more specific to avoid catching the PPN of containing work 2 years ago
Konstantin Baierer 0f64f07635 📦 v0.0.1 2 years ago
Konstantin Baierer 3b10dcb05b Merge branch 'ocrd-processors' of https://github.com/kba/page2tsv into ocrd-processors
# Conflicts:
#	setup.py
2 years ago
Konstantin Baierer 1c0c1cd525 ocrd processors: use snake_case for add_file 2 years ago
Konstantin Baierer e1a440b91c install into qurator namespace 2 years ago
Konstantin Baierer abeca0df16 drop requirement for matplotlib (not used) 2 years ago
Konstantin Baierer db25239075 Merge branch 'master' into ocrd-processors
# Conflicts:
#	setup.py
2 years ago
Kai Labusch a0e5c82929
Merge branch 'master' into ocrd-processors 3 years ago
Kai 75796b5c0c refactor 3 years ago
Konstantin Baierer 81ba7cff82 tests 3 years ago
Konstantin Baierer 60a07c6310 drop support for scaling, not necessary for SBB use case anymore 3 years ago
Konstantin Baierer fe4a1eabb1 setup.py: use ocrd-tool.json for version 3 years ago
Konstantin Baierer aabcc4866d remove obsolete tsv.py (now in qurator-sbb-tools) 3 years ago
Konstantin Baierer f813c45ba2 Merge remote-tracking branch 'origin/master' into ocrd-processors 3 years ago
Konstantin Baierer aeb67e445f implement page2tsv/tsv2page as ocrd-neat-{ex,im}port 3 years ago
Konstantin Baierer 0aee20a7f6 cli: separate tsv2page and tsv2page_cli 3 years ago
Konstantin Baierer fe0c355e5a cli: produce TSV if no words are transcribed 3 years ago
Konstantin Baierer 93ee53c8e2 cli: split page2tsv from page2tsv_cli 3 years ago
Kai 9d2d5fcd31 add missing imports 3 years ago
Kai 568e1cd104 remove ner/ned code from page2tsv package 3 years ago
Kai ed90193c45 support segmentation only Page-XML 3 years ago
Kai ee5f03ce07 change default scale factor to 1.0 4 years ago
Kai 5e60fabe4a revert changes 4 years ago
Kai e5b635ec2d try other coordinate computation 4 years ago
Kai f320904503 try other coordinate computation 4 years ago
Kai 1eb05d0d62 xlrd does not support xlsx files anymore 4 years ago
Kai ae93668bac xlrd does not support xlsx files anymore 4 years ago
Kai 2bd4ae8d5a add ned-priority option to page2tsv 4 years ago
Kai d4eb95b64b make code more robust 4 years ago
Kai 49861b1652 support confidences in find-entities 4 years ago
Kai 0da38d6ec6 support confidences in find-entities 4 years ago
Kai 9b3198e401 add priority option for find-entities 4 years ago
Kai 7b53cc5539 add priority option for find-entities 4 years ago
Kai 318d9bd122 fix #7 4 years ago
Kai Labusch abcdb67e9e
Merge pull request #6 from kba/lineid-ocr-tsv
Retain line_id, tsv2page CLI to propagate results back to PAGE-XML
4 years ago
Konstantin Baierer f03acbf54d tsv2page CLI to propagate TSV results back to PAGE-XML 4 years ago
Konstantin Baierer ad379aea2b store pc:TextLine ID in TSV, fix #5 4 years ago
Kai Labusch 9c63631d7a
Merge pull request #4 from kba/core-page-api
use OCR-D/core PAGE API for reading order and recursive regions
4 years ago