Konstantin Baierer
|
ad379aea2b
|
store pc:TextLine ID in TSV, fix #5
|
2021-04-01 17:13:30 +02:00 |
|
Konstantin Baierer
|
675c88a67d
|
requirements: ocrd pulls in requests already
|
2021-04-01 15:27:23 +02:00 |
|
Konstantin Baierer
|
d80b02c56d
|
use OCR-D/core PAGE API for reading order and recursive regions
|
2021-04-01 15:21:03 +02:00 |
|
Kai Labusch
|
e21fbc09a1
|
fix url
|
2021-03-18 21:20:46 +01:00 |
|
Kai
|
1ec06a3087
|
fix setup.py
|
2021-03-18 08:27:36 +01:00 |
|
Kai Labusch
|
eca7823b10
|
Merge pull request #3 from qurator-spk/cneud-patch-1
fix snippets
|
2021-03-10 17:00:59 +01:00 |
|
Clemens Neudecker
|
5c82b83b2e
|
fix snippets
use `full` resolution IIIF image for snippets
|
2021-03-10 16:59:09 +01:00 |
|
Kai
|
243c7b48c6
|
fix line shift
|
2021-03-10 16:08:39 +01:00 |
|
Kai
|
6ffba183ab
|
fix repeated text lines
|
2021-03-10 15:33:30 +01:00 |
|
Kai
|
de575037e6
|
fix repeated text rows
|
2021-03-10 15:15:24 +01:00 |
|
Kai
|
a6008b83b5
|
remove full
|
2021-03-10 15:11:38 +01:00 |
|
Kai
|
487b74b6e6
|
#2
|
2021-03-10 14:06:51 +01:00 |
|
Kai
|
c554644838
|
Add directory parsing option to make-page2tsv-commands
|
2021-03-10 12:03:45 +01:00 |
|
Kai
|
aa79678403
|
Add directory parsing option to make-page2tsv-commands
|
2021-03-10 11:57:39 +01:00 |
|
Kai
|
7fc39739b7
|
Add directory parsing option to make-page2tsv-commands
|
2021-03-10 11:51:49 +01:00 |
|
Kai
|
f606cb92b0
|
Change scale-factor default parameter. Fix make-page2tsv-commands
|
2021-03-10 11:15:42 +01:00 |
|
Kai
|
900015da61
|
store OCR or NED confidences in tsv file
|
2021-02-26 12:18:10 +01:00 |
|
Kai
|
5d55ba24a3
|
use max confidence instead of mean
|
2021-02-04 07:37:58 +01:00 |
|
Kai
|
85ec36218e
|
support visualization of ocr confidences
|
2021-02-03 15:31:36 +01:00 |
|
Kai
|
2b73b421ae
|
support visualization of ocr confidences
|
2021-02-03 15:22:38 +01:00 |
|
Kai
|
c3acd74e9f
|
add OCR annotation functionality
|
2021-02-01 16:25:12 +01:00 |
|
Kai Labusch
|
a834da494a
|
permit empty files
|
2020-08-15 08:46:18 +02:00 |
|
Kai Labusch
|
2dc3857770
|
make tools more robust against glitches within the input files
|
2020-07-02 11:37:54 +02:00 |
|
Kai Labusch
|
e09f40db61
|
proper support for retroactive entity linking
|
2020-06-19 14:30:38 +02:00 |
|
U-PK\b-kl104
|
449bd1d3ca
|
preserve URL structure in tsv files during NER/NED amendment
|
2020-06-19 11:29:51 +02:00 |
|
Kai Labusch
|
361c811264
|
add command line tool that creates page2tsv commands from an excel file
|
2020-06-02 15:40:36 +02:00 |
|
Kai
|
83fb2ea033
|
enable NED only usage of find-entities
|
2020-05-25 15:10:08 +02:00 |
|
Kai
|
c12bea2cb0
|
enable NED only usage of find-entities
|
2020-05-25 15:09:13 +02:00 |
|
Kai Labusch
|
975487a233
|
adapt find-entities to CLEF2020 requirements
|
2020-05-25 07:15:46 +02:00 |
|
Kai
|
0d650ebcc5
|
support loading ned result from disk
|
2020-05-22 08:29:08 +02:00 |
|
Kai
|
9fe35377e3
|
disable proxy option in find-entities
|
2020-05-19 19:51:23 +02:00 |
|
Kai
|
c7f4b6fe53
|
add proper NED support
|
2020-04-09 09:57:03 +02:00 |
|
Kai
|
24fd7245f5
|
add findentities command line tool that can be used in order to NER/NED tag an existing .tsv file
|
2020-03-25 08:18:59 +01:00 |
|
Kai
|
b13dae29f5
|
rename GND-ID column to more generic ID
|
2020-03-13 09:05:55 +01:00 |
|
Kai Labusch
|
0cd9cd932a
|
support automatic named entity disambiguation
|
2020-03-12 11:01:58 +01:00 |
|
Kai Labusch
|
05f49df6d2
|
support Qurator calamari PAGE xml
|
2020-03-11 12:56:09 +01:00 |
|
Kai Labusch
|
abdabbac4f
|
try to infer correct line ordering ...
|
2020-03-09 13:44:16 +01:00 |
|
Kai Labusch
|
7bf9cfa5de
|
try to infer correct line ordering ...
|
2020-03-09 10:58:07 +01:00 |
|
Clemens Neudecker
|
e535a070c4
|
Update cli.py
|
2020-02-20 18:35:16 +01:00 |
|
Kai Labusch
|
2946909cf3
|
add command line option for image scale factor
|
2020-01-10 13:04:07 +01:00 |
|
Clemens Neudecker
|
311dac31ac
|
Update README.md
|
2019-12-16 17:24:41 +01:00 |
|
cneud
|
f888017f03
|
add example.xml PAGE-XML
|
2019-12-16 16:40:39 +01:00 |
|
cneud
|
59a1e81243
|
extract TSV Tools from qurator-spk/neath
|
2019-12-16 16:37:47 +01:00 |
|
Clemens Neudecker
|
92a81a869c
|
Initial commit
|
2019-12-16 16:36:36 +01:00 |
|