mirror of https://github.com/qurator-spk/page2tsv.git synced 2025-06-09 19:39:54 +02:00
Commit graph

52 commits

Author SHA1 Message Date
Kai
49861b1652 support confidences in find-entities 2021-05-06 13:17:07 +02:00
Kai
0da38d6ec6 support confidences in find-entities 2021-05-06 13:01:41 +02:00
Kai
9b3198e401 add priority option for find-entities 2021-05-06 12:42:12 +02:00
Kai
7b53cc5539 add priority option for find-entities 2021-05-06 12:24:47 +02:00
Kai
318d9bd122 fix #7 2021-04-09 08:07:27 +02:00
Kai Labusch
abcdb67e9e
Merge pull request #6 from kba/lineid-ocr-tsv
Retain line_id, tsv2page CLI to propagate results back to PAGE-XML
2021-04-08 10:53:51 +02:00
Konstantin Baierer
f03acbf54d tsv2page CLI to propagate TSV results back to PAGE-XML 2021-04-01 17:53:27 +02:00
Konstantin Baierer
ad379aea2b store pc:TextLine ID in TSV, fix #5 2021-04-01 17:13:30 +02:00
Kai Labusch
9c63631d7a
Merge pull request #4 from kba/core-page-api
use OCR-D/core PAGE API for reading order and recursive regions
2021-04-01 15:43:46 +02:00
Konstantin Baierer
675c88a67d requirements: ocrd pulls in requests already 2021-04-01 15:27:23 +02:00
Konstantin Baierer
d80b02c56d use OCR-D/core PAGE API for reading order and recursive regions 2021-04-01 15:21:03 +02:00
Kai Labusch
e21fbc09a1 fix url 2021-03-18 21:20:46 +01:00
Kai
1ec06a3087 fix setup.py 2021-03-18 08:27:36 +01:00
Kai Labusch
eca7823b10
Merge pull request #3 from qurator-spk/cneud-patch-1
fix snippets
2021-03-10 17:00:59 +01:00
Clemens Neudecker
5c82b83b2e
fix snippets
use `full` resolution IIIF image for snippets
2021-03-10 16:59:09 +01:00
Kai
243c7b48c6 fix line shift 2021-03-10 16:08:39 +01:00
Kai
6ffba183ab fix repeated text lines 2021-03-10 15:33:30 +01:00
Kai
de575037e6 fix repeated text rows 2021-03-10 15:15:24 +01:00
Kai
a6008b83b5 remove full 2021-03-10 15:11:38 +01:00
Kai
487b74b6e6 #2 2021-03-10 14:06:51 +01:00
Kai
c554644838 Add directory parsing option to make-page2tsv-commands 2021-03-10 12:03:45 +01:00
Kai
aa79678403 Add directory parsing option to make-page2tsv-commands 2021-03-10 11:57:39 +01:00
Kai
7fc39739b7 Add directory parsing option to make-page2tsv-commands 2021-03-10 11:51:49 +01:00
Kai
f606cb92b0 Change scale-factor default parameter. Fix make-page2tsv-commands 2021-03-10 11:15:42 +01:00
Kai
900015da61 store OCR or NED confidences in tsv file 2021-02-26 12:18:10 +01:00
Kai
5d55ba24a3 use max confidence instead of mean 2021-02-04 07:37:58 +01:00
Kai
85ec36218e support visualization of ocr confidences 2021-02-03 15:31:36 +01:00
Kai
2b73b421ae support visualization of ocr confidences 2021-02-03 15:22:38 +01:00
Kai
c3acd74e9f add OCR annotation functionality 2021-02-01 16:25:12 +01:00
Kai Labusch
a834da494a permit empty files 2020-08-15 08:46:18 +02:00
Kai Labusch
2dc3857770 make tools more robust against glitches within the input files 2020-07-02 11:37:54 +02:00
Kai Labusch
e09f40db61 proper support for retroactive entity linking 2020-06-19 14:30:38 +02:00
U-PK\b-kl104
449bd1d3ca preserve URL structure in tsv files during NER/NED amendment 2020-06-19 11:29:51 +02:00
Kai Labusch
361c811264 add command line tool that creates page2tsv commands from an excel file 2020-06-02 15:40:36 +02:00
Kai
83fb2ea033 enable NED only usage of find-entities 2020-05-25 15:10:08 +02:00
Kai
c12bea2cb0 enable NED only usage of find-entities 2020-05-25 15:09:13 +02:00
Kai Labusch
975487a233 adapt find-entities to CLEF2020 requirements 2020-05-25 07:15:46 +02:00
Kai
0d650ebcc5 support loading ned result from disk 2020-05-22 08:29:08 +02:00
Kai
9fe35377e3 disable proxy option in find-entities 2020-05-19 19:51:23 +02:00
Kai
c7f4b6fe53 add proper NED support 2020-04-09 09:57:03 +02:00
Kai
24fd7245f5 add findentities command line tool that can be used in order to NER/NED tag an existing .tsv file 2020-03-25 08:18:59 +01:00
Kai
b13dae29f5 rename GND-ID column to more generic ID 2020-03-13 09:05:55 +01:00
Kai Labusch
0cd9cd932a support automatic named entity disambiguation 2020-03-12 11:01:58 +01:00
Kai Labusch
05f49df6d2 support Qurator calamari PAGE xml 2020-03-11 12:56:09 +01:00
Kai Labusch
abdabbac4f try to infer correct line ordering ... 2020-03-09 13:44:16 +01:00
Kai Labusch
7bf9cfa5de try to infer correct line ordering ... 2020-03-09 10:58:07 +01:00
Clemens Neudecker
e535a070c4
Update cli.py 2020-02-20 18:35:16 +01:00
Kai Labusch
2946909cf3 add command line option for image scale factor 2020-01-10 13:04:07 +01:00
Clemens Neudecker
311dac31ac
Update README.md 2019-12-16 17:24:41 +01:00
cneud
f888017f03 add example.xml PAGE-XML 2019-12-16 16:40:39 +01:00