11 Commits (374a607ea81a3d8201f4f587662948aad59f721b)

Author SHA1 Message Date
cneud 98e48dec38 typo 5 years ago
Clemens Neudecker 409d7db2f2
Transform OCR coordinates for web presentation images (fixes #31)
thx @kba!

(scaling factor will require testing with more images though)
5 years ago
Kai Labusch 4cb0a53434 turn --noproxy option into flag 5 years ago
Kai Labusch 8d79c67478 remove breakpoint 5 years ago
Kai Labusch ee07c0cf7c remove superfluous parameter 5 years ago
Kai Labusch 137fff5655 add word/sentence tokenization and NER pre-processing 5 years ago
Kai Labusch daa9a2676e fix wrong computation of boundaries 5 years ago
Kai Labusch 692e990fba improve html layout; add reasonable default for --image-url option 5 years ago
Kai Labusch d6311edd0c improve page2tsv tool 5 years ago
Kai Labusch 450886cda6 add image preview 5 years ago
Kai Labusch 6afb0a6375 add annotation tools and url mapping integration 5 years ago