11 Commits (57b6b7d6d3ca4a11a418773f2b007558ea2b79a2)

Author SHA1 Message Date
cneud 98e48dec38 typo 5 years ago
Clemens Neudecker 409d7db2f2 Transform OCR coordinates for web presentation images (fixes #31). thx @kba! (scaling factor will require testing with more images though) 5 years ago
Kai Labusch 4cb0a53434 turn --noproxy option into flag 5 years ago
Kai Labusch 8d79c67478 remove breakpoint 5 years ago
Kai Labusch ee07c0cf7c remove superfluous parameter 5 years ago
Kai Labusch 137fff5655 add word/sentence tokenization and NER pre-processing 5 years ago
Kai Labusch daa9a2676e fix wrong computation of boundaries 5 years ago
Kai Labusch 692e990fba improve html layout; add reasonable default for --image-url option 5 years ago
Kai Labusch d6311edd0c improve page2tsv tool 5 years ago
Kai Labusch 450886cda6 add image preview 5 years ago
Kai Labusch 6afb0a6375 add annotation tools and url mapping integration 5 years ago