Commit Graph

83 Commits (master)

Author SHA1 Message Date
Gerber, Mike c7ad6eb724 📦 v1.0.5 2 years ago
Mike Gerber eb48dcbd84
Merge pull request #76 from bertsky/skip-empty
recognize: skip tiny or bin-empty lines, too
2 years ago
Gerber, Mike 77ff6df5a4 📦 v1.0.4 2 years ago
Robert Sachunsky 8c2e4ca76d recognize: skip tiny or bin-empty lines, too 2 years ago
Robert Sachunsky 36e513604e descend to all available TextRegions recursively 2 years ago
Robert Sachunsky 01312c6369 recognize: delegate to core functions 2 years ago
Robert Sachunsky 5f23c03cd9
recognize: remove checkpoint param in favour of checkpoint_dir alone 2 years ago
Robert Sachunsky 11615be6b2
ocrd-tool.json: add model content-type, remove glob variant 2 years ago
Gerber, Mike 76b34c50cb 📦 v1.0.3 3 years ago
Gerber, Mike 34013ddb02 📝 Reduce process() docstring again 3 years ago
Robert Sachunsky 4c6d6655e1 improve process() docstring 3 years ago
Robert Sachunsky 3bde7cb37f init from constructor not process(), use conventional name setup() 3 years ago
Gerber, Mike da98713e73 📦 v1.0.2 3 years ago
Gerber, Mike 0869386ec4 🐛 Fix word and glyph coordinates
Fixes GH-57.
3 years ago
Gerber, Mike 4cf25b8119 🎨 Rename input_channels variable to network_input_channels 3 years ago
Gerber, Mike c0902cdef5 Merge branch 'master' into image-features 3 years ago
Gerber, Mike 1bb72cbaf1 📦 v1.0.1 3 years ago
Mike Gerber 53c94fea95
Merge pull request #53 from OCR-D/resolve-resources
Resolve resources
3 years ago
Konstantin Baierer 03f5e44e62 define default for checkpoint_dir, but allow checkpoint still 3 years ago
Mike Gerber a014bab5b6
Merge pull request #49 from OCR-D/fix-48
check for empty line image, ht @andbue, fix #48
3 years ago
Mike Gerber e7fb432e35
Merge pull request #52 from OCR-D/checkpoint_dir
Checkpoint dir
3 years ago
Konstantin Baierer 00e43b1d1f use Processor.resolve_files to handle on-demand download of models via registry 3 years ago
Konstantin Baierer fdd30ebb89 also add tensorflow version to --version output 3 years ago
Konstantin Baierer d6804bd9c3 fix typos 3 years ago
Konstantin Baierer 83adfcfd5a implement "checkpoint_dir" parameter as a simpler alternative to "checkpoint" 3 years ago
Konstantin Baierer fe973e58db add version of calamari in --version output 3 years ago
Konstantin Baierer df530877dc check for empty line image, ht @andbue, fix #48 3 years ago
Gerber, Mike 448a5b0dbc 📦 v1.0.0 4 years ago
Gerber, Mike 8fcd331fbd Merge branch 'feat/update-calamari1' 4 years ago
Konstantin Baierer e4982aff37 getLogger per method 4 years ago
Konstantin Baierer f746b73fd0 use make_file_id and assert_file_grp_cardinality 4 years ago
Gerber, Mike 86410110bc 📦 v0.0.7 4 years ago
Gerber, Mike 7da45a0ec1 Set pcGtsId
Newest OCR-D validation checks PAGE-XML pcGtsId against METS file/@ID.
Set the pcGtsId here correctly.

Fixes #40.
4 years ago
Gerber, Mike 93190fae3b Recognize more than one line at a time (Fixes gh#20) 4 years ago
Gerber, Mike 123ee61a8b v0.0.6 4 years ago
Gerber, Mike 62e5e0c295 🐛 ocrd-tool.json: Fix GitHub url by s/kba/OCR-D 4 years ago
Gerber, Mike 0334a35870 🐛 Sort predictions in exactly the same way, also when building the text 4 years ago
Gerber, Mike 0c9e1f13c7 🐛 Sort predictions in exactly the same way to make sure we are correctly removing spaces 4 years ago
Gerber, Mike d2c843aa3f 📦 v0.0.5 4 years ago
Gerber, Mike cd8f6a5fcb 🐛 Use line id for debug message 4 years ago
Gerber, Mike 5b6d8b3f41 🐛 Build line text on our own
Calamari does whitespace post-processing on prediction.sentence, while
it does not do the same on prediction.positions. Do it on our own to
have consistency.

Fixes GH-37.
4 years ago
Gerber, Mike 4508e3ec47 📦 v0.0.4 4 years ago
Gerber, Mike b802b4deaf Allow configuring a cut off confidence value for glyph alternatives 4 years ago
Gerber, Mike ef3fb44fb5 Allow controlling of output hierarchy level, e.g. only line, not words+glyphs 4 years ago
Gerber, Mike 6f4736f8e4 Do word segmentation as expected by OCR-D PAGE specs 4 years ago
Gerber, Mike 0f9c94e7dc 🐛 Start with TextEquiv index=1 to adhere to OCR-D PAGE conventions
https://ocr-d.github.io/page#multiple-textequivs
4 years ago
Gerber, Mike 909632493b 🚧 Add future TODOs 4 years ago
Gerber, Mike 3149e1d9e0 📝 unwanted() 4 years ago
Gerber, Mike 91cca1e1b8 📝 Document why we are using Unicode text segmentation to produce word results 4 years ago
Gerber, Mike 2650189910 🧹 Add whitespace 4 years ago