Gerber, Mike
c7ad6eb724
📦 v1.0.5
2 years ago
Mike Gerber
eb48dcbd84
Merge pull request #76 from bertsky/skip-empty
...
recognize: skip tiny or bin-empty lines, too
2 years ago
Gerber, Mike
77ff6df5a4
📦 v1.0.4
2 years ago
Robert Sachunsky
8c2e4ca76d
recognize: skip tiny or bin-empty lines, too
2 years ago
Robert Sachunsky
36e513604e
descend to all available TextRegions recursively
2 years ago
Robert Sachunsky
01312c6369
recognize: delegate to core functions
2 years ago
Robert Sachunsky
5f23c03cd9
recognize: remove checkpoint param in favour of checkpoint_dir alone
2 years ago
Robert Sachunsky
11615be6b2
ocrd-tool.json: add model content-type, remove glob variant
2 years ago
Gerber, Mike
76b34c50cb
📦 v1.0.3
3 years ago
Gerber, Mike
34013ddb02
📝 Reduce process() docstring again
3 years ago
Robert Sachunsky
4c6d6655e1
improve process() docstring
3 years ago
Robert Sachunsky
3bde7cb37f
init from constructor not process(), use conventional name setup()
3 years ago
Gerber, Mike
da98713e73
📦 v1.0.2
3 years ago
Gerber, Mike
0869386ec4
🐛 Fix word and glyph coordinates
...
Fixes GH-57.
3 years ago
Gerber, Mike
4cf25b8119
🎨 Rename input_channels variable to network_input_channels
3 years ago
Gerber, Mike
c0902cdef5
Merge branch 'master' into image-features
3 years ago
Gerber, Mike
1bb72cbaf1
📦 v1.0.1
3 years ago
Mike Gerber
53c94fea95
Merge pull request #53 from OCR-D/resolve-resources
...
Resolve resources
3 years ago
Konstantin Baierer
03f5e44e62
define default for checkpoint_dir, but allow checkpoint still
3 years ago
Mike Gerber
a014bab5b6
Merge pull request #49 from OCR-D/fix-48
...
check for empty line image, ht @andbue, fix #48
3 years ago
Mike Gerber
e7fb432e35
Merge pull request #52 from OCR-D/checkpoint_dir
...
Checkpoint dir
3 years ago
Konstantin Baierer
00e43b1d1f
use Processor.resolve_files to handle on-demand download of models via registry
3 years ago
Konstantin Baierer
fdd30ebb89
also add tensorflow version to --version output
3 years ago
Konstantin Baierer
d6804bd9c3
fix typos
3 years ago
Konstantin Baierer
83adfcfd5a
implement "checkpoint_dir" parameter as a simpler alternative to "checkpoint"
3 years ago
Konstantin Baierer
fe973e58db
add version of calamari in --version output
3 years ago
Konstantin Baierer
df530877dc
check for empty line image, ht @andbue, fix #48
3 years ago
Gerber, Mike
448a5b0dbc
📦 v1.0.0
4 years ago
Gerber, Mike
8fcd331fbd
Merge branch 'feat/update-calamari1'
4 years ago
Konstantin Baierer
e4982aff37
getLogger per method
4 years ago
Konstantin Baierer
f746b73fd0
use make_file_id and assert_file_grp_cardinality
4 years ago
Gerber, Mike
86410110bc
📦 v0.0.7
4 years ago
Gerber, Mike
7da45a0ec1
Set pcGtsId
...
Newest OCR-D validation checks PAGE-XML pcGtsId against METS file/@ID.
Set the pcGtsId here correctly.
Fixes #40 .
4 years ago
Gerber, Mike
93190fae3b
⚡ Recognize more than one line at a time (Fixes gh#20)
4 years ago
Gerber, Mike
123ee61a8b
v0.0.6
4 years ago
Gerber, Mike
62e5e0c295
🐛 ocrd-tool.json: Fix GitHub url by s/kba/OCR-D
4 years ago
Gerber, Mike
0334a35870
🐛 Sort predictions in exactly the same way, also when building the text
4 years ago
Gerber, Mike
0c9e1f13c7
🐛 Sort predictions in exactly the same way to make sure we are correctly removing spaces
4 years ago
Gerber, Mike
d2c843aa3f
📦 v0.0.5
4 years ago
Gerber, Mike
cd8f6a5fcb
🐛 Use line id for debug message
4 years ago
Gerber, Mike
5b6d8b3f41
🐛 Build line text on our own
...
Calamari does whitespace post-processing on prediction.sentence, while
it does not do the same on prediction.positions. Do it on our own to
have consistency.
Fixes GH-37.
4 years ago
Gerber, Mike
4508e3ec47
📦 v0.0.4
4 years ago
Gerber, Mike
b802b4deaf
✨ Allow configuring a cut off confidence value for glyph alternatives
4 years ago
Gerber, Mike
ef3fb44fb5
✨ Allow controlling of output hierarchy level, e.g. only line, not words+glyphs
4 years ago
Gerber, Mike
6f4736f8e4
✨ Do word segmentation as expected by OCR-D PAGE specs
4 years ago
Gerber, Mike
0f9c94e7dc
🐛 Start with TextEquiv index=1 to adhere to OCR-D PAGE conventions
...
https://ocr-d.github.io/page#multiple-textequivs
4 years ago
Gerber, Mike
909632493b
🚧 Add future TODOs
4 years ago
Gerber, Mike
3149e1d9e0
📝 unwanted()
4 years ago
Gerber, Mike
91cca1e1b8
📝 Document why we are using Unicode text segmentation to produce word results
4 years ago
Gerber, Mike
2650189910
🧹 Add whitespace
4 years ago