Update README.md

This commit is contained in:
vahidrezanezhad 2020-08-03 12:46:42 +02:00 committed by GitHub
parent 51d90dad50
commit 8872131a43
No known key found for this signature in database
GPG key ID: 4AEE18F83AFDEB23

View file

@ -5,10 +5,10 @@
This tool performs printspace, region and textline detection from document image This tool performs printspace, region and textline detection from document image
data and returns the results as [PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML). data and returns the results as [PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML).
The goal of this project is to extract textlines of a document to feed an ocr model. This is achieved by four successive stages as follows: The goal of this project is to extract textlines of a document to feed an ocr model. This is achieved by four successive stages as follows:
* Item 1 Printspace or border extraction * Printspace or border extraction
* Item 2 Layout analysis * Layout analysis
* Item 3 Textline detection * Textline detection
* Item 4 Heuristic methods * Heuristic methods
## Installation ## Installation
`pip install .` `pip install .`