Update README.md

2025-07-27 19:29:57 +02:00 · 2020-08-03 12:45:55 +02:00 · 51d90dad50
commit 51d90dad50
parent 4036e2a527
1 changed file with 5 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -4,6 +4,11 @@
 ## Introduction
 This tool performs printspace, region and textline detection from document image
 data and returns the results as [PAGE-XML](https://github.com/PRImA-Research-Lab/PAGE-XML).
+The goal of this project is to extract the textlines of a document so that they can be fed to an OCR model. This is done in four successive stages:
+* Stage 1: Printspace or border extraction
+* Stage 2: Layout analysis
+* Stage 3: Textline detection
+* Stage 4: Heuristic methods

 ## Installation
 `pip install .`
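
Review note: the four stages added to the README run one after another, each consuming the result of the previous one. The sketch below only illustrates that ordering. It is not the tool's actual code: `PageState`, `make_stage`, `STAGES` and `run_pipeline` are hypothetical names, and each stage is a placeholder that merely records that it ran.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class PageState:
    """Hypothetical container handed from one stage to the next."""
    image_name: str
    completed: List[str] = field(default_factory=list)


Stage = Callable[[PageState], PageState]


def make_stage(name: str) -> Stage:
    """Build a placeholder stage that only records its own name."""
    def stage(state: PageState) -> PageState:
        state.completed.append(name)
        return state
    return stage


# Stage order taken from the README text; the bodies are stand-ins.
STAGES: List[Stage] = [
    make_stage("printspace or border extraction"),
    make_stage("layout analysis"),
    make_stage("textline detection"),
    make_stage("heuristic methods"),
]


def run_pipeline(image_name: str) -> PageState:
    """Apply every stage in order to a fresh PageState."""
    state = PageState(image_name)
    for stage in STAGES:
        state = stage(state)
    return state


if __name__ == "__main__":
    print(run_pipeline("page.png").completed)
```

In a real implementation each placeholder would be replaced by the corresponding processing step, and the final state would be serialized to PAGE-XML as the introduction describes.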