| 
								
								
									 Robert Sachunsky | 352783cf7c | non-legacy namespace package | 2024-05-23 20:50:49 +02:00 |  | 
				
					
						| 
								
								
									 Kai Labusch | 06c8b382db | character normalization based on aletheia mapping | 2024-03-14 17:31:41 +01:00 |  | 
				
					
						| 
								
								
									 Kai Labusch | eac71b3e40 | Merge pull request #13 from qurator-spk/fix-ppn-xpath make xpath for PPN number more specific to avoid catching the PPN of containing work | 2024-03-13 12:50:07 +01:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | 3a8bfa74cc | fix namespace typo: s/mets/mods/ Co-authored-by: Stefan Weil <sw@weilnetz.de> | 2024-03-13 12:13:24 +01:00 |  | 
				
					
						| 
								
								
									 Kai Labusch | 2f7d01c7cd | fix alto2tsv bug | 2023-11-10 17:28:30 +01:00 |  | 
				
					
						| 
								
								
									 Kai Labusch | eb750752c6 | Merge pull request #15 from stweil/typo Fix typo (found by codespell) | 2023-10-23 08:09:38 +02:00 |  | 
				
					
						| 
								
								
									 Stefan Weil | 3f35554a70 | Fix typo (found by codespell) Signed-off-by: Stefan Weil <sw@weilnetz.de> | 2023-10-23 08:06:07 +02:00 |  | 
				
					
						| 
								
								
									 Kai Labusch | fa1c6b5aa4 | Merge pull request #14 from stweil/gitignore .gitignore: Ignore build directory | 2023-10-23 07:54:57 +02:00 |  | 
				
					
						| 
								
								
									 Stefan Weil | 175694d25d | .gitignore: Ignore build directory That directory is created by `make all` from ocrd_all and should be ignored to get a clean `git status`. Signed-off-by: Stefan Weil <sw@weilnetz.de> | 2023-10-22 19:15:35 +02:00 |  | 
				
					
						| 
								
								
									 Kai Labusch | 0ec6f83c4c | add alto2tsv | 2023-10-20 16:36:43 +02:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | 82769077df | make xpath for PPN number more specific to avoid catching the PPN of containing work | 2023-06-08 19:08:40 +02:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | 0f64f07635 | 📦 v0.0.1 | 2022-11-09 16:01:44 +01:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | 3b10dcb05b | Merge branch 'ocrd-processors' of https://github.com/kba/page2tsv into ocrd-processors # Conflicts: # setup.py | 2022-11-08 16:24:04 +01:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | 1c0c1cd525 | ocrd processors: use snake_case for add_file | 2022-11-08 16:22:28 +01:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | e1a440b91c | install into qurator namespace | 2022-11-08 16:19:23 +01:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | abeca0df16 | drop requirement for matplotlib (not used) | 2022-10-24 14:29:42 +02:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | db25239075 | Merge branch 'master' into ocrd-processors # Conflicts: # setup.py | 2022-10-24 14:02:24 +02:00 |  | 
				
					
						| 
								
								
									 Kai Labusch | a0e5c82929 | Merge branch 'master' into ocrd-processors | 2022-06-10 10:32:34 +02:00 |  | 
				
					
						| 
								
								
									 Kai | 75796b5c0c | refactor | 2022-06-10 10:00:32 +02:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | 81ba7cff82 | tests | 2022-05-30 17:01:50 +02:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | 60a07c6310 | drop support for scaling, not necessary for SBB use case anymore | 2022-05-30 16:58:20 +02:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | fe4a1eabb1 | setup.py: use ocrd-tool.json for version | 2022-02-21 18:51:28 +01:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | aabcc4866d | remove obsolete tsv.py (now in qurator-sbb-tools | 2022-02-21 18:50:09 +01:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | f813c45ba2 | Merge remote-tracking branch 'origin/master' into ocrd-processors | 2022-02-21 18:47:10 +01:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | aeb67e445f | implement page2tsv/tsv2page as ocrd-neat-{ex,im}port | 2022-02-21 18:47:03 +01:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | 0aee20a7f6 | cli: separate tsv2page and tsv2page_cli | 2022-02-21 17:04:54 +01:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | fe0c355e5a | cli: produce TSV if no words are transcribed | 2022-02-21 17:00:03 +01:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | 93ee53c8e2 | cli: split page2tsv from page2tsv_cli | 2022-02-21 15:22:04 +01:00 |  | 
				
					
						| 
								
								
									 Kai | 9d2d5fcd31 | add missing imports | 2022-02-21 14:03:07 +01:00 |  | 
				
					
						| 
								
								
									 Kai | 568e1cd104 | remove ner/ned code from page2tsv package | 2021-12-15 15:51:00 +01:00 |  | 
				
					
						| 
								
								
									 Kai | ed90193c45 | support segmentation only Page-XML | 2021-11-19 11:27:46 +01:00 |  | 
				
					
						| 
								
								
									 Kai | ee5f03ce07 | change default scale factor to 1.0 | 2021-05-07 12:36:42 +02:00 |  | 
				
					
						| 
								
								
									 Kai | 5e60fabe4a | revert changes | 2021-05-07 11:28:24 +02:00 |  | 
				
					
						| 
								
								
									 Kai | e5b635ec2d | try other coordinate computation | 2021-05-07 10:50:09 +02:00 |  | 
				
					
						| 
								
								
									 Kai | f320904503 | try other coordinate computation | 2021-05-07 10:40:52 +02:00 |  | 
				
					
						| 
								
								
									 Kai | 1eb05d0d62 | xlrd does not support xsls files anymore | 2021-05-07 08:02:33 +02:00 |  | 
				
					
						| 
								
								
									 Kai | ae93668bac | xlrd does not support xsls files anymore | 2021-05-07 07:54:55 +02:00 |  | 
				
					
						| 
								
								
									 Kai | 2bd4ae8d5a | add ned-priority option to page2tsv | 2021-05-06 16:23:08 +02:00 |  | 
				
					
						| 
								
								
									 Kai | d4eb95b64b | make code more robust | 2021-05-06 15:13:26 +02:00 |  | 
				
					
						| 
								
								
									 Kai | 49861b1652 | support confidences in find-entities | 2021-05-06 13:17:07 +02:00 |  | 
				
					
						| 
								
								
									 Kai | 0da38d6ec6 | support confidences in find-entities | 2021-05-06 13:01:41 +02:00 |  | 
				
					
						| 
								
								
									 Kai | 9b3198e401 | add priority option for find-entities | 2021-05-06 12:42:12 +02:00 |  | 
				
					
						| 
								
								
									 Kai | 7b53cc5539 | add priority option for find-entities | 2021-05-06 12:24:47 +02:00 |  | 
				
					
						| 
								
								
									 Kai | 318d9bd122 | fix #7 | 2021-04-09 08:07:27 +02:00 |  | 
				
					
						| 
								
								
									 Kai Labusch | abcdb67e9e | Merge pull request #6 from kba/lineid-ocr-tsv Retain line_id, tsv2page CLI to propagate results back to PAGE-XML | 2021-04-08 10:53:51 +02:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | f03acbf54d | tsv2page CLI to propagate TSV results back to PAGE-XML | 2021-04-01 17:53:27 +02:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | ad379aea2b | store pc:TextLine ID in TSV, fix #5 | 2021-04-01 17:13:30 +02:00 |  | 
				
					
						| 
								
								
									 Kai Labusch | 9c63631d7a | Merge pull request #4 from kba/core-page-api use OCR-D/core PAGE API for reading order and recursive regions | 2021-04-01 15:43:46 +02:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | 675c88a67d | requirements: ocrd pulls in requests already | 2021-04-01 15:27:23 +02:00 |  | 
				
					
						| 
								
								
									 Konstantin Baierer | d80b02c56d | use OCR-D/core PAGE API for reading order and recursive regions | 2021-04-01 15:21:03 +02:00 |  |