1
0
Fork 0
mirror of https://github.com/qurator-spk/modstool.git synced 2025-06-09 19:59:57 +02:00

Remove direct CSV/Excel support

Remove direct CVS/Excel support from the CLI; README now has instructions to convert.

Closes gh-40.
This commit is contained in:
Mike Gerber 2024-07-31 11:09:34 +02:00
parent a1f333f4a4
commit 7122f0265f
3 changed files with 19 additions and 13 deletions

View file

@ -28,6 +28,18 @@ mods4pandas /path/to/a/directory/containing/mets_files
alto4pandas /path/to/a/directory/full/of/alto_files
~~~
### Conversion to other formats
CSV:
```
python -c 'import pandas as pd; pd.read_parquet("mods_info_df.parquet").to_csv("mods_info_df.csv")'
```
Excel (requires `XlsxWriter`):
```
python -c 'import pandas as pd; pd.read_parquet("mods_info_df.parquet").to_excel("mods_info_df.xlsx"
, engine="xlsxwriter")'
```
## Example
In this example we convert the MODS metadata contained in the METS files in
`/srv/data/digisam_mets-sample-300` to a pandas DataFrame under