1
0
Fork 0
mirror of https://github.com/qurator-spk/modstool.git synced 2025-06-09 19:59:57 +02:00

🚧 Write a Parquet file

This commit is contained in:
Mike Gerber 2024-07-27 12:57:33 +02:00
parent 03d86ce68a
commit dd4febf24d
3 changed files with 12 additions and 11 deletions

View file

@ -28,8 +28,8 @@ alto4pandas /path/to/a/directory/full/of/alto_files
## Example
In this example we convert the MODS metadata contained in the METS files in
`/srv/data/digisam_mets-sample-300` to a pandas DataFrame under
`mods_info_df.pkl`. This file can then be read by your data scientist using
`pd.read_pickle()`.
`mods_info_df.parquet`. This file can then be read by your data scientist using
`pd.read_parquet()`.
~~~
% mods4pandas /srv/data/digisam_mets-sample-300