Hello,
Let’s say I have just generated new data in a renku project, either by executing commands or by running `renku run`.
What is the recommended way of creating a dataset with that new data? In particular:
- When creating the dataset, should the short name of the dataset match the directory hierarchy in `data` to prevent additional directories from being created?
- Should I specify the git repo URL and the path to the data within the git repo, even though the dataset is created within the project where the data was generated?
- If I re-run `renku run`, do I need to run `renku dataset update`, even though the dataset is specified for the project where the data was generated?
Thanks!
Cyril