Erasing the lineage of a file (or files)

Hi all,

Recently, I did a renku run for an experiment, but I got a somewhat awkward lineage. I therefore reverted that step with git revert and ran it again. However, I got an even more awkward lineage:

I suspect that renku is taking the old lineage, which was not reverted, and adding the new changes on top. So here is my question: is there any way to fully remove the lineage of a file, or a set of files, so that it can be regenerated from scratch?

In this case, the files involved are the input (00_rawpdfs.tar.gz), the output (02_extractedxml.tar.gz), and the script used (run_extract_origxml.py). I am aiming to get something like:

Thank you so much!
Cheers
Luis
