I have a dataset tracked by DVC which contains personal information about a group of people. Keeping privacy and security in mind, if someone asks to have their data deleted from the dataset, how do I remove it from the entire history of the dataset? Just deleting the file and adding another DVC commit obviously won’t do, because the data is still easily recoverable from the history.
I can see that the dvc gc command does something along these lines, but it’s not clear to me from the documentation whether you can specify the file that you want deleted. It looks like it only deletes unused files in the history by default.
Is there any command that deletes targeted files from the entire DVC history?