Is it possible to remove old versions of data files?

My dvc repository got big, and I do not need the old versions of data files.

Is there any way to remove them from the repository, to save the disk space?

In fact, if there are no other options, I would remove my dvc repository and start it again.

However, I do not want to recreate my dvc files (pipelines), keeping the existing ones.

Is this possible?

Thanks

2 Likes

Hi @byoussin !

Sure, you can use dvc gc command for that. In short running dvc gc(no args) will remove everything from cache, except for those cache files that are used in your current workspace. Check out https://dvc.org/doc/command-reference/gc for more details. And let us know if you have any further questions :slightly_smiling_face:

2 Likes