I am using DVC to track different versions of a dataset consisting of annotations on a NER dataset. In particular, I would like to keep track of some dataset metadata which may change from one version to another, and I would also like to quickly compare different versions of the dataset through the evolution of these metadata.
The current solution I am adopting is to track the information on the dataset in a README.md but I would like to keep the link with the dataset file more clear than that.
How to do that? I thought about adding this metadata to the
dataset.dvc file that is automatically generated, but I am not sure that this is a good idea: would a future
dvc add overwrite/delete that for instance?
Thank you in advance for your help!