I have two questions.
- My data storage is Azure blob which contain parquet files. These parquet files are output of preprocessing IOT data which means data keep coming in storage blobs. How to version data with dvc because with every change dvc will create a new version and then i will end up in many many data versions.
Please help me in understanding how people integrate dvc with continuous data.
- I am using Azure ML env. for ML pipeline. How i can integrate dvc with Azure ML env.
Thanks in advance.