Hey guys, quick question.
Following the ML flow best practice, we must store our raw data in one data lake structure, right?
DVC can use Google Storage/S3 and a lot of other storage support to store our code/dataset and models in one common place.
Is it correct to say than I have one “Data Lake” structure just using DVC?
I’m using DVC to store my dataset and models, and now I would like to know if is necessary to implement one data lake architecture on top of that.