CONGRATS for this amazing milestone!!!
I have few points that might need further clarification:
-
Multi-stage DVC files: How the
params
section relates to the code inprocess_raw_data
? Isprocess_file
specified in the code? -
Run cache: I am so used to the linkage between
dvc
andgit
that I find this section confusing… Candvc
“persist” a state without having “S3, Azure Blob, SSH” around? Maybe few more examples could be helpful here. -
Plots: Now I’m even more confused.
dvc
doesn’t use thegit
anymore to track the state, but the plots compare git hashes?