There is no out-of-the-box integration between DVC Pipelines and Hydra yet. We are currently exploring what improvements could be made in that regard. Don’t hesitate to share your opinions or requests in that discussion or in new issues.
Which best practices to recommend right now depends on how you are using Hydra within the stages, so any additional info about your pipeline would be helpful. Regardless, here are some thoughts, assuming a basic Hydra app like the one used in the tutorial:
Regarding “hydra outputs”, I would override hydra.output_subdir to None (written as null in YAML or on the command line) and use DVC outputs as you would usually do without Hydra. You don’t really need the date-based subfolder versioning that Hydra provides, as DVC+Git will handle the versioning for you.
Regarding “hydra config”, I would suggest tracking the parent config directory as a DVC dependency.
You would then just run the stage with dvc repro and modify params.yaml to pass other arguments to Hydra.
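To make the params.yaml idea a bit more concrete, here is a minimal sketch of how nested parameter values could be turned into Hydra command-line overrides. This is only an illustration, not something DVC ships: the helper names to_hydra_overrides and build_command, the script name my_app.py, and the db.user / db.timeout values are all hypothetical. The only Hydra-specific parts are the dotted key=value override syntax and hydra.output_subdir=null.

```python
from typing import Any, Dict, List


def to_hydra_overrides(params: Dict[str, Any], prefix: str = "") -> List[str]:
    """Flatten nested params into Hydra-style `key.subkey=value` overrides."""
    overrides: List[str] = []
    for key, value in params.items():
        dotted = f"{prefix}.{key}" if prefix else str(key)
        if isinstance(value, dict):
            # Recurse into nested sections, extending the dotted path.
            overrides.extend(to_hydra_overrides(value, dotted))
        elif value is None:
            # Hydra overrides spell a missing value as `null`, not `None`.
            overrides.append(f"{dotted}=null")
        elif isinstance(value, bool):
            # Lowercase booleans to match YAML-style true/false.
            overrides.append(f"{dotted}={str(value).lower()}")
        else:
            overrides.append(f"{dotted}={value}")
    return overrides


def build_command(script: str, params: Dict[str, Any]) -> List[str]:
    """Build the argv for a Hydra app (hypothetical helper for illustration)."""
    # hydra.output_subdir=null asks Hydra not to write its .hydra config copy.
    return ["python", script, "hydra.output_subdir=null", *to_hydra_overrides(params)]


if __name__ == "__main__":
    # Hypothetical values, as they might appear in params.yaml.
    example = {"db": {"user": "root", "timeout": 20}}
    print(" ".join(build_command("my_app.py", example)))
```

The resulting command is the kind of thing you would put in the cmd of a stage, so that changing a value in params.yaml changes what Hydra receives on the next dvc repro.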
As I said at the beginning, these are just some workarounds for a very simple app. If you are willing to share more details, we can discuss your use case and see what else can be done.
Is there any news on the Hydra-DVC pipelines integration? I’m currently using both in my projects and would love to have a way to use Hydra configs and DVC pipelines from the console with dvc experiment run/dvc repro.
Hi @TheDataCoder, sorry for the late reply. We are currently working on a more in-depth integration between DVC and Hydra. I don’t have an accurate ETA, but it should be ready before the end of this quarter.