Dvc non bare remote

Hi,
let’s say I want to use dvc with azure blob storage. Is there a way that I can use the data from the blob storage directly in other azure projects? Normally a blob storage just gets mounted into another resource. With dvc this seems a problem, because files on the remote are stored differently than locally. Is there a way around that? With git for example it is possible to push into a non bare repository. Is there a similar option for dvc?
Thanks

Is there a way that I can use the data from the blob storage directly in other azure projects?

The data is there as is, but just formatted in a different convention to allow deduplication and simplify the process for the DVC, so you can’t technically access it is as if it was present in a workspace.

Is there a way around that?

External outputs should help you on this point, this way the data is in your remote as if it were locally and DVC tracks the data on your remote not the data in your local workspace. This way, you should be able to get around this problem. See https://dvc.org/doc/user-guide/managing-external-data for more details.

1 Like

You can also access the data through the Python API: https://dvc.org/doc/api-reference. You can get the URL of the data or read directly into your Python program. Alternatively, you can use the command line to download the data anywhere: https://dvc.org/doc/command-reference/get.

1 Like