let’s say I want to use dvc with azure blob storage. Is there a way that I can use the data from the blob storage directly in other azure projects? Normally a blob storage just gets mounted into another resource. With dvc this seems a problem, because files on the remote are stored differently than locally. Is there a way around that? With git for example it is possible to push into a non bare repository. Is there a similar option for dvc?
Is there a way that I can use the data from the blob storage directly in other azure projects?
The data is there as is, but just formatted in a different convention to allow deduplication and simplify the process for the DVC, so you can’t technically access it is as if it was present in a workspace.
Is there a way around that?
External outputs should help you on this point, this way the data is in your remote as if it were locally and DVC tracks the data on your remote not the data in your local workspace. This way, you should be able to get around this problem. See https://dvc.org/doc/user-guide/managing-external-data for more details.