Hello,
First of all thank you for your contribution.
I would like to ask if there is a way to keep all my cache of a dataset into a remote minio bucket and not appearing into my local storage.
I have added my dataset into a remote minio bucket
I tried to do pull data on another computer and it works
I would like to understand why ‘dvc pull’ works as intended (downloads everything)
but in the scenacio
I want to pull only selected files, I should first create a directory named ‘data_temp’ to proceed with dvc pull data/file
Why I cannot pull selected directory with dvc pull data/folder1
I’m not 100% sure I understand your usecase, but if you need to pull individual subdirectories, I’d recommend first creating data_temp, then importing subfolders. That will make it possible to pull subfolders:
Thank you for your reply!
Actually this is what I try to achieve, managing-external-data.
My usecase is that I have a large dataset with limited storage, I want to share the dataset without creating unnecessary cache to me or my colleagues. Also, I need to select part of the data (e.g. with a certain tag) or dropping out garbage, this is why I try to pull certain folders from the external storage. I believe the Examples section is what I want to achieve. However, I haven’t managed to do it in minio
I tried import-url before and even though the configuration exist and dvc pull works, dvc import-url does not locate the credentials
ERROR: unexpected error - Unable to locate credentials
I also tried