We use DVC for our non-ML models, since it provides clear dependency management for data. The input data is tracked on AWS S3, and the results of all stages should be uploaded back to S3.
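For context, our pipeline stages look roughly like this (the stage name, script, and paths below are placeholders, not our real setup):

```yaml
stages:
  preprocess:
    cmd: python preprocess.py    # hypothetical processing script
    deps:
      - data/raw                 # large input tracked on S3 (gigabytes)
    outs:
      - data/results             # small result files we actually care about
```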
However, when running `dvc push`, it wants to upload gigabytes of data (since our inputs are large). That looks very expensive, and in the end we only care about the resulting files, not the cache.
Is there any way to avoid uploading gigabytes of cache while still having the stage outputs tracked and uploaded?