This example in the docs seems a bit odd, and a basic question

JohnAtl · March 25, 2023, 11:39pm

I’m a bit of a noob, and I’m trying to understand how dvc install changes the steps introduced in Dr. O’Brien’s introductory videos.

Question 1

The final example on this page of the docs shows a source file stored in a dvc repository, thus when changed, dictates running repro to create the data, then everything is somehow up to date.

I would think that:

source would be saved to git
running modified source with repro would result in a changed dataset that would need to be added, committed, and pushed.

Question 2

When I’m working on my project and I run git commit, dvc status is run and (let’s assume) shows me that some of my dvc-managed files have changed. Am I correct in these commands (from memory):

# add alll modified files in data folder
dvc add data
dvc push
# make sure files changed by dvc makes sense
git status 
git add -a -m “dvc modified”
git push

This seems pretty convoluted, so if there’s a better way, would love to know. If not, not looking a gift horse in the mouth

Question 3

Okay, a bonus - what about merging, say, a branch into main? Say, the signatures in the *.dvc files don’t match. Is it just a matter of always selecting the branch data over main in the conflicted .dvc files (unless something has gone really wrong)?

Thanks!

shcheklein · March 30, 2023, 1:49am

Cont of the discussion is here: This example in the docs seems a bit odd, and a basic question · iterative/dvc · Discussion #9263 · GitHub

Topic		Replies	Views
Cannot git commit data changes? Questions	3	14	June 9, 2025
"dvc status" confusion in concurrent use case Questions	3	441	August 7, 2022
First steps with DVC, a few questions Questions	2	68	September 20, 2024
What DVC does when git merge is executed? Questions	5	469	September 1, 2022
Tracking data and code dependencies Questions	4	2133	May 18, 2018

This example in the docs seems a bit odd, and a basic question

Question 1

Question 2

Question 3

Related topics