What's causing my pre-commit hook to write so much data?

Still going in the same pattern after 36 hours, so I just ctrl-c’d it and nothing unusual seems to have happened - all of the writes seem to have been in the cache directory, and the data had already been pushed to the remote, so worst case that can be recreated. I disabled the hook and committed all the .dvc files to git, so at least now there’s a known snapshot of the data in the dvc remote and of the hashes in git.

I’ve started a separate thread asking about the underlying goal I was trying to achieve with this setup - would be interested if you had any thoughts on that, @shcheklein?