It is obvious that the structure of the docs needs some revising, but it is not so clear what the best structure would be. I would try these things:
-
Create a top level section for “Contributor Guide” or “Developer Guide” and move there the sections:
- /user-guide/development
- /user-guide/contributing
- /user-guide/contributing-documentation
However the section “How to report a problem” should still be on the User Guide.
Later, more relevant sections may be added to the Developer Guide, for example to explain the structure/architecture of the code, maybe to explain design design decisions and the reasons behind them (pointing to the relevant discussion threads, etc.)
I am not sure yet whether DVC is extensible (whether one can develop plugins or extensions for it), but if yes, then this would be the right place to explain how this can be done. -
Move “Use Cases” inside the “User Guide”. I think that these “use cases” are actually useful for helping or guiding the user on how to use properly DVC, so they should a part of the User Guide. If the term “use case” sounds a bit technical (and it is actually a bit overloaded with other meanings), they may be called “Configuration Examples” or “Usage Examples”. But the term “Use Cases” is fine too.
I think that use cases are very helpful for the user because they reduce the amount of the possibilities and options that he has to consider in order to setup his workspace properly. So, we should consider adding a few more use cases which hopefully should cover all the possible ways that DVC can be setup and used. It is OK if there is some small overlap among the use cases (for example a particular situation can be covered by more than one use case).
If we have a comprehensive set of use cases, then for each query by the users about how to best setup DVC, we should be able to refer him to a use case that matches his situation. If we have to explain him the details of what he should do, then we should consider revising our use cases or adding a new one.
-
Create a separate section for the tutorials and move there the examples at the of “Get Started”. Move inside it the “Tutorial” a well. Actually there is some overlap among the tutorials and some parts of them need to be updated (because the code does not work as expected). So, they need to be cleaned up and updated. Other tutorials may be added by adopting blog posts about DVC or adapting blog posts about DS/ML in general.
Tutorials describe the solution of a particular problem (from the point of view of how to use DVC). They are practical (hands-on) and if the user follows the instructions correctly, everything should work. This is a bit different from use cases and user guides, which may have detailed explanations, but not actionable examples (they may have example commands, but you cannot copy-paste them and expect them to work).
User Guide and the Command Reference should not try to include actionable examples inside their pages. Rather they should refer to one or more tutorials that make use of that command or that uses the concepts that are being explained. This may also be an indicator of how good the set of tutorials is. If all the command pages and user guide pages can refer to some tutorial that uses the command or the concept, then everything is fine. If there is some command or concept that is not cover by the existing tutorials, then we should consider extending some of the tutorials or adding some new ones.
-
“Installation” should be a top level and separate section, placed before the “Tutorials”.
By the way, “Tutorials” should be placed before the “Use Guide”, “Developer Guide” and “Command Reference”.
Also, “Understanding DVC” should be better renamed to “About DVC” (and it is already connected to the front page, I think).
-
The “Get Started” part should be simplified further from some detailed/advanced explanations and from links/references to more advanced sections. Actually, it can also be replaced by a set of Katacoda interactive tutorials, which should be very basic, hands-on and learn-by-doing style (https://github.com/iterative/dvc.org/issues/546).
Their aim should be just to give a feeling to a new user about what DVC is and what it does, about its main features etc. without going to much details. If the user gets interested/excited, the next step would be to install DVC and to start with the tutorials.
-
The sections inside the “User Guide” need to be restructured, rearranged or expanded. For example see these discussions:
- https://github.com/iterative/dvc.org/issues/563#issuecomment-522171263
- https://github.com/iterative/dvc.org/issues/566
- https://github.com/iterative/dvc.org/issues/53
- https://github.com/iterative/dvc.org/issues/144
- https://github.com/iterative/dvc.org/issues/143
There seems to be a lot of work that can be done here, but one thing that seems obvious to me right now is that maybe the sections “external-dependencies” and “external-outputs” should be merged into the same section (called for example “external-data”) since they are closely related. This may help to make the explanations a bit more consistent. This page may also list some of the different scenarios/use-cases where external data is used, and link to the relevant use-case pages that has mode detailed explanations for each use-case.
One thing that I would like to note is that the Waterfall process does not seem to work well with the docs too (same as with software development and DS/ML development). So, we should not aim for a perfect plan and execution, but let’s start by doing and experimenting, and then correcting and fixing and reverting (if necessary), until the docs seem to be in a good shape.
During my GSoD period I may not be able to finish all of these, but I may try to do as much as possible. I would like to try them in this order:
- Build a set of basic interactive tutorials with Katacoda.
- Separate the installation page and simplify the Get Started tutorial.
- Fix the general structure of the docs (merge Use Cases with the User Guide, separate Contributor Guide from User Guide, make a section for the Tutorials, etc.)
- Work on cleaning up and improving the “external-data” of the User Guide and its related use-cases.
- Work on deduplicating and updating the existing tutorials.
- Add more use-cases and tutorials.
- Update the pages of the commands to refer to the appropriate tutorials (or interactive tutorials in Katacoda).
8 . Try to describe and integrate “DVC packages” on the docs.
Please let me know if you have any comments or suggestions.