DVC (Data Version Control) is an open-source tool that brings version control principles to machine learning and data science workflows. It lets developers and data scientists track datasets, machine learning models, and experiment results alongside code in existing Git repositories. Instead of storing large datasets directly in Git, DVC keeps lightweight metadata files in the repository and stores the actual data in external storage. Teams can therefore manage large files efficiently while keeping a clear history of changes to data and models. DVC also provides a pipeline system that defines the stages of a machine learning workflow, which makes experiments reproducible and easier to manage. Because it tracks dependencies between code, data, and parameters, only the stages affected by a change are re-run. Its experiment tracking capabilities let users compare different training runs.
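
The idea of keeping lightweight metadata in the repository while the data lives elsewhere can be illustrated with a small conceptual sketch. This is not DVC's implementation or file format: the function names `track` and `restore`, the `.pointer.json` suffix, the SHA-256 hash, and the directory layout of the store are all assumptions made for illustration.

```python
import hashlib
import json
import shutil
from pathlib import Path


def file_sha256(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large datasets need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def track(data_file: Path, store_dir: Path) -> Path:
    """Copy a file into a content-addressed store and write a small pointer file.

    Only the pointer file would be committed to Git (hypothetical layout).
    """
    checksum = file_sha256(data_file)
    stored = store_dir / checksum[:2] / checksum[2:]
    stored.parent.mkdir(parents=True, exist_ok=True)
    if not stored.exists():  # identical content is stored only once
        shutil.copy2(data_file, stored)
    pointer = data_file.with_name(data_file.name + ".pointer.json")
    meta = {
        "path": data_file.name,
        "sha256": checksum,
        "size": data_file.stat().st_size,
    }
    pointer.write_text(json.dumps(meta, indent=2))
    return pointer


def restore(pointer: Path, store_dir: Path) -> Path:
    """Recreate the data file next to its pointer from the store."""
    meta = json.loads(pointer.read_text())
    checksum = meta["sha256"]
    target = pointer.with_name(meta["path"])
    shutil.copy2(store_dir / checksum[:2] / checksum[2:], target)
    return target
```

In this sketch the pointer file is a few lines of JSON regardless of how large the dataset is, which is why it can sit in a Git repository while the store directory stands in for external storage.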

Features

  • Version control for datasets, machine learning models, and experiment artifacts
  • Git-compatible workflow that tracks data changes through metadata files
  • Pipeline system for defining reproducible machine learning workflows
  • Experiment tracking with metrics comparison and visualization
  • Support for external storage backends for large datasets
  • Ability to reproduce and share experiments across environments
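
The pipeline behaviour described above, where only stages affected by a change are re-run, can be sketched conceptually as follows. This is an illustration under stated assumptions rather than DVC's actual API: the `Stage` class, the `reproduce` function, the in-memory `workspace` and `lock` dictionaries, and the requirement that stages are listed in dependency order are all hypothetical.

```python
import hashlib
import json
from dataclasses import dataclass
from typing import Callable, Dict, List


def fingerprint(value: object) -> str:
    """Stable hash of a JSON-serializable dependency value."""
    encoded = json.dumps(value, sort_keys=True).encode("utf-8")
    return hashlib.sha256(encoded).hexdigest()


@dataclass
class Stage:
    name: str
    deps: List[str]             # keys in the workspace this stage reads
    out: str                    # key in the workspace this stage writes
    run: Callable[..., object]  # receives the dependency values in order


def reproduce(
    stages: List[Stage],
    workspace: Dict[str, object],
    lock: Dict[str, Dict[str, str]],
) -> List[str]:
    """Run stages in the given order, skipping those whose inputs are unchanged.

    Returns the names of the stages that were actually executed.
    """
    executed = []
    for stage in stages:
        current = {dep: fingerprint(workspace[dep]) for dep in stage.deps}
        if lock.get(stage.name) == current and stage.out in workspace:
            continue  # same inputs as the recorded run, so reuse the output
        workspace[stage.out] = stage.run(*(workspace[d] for d in stage.deps))
        lock[stage.name] = current
        executed.append(stage.name)
    return executed
```

With a hypothetical two-stage pipeline (`prepare` feeding `train`), changing only a training parameter would cause `reproduce` to return `["train"]`, while changing the raw data would re-run both stages.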

License

Apache License V2.0

Additional Project Details

Programming Language

Python

Related Categories

Python Artificial Intelligence Software

Registered

2026-03-12