Data Contract CLI is an open-source command-line tool and Python library for creating, validating, testing, importing, exporting, and enforcing data contracts. It uses YAML-based contract files to define the structure, meaning, quality rules, service levels, and connection details for a data product. The tool can connect to real data sources and check whether the actual dataset matches the schema, constraints, and quality expectations described in the contract. It supports both the Data Contract Specification and the Open Data Contract Standard, making it useful for teams standardizing data governance across different platforms. It can run locally, inside CI/CD pipelines, through Docker, or directly from Python code. Overall, it helps data producers and consumers treat data products more like APIs, with explicit expectations, automated checks, and clearer accountability.
Features
- Data contract linting and validation
- Schema and quality testing
- Breaking change detection
- Import and export across formats
- CI/CD and Docker workflow support
- Python library and CLI usage