DataHub
The Metadata Platform for your Data and AI Stack
DataHub is an open-source metadata platform that helps organizations discover, understand, and trust their data assets at scale. It models data as a richly connected graph spanning datasets, dashboards, pipelines, ML features, and services, so users can explore relationships such as lineage and ownership across tools and domains. The platform focuses on continuous metadata ingestion from many sources, treating metadata as a stream that stays fresh as systems change. A modern web UI and search layer help analysts, engineers, and stewards find assets, read documentation, and see context such as usage, schema history, and downstream impacts. ...
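
To make the "connected graph" idea concrete, here is a minimal sketch of a metadata graph with a downstream-impact query. It is a toy model in plain Python, not DataHub's API: the `MetadataGraph` class, its methods, and the entity ids such as `dataset:raw_orders` are hypothetical names invented for illustration.

```python
from collections import deque


class MetadataGraph:
    """Toy in-memory metadata graph (hypothetical; not DataHub's data model)."""

    def __init__(self):
        self.entities = {}    # entity id -> {"kind": ..., "owner": ...}
        self.downstream = {}  # entity id -> set of direct downstream entity ids

    def add_entity(self, entity_id, kind, owner=None):
        """Register an asset such as a dataset, pipeline, or dashboard."""
        self.entities[entity_id] = {"kind": kind, "owner": owner}

    def add_lineage(self, upstream, downstream):
        """Record that `downstream` is derived from `upstream`."""
        for entity_id in (upstream, downstream):
            if entity_id not in self.entities:
                raise KeyError(f"unknown entity: {entity_id}")
        self.downstream.setdefault(upstream, set()).add(downstream)

    def downstream_impact(self, entity_id):
        """Return every asset reachable from `entity_id`, in breadth-first order."""
        seen = {entity_id}
        queue = deque([entity_id])
        impacted = []
        while queue:
            current = queue.popleft()
            # Sort neighbours so the traversal order is deterministic.
            for neighbour in sorted(self.downstream.get(current, ())):
                if neighbour not in seen:
                    seen.add(neighbour)
                    impacted.append(neighbour)
                    queue.append(neighbour)
        return impacted


# Illustrative assets and owners (all made up for this example).
graph = MetadataGraph()
graph.add_entity("dataset:raw_orders", kind="dataset", owner="data-eng")
graph.add_entity("pipeline:clean_orders", kind="pipeline", owner="data-eng")
graph.add_entity("dataset:orders_clean", kind="dataset", owner="analytics")
graph.add_entity("dashboard:revenue", kind="dashboard", owner="finance")

graph.add_lineage("dataset:raw_orders", "pipeline:clean_orders")
graph.add_lineage("pipeline:clean_orders", "dataset:orders_clean")
graph.add_lineage("dataset:orders_clean", "dashboard:revenue")

impacted = graph.downstream_impact("dataset:raw_orders")
owners = {graph.entities[e]["owner"] for e in impacted}
print(impacted)
print(sorted(owners))
```

The breadth-first traversal answers the "downstream impacts" question from the paragraph above: given a changed asset, it lists what could be affected, and the owner lookup shows how ownership metadata attaches to the same graph.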