Great Expectations Airflow operator
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
The open-source tool for building high-quality datasets
A multi-cloud framework for big data analytics
Docker image used to run data processing workloads
The open standard for data logging
High-Performance Symbolic Regression in Python and Julia
Collaborative forensic timeline analysis
An open source multi-tool for exploring and publishing data
An orchestration platform for the development, production
Always know what to expect from your data
AI-data warehouse to enrich, transform and analyze unstructured data
Python Stream Processing
Build, run, and manage data pipelines for integrating data
Train machine learning models within Docker containers
A reactive notebook for Python
Python module that helps you build complex pipelines of batch jobs
WebGL-based viewer for volumetric data
AutoGluon: AutoML for Image, Text, and Tabular Data
Metadata and data identification tool and Python library
Production-ready data processing made easy and shareable
The toolkit to test, validate, and evaluate your models and surface
Open-source data observability for analytics engineers
Detecting silent model failure. NannyML estimates performance
A Python package for interactive mapping and geospatial analysis