Data integration platform for ELT pipelines from APIs, databases
Repository for the Astropy core package
CKAN is an open-source DMS for powering data hubs
Efficiently diff rows across two different databases
Project structure for doing and sharing data science work
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
An orchestration platform for the development, production
Python module that helps you build complex pipelines of batch jobs
High-Performance Symbolic Regression in Python and Julia
The toolkit to test, validate, and evaluate your models and surface
Making DAG construction easier
Training data (data labeling, annotation, workflow) for all data types
Streaming reactive and dataflow graphs in Python
re_data - fix data issues before your users & CEO would discover them
Data science on data without acquiring a copy
The power of Chart.js with Python
Open-source GCP metadata collector based on ODD Specification
Open-source metadata collector based on ODD Specification
AutoGluon: AutoML for Image, Text, and Tabular Data
Deep neural networks for density functional theory Hamiltonian
Create HTML profiling reports from pandas DataFrame objects
Recap tracks and transform schemas across your whole application
A cross-platform installer for the Julia programming language
The standard data-centric AI package for data quality and ML
Tool for visualizing and tracking your machine learning experiments