Project structure for doing and sharing data science work
Training data (data labeling, annotation, workflow) for all data types
Train machine learning models within Docker containers
Open-source data observability for analytics engineers
Automatically find issues in image datasets
A cross-platform installer for the Julia programming language
The open-source tool for building high-quality datasets
Streamline your ML workflow
The open standard for data logging
A multi-cloud framework for big data analytics
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Parallel computing with task scheduling
Python ETL framework for stream processing, real-time analytics, LLM
Clone with Python! Data structures for double-stranded DNA
Spatial data processing for geomodeling
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Recap tracks and transforms schemas across your whole application
Make your own running home page
A more accurate representation of Jupyter notebooks
Light-weight, flexible, expressive statistical data testing library
An open-source multi-tool for exploring and publishing data
Mie scattering of light by perfect spheres
Metadata and data identification tool and Python library
A tool for semi-automatic cell type classification, harmonization
A Python package for interactive mapping and geospatial analysis