TIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox
The standard data-centric AI package for data quality and ML
Create HTML profiling reports from pandas DataFrame objects
airda(Air Data Agent
Detecting silent model failure. NannyML estimates performance
Data science on data without acquiring a copy
A more accurate representation of jupyter notebooks
The open-source tool for building high-quality datasets
ETL framework to index data for AI, such as RAG
Python implementation of global optimization with gaussian processes
Training data (data labeling, annotation, workflow) for all data types
Scalable and Flexible Gradient Boosting
Production-ready data processing made easy and shareable
Beautiful and flexible vizualizations of high dimensional data
The toolkit to test, validate, and evaluate your models and surface
Open-source data observability for analytics engineers
Library providing end-to-end GPU-accelerated recommender systems
Performance Software for Cyclists, Runners, Triathletes and Coaches
Beta Machine Learning Toolkit
The open standard for data logging
Scale your Pandas workflows by changing a single line of code
Synthetic data generators for structured and unstructured text
Efficiently diff rows across two different databases
Project structure for doing and sharing data science work
Automatic extraction of relevant features from time series