Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Recap tracks and transform schemas across your whole application
Make your own running home page
TIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox
Training data (data labeling, annotation, workflow) for all data types
airda(Air Data Agent
Data science on data without acquiring a copy
A more accurate representation of jupyter notebooks
An open source multi-tool for exploring and publishing data
ETL framework to index data for AI, such as RAG
Exploratory analysis of Bayesian models with Julia
Production-ready data processing made easy and shareable
The toolkit to test, validate, and evaluate your models and surface
Open-source data observability for analytics engineers
Library providing end-to-end GPU-accelerated recommender systems
A package for Counterfactual Explanations and Algorithmic Recourse
Beta Machine Learning Toolkit
The open standard for data logging
Uncover insights, surface problems, monitor, and fine tune your LLM
Scalable and Flexible Gradient Boosting
Scale your Pandas workflows by changing a single line of code
Synthetic data generators for structured and unstructured text
Efficiently diff rows across two different databases
Project structure for doing and sharing data science work
Automatic extraction of relevant features from time series