Synthetic data generators for structured and unstructured text
The open-source tool for building high-quality datasets
Efficiently diff rows across two different databases
Project structure for doing and sharing data science work
Automatic extraction of relevant features from time series
Performance Software for Cyclists, Runners, Triathletes and Coaches
The standard data-centric AI package for data quality and ML
Create HTML profiling reports from pandas DataFrame objects
Train machine learning models within Docker containers
Exploratory analysis of Bayesian models with Julia
Privacy and Security focused Segment-alternative, in Golang
RStudio is an integrated development environment (IDE) for R
Scalable and Flexible Gradient Boosting
Code review for data in dbt
Benchmarking synthetic data generation methods
Optimal transport algorithms for Julia
A data visualization and analytics component
Julia interface to Sundials, including a nonlinear solver
OpenCL Julia bindings
A distributed and extensible workflow scheduler platform
A package for Counterfactual Explanations and Algorithmic Recourse
An optimized graphs package for the Julia programming language
Algorithms from circuit theory to predict connectivity
High accuracy derivatives, estimated via numerical finite differences
In-memory tabular data in Julia