Data integration platform for ELT pipelines from APIs, databases
Training data (data labeling, annotation, workflow) for all data types
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
A cross-platform installer for the Julia programming language
Main repository for Vispy
Pythonic tool for running machine-learning/high-performance workflows
A curated list of data mining papers about fraud detection
Clone with Python! Data structures for double-stranded DNA
Make your own running home page
A more accurate representation of Jupyter notebooks
Integrate multiple high-dimensional datasets with fuzzy k-means
A tool for semi-automatic cell type classification, harmonization
Survival analysis in Python
Concurrent Python made simple
A multi-cloud framework for big data analytics
Progress bars for threading and multiprocessing tasks in the terminal
The power of Chart.js with Python
Kubeflow’s superfood for Data Scientists
Great Expectations Airflow operator
Create HTML profiling reports from pandas DataFrame objects
Diagram generation for understanding codebases and system architecture
An interactive Formula 1 race visualisation and data analysis tool
Python ETL framework for stream processing, real-time analytics, LLM
Python Stream Processing
Spatial data processing for geomodeling