Spatial data processing for geomodeling
Monitor the stability of a Pandas or Spark DataFrame
Pythonic tool for running machine-learning/high-performance workflows
Dataset Management Framework, a Python library and a CLI tool to build
Visualize and compare datasets, target values and associations
Main repository for VisPy
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Integrate multiple high-dimensional datasets with fuzzy k-means
TensorBoard for PyTorch (and Chainer, MXNet, NumPy, etc.)
Materials and IPython notebooks for "Python for Data Analysis"
A tool for semi-automatic cell type classification, harmonization
Positron, a next-generation data science IDE
Training data (data labeling, annotation, workflow) for all data types
Detection tools for the June 2026 atomic-lockfile AUR supply-chain
AI-data warehouse to enrich, transform and analyze unstructured data
The power of Chart.js with Python
Kubeflow’s superfood for Data Scientists
Repository for the Astropy core package
Benchmarking synthetic data generation methods
Python implementation of global optimization with Gaussian processes
The open standard for data logging
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Burp Suite extension for JavaScript static analysis
An AI-powered data science team of agents
Clean Jupyter notebooks of outputs, metadata, and empty cells