Data science on data without acquiring a copy
Fast, flexible and powerful Python data analysis toolkit
CKAN is an open-source DMS for powering data hubs
Machine learning in Python
Parallel computing with task scheduling
Python ETL framework for stream processing, real-time analytics, LLM
Uncover insights, surface problems, monitor, and fine tune your LLM
Dataset Management Framework, a Python library and a CLI tool to build
High-Performance Symbolic Regression in Python and Julia
The open-source tool for building high-quality datasets
Python module that helps you build complex pipelines of batch jobs
A reactive notebook for Python
WebGL-based viewer for volumetric data
A Python package for interactive geospaital analysis and visualization
Repository for the Astropy core package
Positron, a next-generation data science IDE
A Python toolbox for gaining geometric insights
An open source multi-tool for exploring and publishing data
3D plotting and mesh analysis through a streamlined interface
Orange: Interactive data analysis
Library providing end-to-end GPU-accelerated recommender systems
Training data (data labeling, annotation, workflow) for all data types
A Python package for interactive mapping and geospatial analysis
Docker image used to run data processing workloads
Detection tools for the June 2026 atomic-lockfile AUR supply-chain