Browse free open source Data Science tools and projects for Linux below.
RStudio is an integrated development environment (IDE) for R
An implementation of the Grammar of Graphics in R
Data science spreadsheet with Python & SQL
Scalable and Flexible Gradient Boosting
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
Positron, a next-generation data science IDE
Solutions and Notes for Labs of Computer Systems
Vector database for scalable similarity search and AI applications
A reactive notebook for Python
A data science IDE for Python
Parallel computing with task scheduling
Course materials for the Data Science Specialization on Coursera
Graphical User Interface Toolkit for Python with minimal dependencies
Always know what to expect from your data
A fast CSV command line toolkit written in Rust
The data science OS
Linux for content creation, web scraping, coding, and data analysis.
SADSA (Software Application for Data Science and Analytics)
MCPower — simple Monte Carlo power analysis for complex models
An AI-powered data science team of agents
For building machine learning (ML) workflows and pipelines on AWS
Adhoc Data Exploration - Live & Easy
Jupyter notebooks that demonstrate how to build models using SageMaker
A curated list of data mining papers about fraud detection