Browse free open source Data Science tools and projects for Linux below.
An implementation of the Grammar of Graphics in R
RStudio is an integrated development environment (IDE) for R
Best practices on recommendation systems
An AI-powered data science team of agents
Data science spreadsheet with Python & SQL
Always know what to expect from your data
A data science IDE for Python
Course materials for the Data Science Specialization on Coursera
Scalable and Flexible Gradient Boosting
A reactive notebook for Python
Vector database for scalable similarity search and AI applications
The data science OS
A framework for real-life data science
Graphical User Interface Toolkit for Python with minimal dependencies
High-Performance Serverless event and data processing platform
Positron, a next-generation data science IDE
Simple and distributed Machine Learning
Linux for content creation, web scraping, coding, and data analysis
Adhoc Data Exploration - Live & Easy
SADSA (Software Application for Data Science and Analytics)
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
For building machine learning (ML) workflows and pipelines on AWS
Jupyter notebooks that demonstrate how to build models using SageMaker
A curated list of data mining papers about fraud detection