Browse free open source Data Science tools and projects for Linux below. Use the toggles on the left to filter open source Data Science tools by OS, license, natural language, programming language, and project status.
An implementation of the Grammar of Graphics in R
RStudio is an integrated development environment (IDE) for R
Best practices on recommendation systems
A data science IDE for Python
Data science spreadsheet with Python & SQL
Scalable and Flexible Gradient Boosting
Always know what to expect from your data
An AI-powered data science team of agents
Course materials for the Data Science Specialization on Coursera
Vector database for scalable similarity search and AI applications
A reactive notebook for Python
A framework for real-life data science
The data science OS
Simple and distributed Machine Learning
Linux for content creation, web scraping, coding, and data analysis
Graphical User Interface Toolkit for Python with minimal dependencies
Adhoc Data Exploration - Live & Easy
SADSA (Software Application for Data Science and Analytics)
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
For building machine learning (ML) workflows and pipelines on AWS
Jupyter notebooks that demonstrate how to build models using SageMaker
A curated list of data mining papers about fraud detection
Streamline your ML workflow
Project structure for doing and sharing data science work