Browse free, open source data science tools and projects for Linux below.
RStudio is an integrated development environment (IDE) for R
Data science spreadsheet with Python & SQL
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
Vector database for scalable similarity search and AI applications
An implementation of the Grammar of Graphics in R
Survival analysis in Python
A framework for real-life data science
The Go kernel for Jupyter notebooks and nteract
Detecting silent model failure. NannyML estimates performance
Data science on data without acquiring a copy
Solutions and Notes for Labs of Computer Systems
Project structure for doing and sharing data science work
High-Performance Serverless event and data processing platform
Train machine learning models within Docker containers
Scalable and Flexible Gradient Boosting
Streamline your ML workflow
Always know what to expect from your data
Lifetime value in Python
Positron, a next-generation data science IDE
Parallel computing with task scheduling
Library providing end-to-end GPU-accelerated recommender systems
Course materials for the Data Science Specialization on Coursera
A reactive notebook for Python
Automatic extraction of relevant features from time series
A data science IDE for Python