Compare the Top Data Science Software that integrates with GitHub as of June 2025

This a list of Data Science software that integrates with GitHub. Use the filters on the left to add additional filters for products that have integrations with GitHub. View the products that work with GitHub in the table below.

What is Data Science Software for GitHub?

Data science software is a collection of tools and platforms designed to facilitate the analysis, interpretation, and visualization of large datasets, helping data scientists derive insights and build predictive models. These tools support various data science processes, including data cleaning, statistical analysis, machine learning, deep learning, and data visualization. Common features of data science software include data manipulation, algorithm libraries, model training environments, and integration with big data solutions. Data science software is widely used across industries like finance, healthcare, marketing, and technology to improve decision-making, optimize processes, and predict trends. Compare and read user reviews of the best Data Science software for GitHub currently available using the table below. This list is updated regularly.

  • 1
    Saturn Cloud

    Saturn Cloud

    Saturn Cloud

    Saturn Cloud is an AI/ML platform available on every cloud. Data teams and engineers can build, scale, and deploy their AI/ML applications with any stack. Quickly spin up environments to test new ideas, then easily deploy them into production. Scale fast—from proof-of-concept to production-ready applications. Customers include NVIDIA, CFA Institute, Snowflake, Flatiron School, Nestle, and more. Get started for free at: saturncloud.io
    Leader badge
    Starting Price: $0.005 per GB per hour
  • 2
    Domo

    Domo

    Domo

    Domo puts data to work for everyone so they can multiply their impact on the business. Our cloud-native data experience platform goes beyond traditional business intelligence and analytics, making data visible and actionable with user-friendly dashboards and apps. Underpinned by a secure data foundation that connects with existing cloud and legacy systems, Domo helps companies optimize critical business processes at scale and in record time to spark the bold curiosity that powers exponential business results.
  • 3
    Gathr.ai

    Gathr.ai

    Gathr.ai

    Gathr is a Data+AI fabric, helping enterprises rapidly deliver production-ready data and AI products. Data+AI fabric enables teams to effortlessly acquire, process, and harness data, leverage AI services to generate intelligence, and build consumer applications— all with unparalleled speed, scale, and confidence. Gathr’s self-service, AI-assisted, and collaborative approach enables data and AI leaders to achieve massive productivity gains by empowering their existing teams to deliver more valuable work in less time. With complete ownership and control over data and AI, flexibility and agility to experiment and innovate on an ongoing basis, and proven reliable performance at real-world scale, Gathr allows them to confidently accelerate POVs to production. Additionally, Gathr supports both cloud and air-gapped deployments, making it the ideal choice for diverse enterprise needs. Gathr, recognized by leading analysts like Gartner and Forrester, is a go-to-partner for Fortune 500
    Leader badge
    Starting Price: $0.25/credit
  • 4
    Jupyter Notebook

    Jupyter Notebook

    Project Jupyter

    The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, data visualization, machine learning, and much more.
  • 5
    Domino Enterprise MLOps Platform
    The Domino platform helps data science teams improve the speed, quality, and impact of data science at scale. Domino is open and flexible, empowering professional data scientists to use their preferred tools and infrastructure. Data science models get into production fast and are kept operating at peak performance with integrated workflows. Domino also delivers the security, governance and compliance that enterprises expect. The Self-Service Infrastructure Portal makes data science teams become more productive with easy access to their preferred tools, scalable compute, and diverse data sets. The Integrated Model Factory includes a workbench, model and app deployment, and integrated monitoring to rapidly experiment, deploy the best models in production, ensure optimal performance, and collaborate across the end-to-end data science lifecycle. The System of Record allows teams to easily find, reuse, reproduce, and build on any data science work to amplify innovation.
  • 6
    Dataiku

    Dataiku

    Dataiku

    Dataiku is an advanced data science and machine learning platform designed to enable teams to build, deploy, and manage AI and analytics projects at scale. It empowers users, from data scientists to business analysts, to collaboratively create data pipelines, develop machine learning models, and prepare data using both visual and coding interfaces. Dataiku supports the entire AI lifecycle, offering tools for data preparation, model training, deployment, and monitoring. The platform also includes integrations for advanced capabilities like generative AI, helping organizations innovate and deploy AI solutions across industries.
  • 7
    Google Colab
    Google Colab is a free, hosted Jupyter Notebook service that provides cloud-based environments for machine learning, data science, and educational purposes. It offers no-setup, easy access to computational resources such as GPUs and TPUs, making it ideal for users working with data-intensive projects. Colab allows users to run Python code in an interactive, notebook-style environment, share and collaborate on projects, and access extensive pre-built resources for efficient experimentation and learning. Colab also now offers a Data Science Agent automating analysis, from understanding the data to delivering insights in a working Colab notebook (Sequences shortened. Results for illustrative purposes. Data Science Agent may make mistakes.)
  • 8
    Deepnote

    Deepnote

    Deepnote

    Deepnote is building the best data science notebook for teams. In the notebook, users can connect their data, explore, and analyze it with real-time collaboration and version control. Users can easily share project links with team collaborators, or with end-users to present polished assets. All of this is done through a powerful, browser-based UI that runs in the cloud. We built Deepnote because data scientists don't work alone. Features: - Sharing notebooks and projects via URL - Inviting others to view, comment and collaborate, with version control - Publishing notebooks with visualizations for presentations - Sharing datasets between projects - Set team permissions to decide who can edit vs view code - Full linux terminal access - Code completion - Automatic python package management - Importing from github - PostgreSQL DB connection
    Starting Price: Free
  • 9
    Coder

    Coder

    Coder

    Coder delivers self-hosted cloud development environments consistently provisioned as code and pre-deployed for developer activity on day one. Preferred by enterprises, Coder is open source and runs air-gapped on-premise or in your cloud, giving developers access to powerful infrastructure without compromising governance. Coder moves local development and source code to your centralized infrastructure where developers can access their remote environments via their favorite desktop or web-based IDE. This is the optimal intersection of improved developer experience, productivity, and security. Coder’s ephemeral development environments are provisioned as code from pre-defined templates, which means developers can create a new workspace with a click. Now, they can avoid days or weeks struggling with local dependency versioning conflicts or security approvals. With Coder, developers can onboard or shift between projects in minutes.
  • 10
    TrueFoundry

    TrueFoundry

    TrueFoundry

    TrueFoundry is a Cloud-native Machine Learning Training and Deployment PaaS on top of Kubernetes that enables Machine learning teams to train and Deploy models at the speed of Big Tech with 100% reliability and scalability - allowing them to save cost and release Models to production faster. We abstract out the Kubernetes for Data Scientists and enable them to operate in a way they are comfortable. It also allows teams to deploy and fine-tune large language models seamlessly with full security and cost optimization. TrueFoundry is open-ended, API Driven and integrates with the internal systems, deploys on a company's internal infrastructure and ensures complete Data Privacy and DevSecOps practices.
    Starting Price: $5 per month
  • 11
    MLJAR Studio
    It's a desktop app with Jupyter Notebook and Python built in, installed with just one click. It includes interactive code snippets and an AI assistant to make coding faster and easier, perfect for data science projects. We manually hand crafted over 100 interactive code recipes that you can use in your Data Science projects. Code recipes detect packages available in the current environment. Install needed modules with 1-click, literally. You can create and interact with all variables available in your Python session. Interactive recipes speed-up your work. AI Assistant has access to your current Python session, variables and modules. Broad context makes it smart. Our AI Assistant was designed to solve data problems with Python programming language. It can help you with plots, data loading, data wrangling, Machine Learning and more. Use AI to quickly solve issues with code, just click Fix button. The AI assistant will analyze the error and propose the solution.
    Starting Price: $20 per month
  • 12
    SAS Viya
    SAS® Viya® data science offerings provide a comprehensive, scalable analytics environment that's quick and easy to deploy, enabling you to meet diverse business needs. Automatically generated insights enable you to identify the most common variables across all models, the most important variables selected across models and assessment results for all models. Natural language generation capabilities are used to create project summaries written in plain language, enabling you to easily interpret reports. Analytics team members can add project notes to the insights report to facilitate communication and collaboration among team members. SAS lets you embed open source code within an analysis and call open source algorithms seamlessly within its environment. This facilitates collaboration across your organization because users can program in their language of choice. You can also take advantage of SAS Deep Learning with Python (DLPy), our open-source package on GitHub.
  • 13
    Hex

    Hex

    Hex

    Hex brings together the best of notebooks, BI, and docs into a seamless, collaborative UI. Hex is a modern Data Workspace. It makes it easy to connect to data, analyze it in collaborative SQL and Python-powered notebooks, and share work as interactive data apps and stories. Your default landing page in Hex is the Projects page. You can quickly find projects you created, as well as those shared with you and your workspace. The outline provides an easy-to-browse overview of all the cells in a project's Logic View. Every cell in the outline lists the variables it defines, and cells that return a displayed output (chart cells, Input Parameters, markdown cells, etc.) display a preview of that output. You can click any cell in the outline to automatically jump to that position in the logic.
    Starting Price: $24 per user per month
  • 14
    Zerve AI

    Zerve AI

    Zerve AI

    Merging the best of a notebook and an IDE into one integrated coding environment, experts can explore their data and write stable code at the same time with fully automated cloud infrastructure. Zerve’s data science development environment gives data science and ML teams a unified space to explore, collaborate, build, and deploy data science & AI projects like never before. Zerve offers true language interoperability, meaning that as well as being able to use Python, R, SQL, or Markdown all in the same canvas, users can connect these code blocks to each other. No more long-running code blocks or containers, with Zerve enjoying unlimited parallelization at any stage of the development journey. Analysis artifacts are automatically serialized, versioned, stored, and preserved for later use, meaning easily changing a step in the data flow without needing to rerun any preceding steps. Fine-grained selection of compute resources and extra memory for complex data transformation.
  • 15
    Vectice

    Vectice

    Vectice

    Enabling all enterprise’s AI/ML initiatives to result in consistent and positive impact. Data scientists deserve a solution that makes all their experiments reproducible, every asset discoverable and simplifies knowledge transfer. Managers deserve a dedicated data science solution. to secure knowledge, automate reporting and simplify reviews and processes. Vectice is on a mission to revolutionize the way data science teams work and collaborate. The goal is to ensure consistent and positive AI/ML impact for all organizations. Vectice is bringing the first automated knowledge solution that is both data science aware, actionable and compatible with the tools data scientists use. Vectice auto-captures all the assets that AI/ML teams create such as datasets, code, notebooks, models or runs. Then it auto-generates documentation from business requirements to production deployments.
  • 16
    Zepl

    Zepl

    Zepl

    Sync, search and manage all the work across your data science team. Zepl’s powerful search lets you discover and reuse models and code. Use Zepl’s enterprise collaboration platform to query data from Snowflake, Athena or Redshift and build your models in Python. Use pivoting and dynamic forms for enhanced interactions with your data using heatmap, radar, and Sankey charts. Zepl creates a new container every time you run your notebook, providing you with the same image each time you run your models. Invite team members to join a shared space and work together in real time or simply leave their comments on a notebook. Use fine-grained access controls to share your work. Allow others have read, edit, and run access as well as enable collaboration and distribution. All notebooks are auto-saved and versioned. You can name, manage and roll back all versions through an easy-to-use interface, and export seamlessly into Github.
  • Previous
  • You're on page 1
  • Next