Showing 729 open source projects for "data science"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • Save Up to 91% on Cloud Compute With Spot VMs Icon
    Save Up to 91% on Cloud Compute With Spot VMs

    Automatic sustained-use discounts. One free VM per month. No negotiation needed.

    Run batch jobs at 60-91% off with Spot VMs. Long-running workloads get automatic discounts with sustained use.
    Try Free
  • 1
    JSONVisor

    JSONVisor

    Visor for mass spectrometry JSON files generated by Integrator

    Visor for mass spectrometry JSON files generated by the Integrator program (Python 3 version). For more information, you can have a look at the README.md file in the source code tree: https://sourceforge.net/p/lp-csic-uab/jsonvisor3/code/ci/default/tree/README.md - Gallardo, Ó., Ovelleiro, D., Gay, M., Carrascal, M., & Abian, J. (2014). A collection of open source applications for mass spectrometry data mining. PROTEOMICS, 14(20), 2275–2279. https://doi.org/10.1002/pmic.201400124
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    ML workspace

    ML workspace

    All-in-one web-based IDE specialized for machine learning

    All-in-one web-based development environment for machine learning. The ML workspace is an all-in-one web-based IDE specialized for machine learning and data science. It is simple to deploy and gets you started within minutes to productively built ML solutions on your own machines. This workspace is the ultimate tool for developers preloaded with a variety of popular data science libraries (e.g., Tensorflow, PyTorch, Keras, Sklearn) and dev tools (e.g., Jupyter, VS Code, Tensorboard) perfectly configured, optimized, and integrated. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Data Science Notes

    Data Science Notes

    Curated collection of data science learning materials

    Data Science Notes is a large, curated collection of data science learning materials, with explanations, code snippets, and structured notes across the typical end-to-end workflow. It spans foundational math and statistics through data wrangling, visualization, machine learning, and practical project organization. The content emphasizes hands-on understanding by pairing narrative notes with runnable examples, making it useful for both self-study and classroom settings. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    pydatascope

    Software oscilloscope using Python and tkinter

    Software oscilloscope using Python and tkinter. Supports multiple sources: socket, file, audio, USB. Displays data by samples, time or frequency. Scales the input automatically or manually.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    earthengine-py-notebooks

    earthengine-py-notebooks

    A collection of 360+ Jupyter Python notebook examples

    ...Users can quickly adapt the examples for their own remote sensing, environmental monitoring, or spatial data science projects, and can run the code in environments like Google Colab.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    PS-Drone

    PS-Drone

    Programming a Parrot AR.Drone 2.0 with Python - The Easy Way

    The PS-Drone-API is a full featured SDK, written in and for Python, for Parrot's AR.Drone 2.0. It was designed to be easy to learn, but it offers the full set of the possibilities of the AR.Drone 2.0, including Sensor-Data (aka NavData), Configuration and full Video-support. The video function is not restricted to mere viewing, it is also possible to analyze video images data using OpenCV2. Obviously, the PS-Drone is perfect for teaching purposes; however, even the requirements for...
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Software, information, data sets and documentation for the Web as Corpus community.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    ccplot

    CloudSat and CALIPSO plotting tool

    ccplot is an open source command-line program for plotting profile, layer and earth view data sets from CloudSat, CALIPSO and Aqua MODIS products.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    Tools for analysis of multiwavelength anomalous diffraction data collected on synchrotron beamlines, more specifically developed for grazing incidence diffraction on nanostructures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Host LLMs in Production With On-Demand GPUs Icon
    Host LLMs in Production With On-Demand GPUs

    NVIDIA L4 GPUs. 5-second cold starts. Scale to zero when idle.

    Deploy your model, get an endpoint, pay only for compute time. No GPU provisioning or infrastructure management required.
    Try Free
  • 10

    OpenVCT

    The Open-Source Virtual Clinical Trial Project

    The OpenVCT project is designed to provide a common platform for performing Virtual Clinical Trials of medical imaging. OpenVCT provides tools for simulation of patient accrual and reader studies of medical imaging devices, simulating by simulating cohorts of patients and readers. OpenVCT uses common data standards, such as DICOM, to ensure inter-compatibility.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    --- IMPORTANT : This project has been moved to GitHub at https://github.com/clstoulouse/motu-client-python. Download the last version from the release page https://github.com/clstoulouse/motu-client-python/releases. --- Motu is a high efficient and robust Web Server which fills the gap between heterogeneous Data Providers to End Users. Motu handles, extracts and transforms oceanographic huge volumes of data without performance collapse. This client enables to extract and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    FusionCatcher

    Somatic fusion-genes finder for RNA-seq data

    FusionCatcher searches for novel/known somatic fusion genes, translocations, and chimeras in RNA-seq data (paired-end reads from Illumina NGS platforms like Solexa and HiSeq) from diseased samples. The aims of FusionCatcher are: - very good detection rate for finding candidate fusion genes, - very easy to use (i.e. no a priori knowledge of databases and bioinformatics is needed in order to run FusionCatcher), - very good detection of challenging fusion genes, like for example IGH...
    Leader badge
    Downloads: 79 This Week
    Last Update:
    See Project
  • 13
    PyArmadillo

    PyArmadillo

    linear algebra library for Python

    PyArmadillo - streamlined linear algebra library for Python, with emphasis on ease of use. Alternative to NumPy / SciPy. * Main page: https://pyarma.sourceforge.io * Documentation: https://pyarma.sourceforge.io/docs.html * Bug reports: https://pyarma.sourceforge.io/faq.html * Git repo: https://gitlab.com/jason-rumengan/pyarma
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Gato (Graph Animation Toolbox): Animate graph algorithms for example for computing shortest paths, minimal spanning trees, maximum flows or maximal cardinality or weight matchings. Create your own animations using the Animated Data Structures (ADS).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    OpenDAFF

    OpenDAFF

    Directional Audio File Format

    OpenDAFF is a free, open-source software package for directional audio data - like the directivity of microphones, speakers, as well as head-related transfer functions (HRTFs)
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    OpenFrames

    OpenFrames

    Real-time interactive 3D graphics API for scientific simulations

    OpenFrames has moved its primary development repository to GitHub! Everything else will follow. Get it at https://github.com/ravidavi/OpenFrames/wiki OpenFrames is an Application Programming Interface (API) that allows developers to provides the ability to add interactive 3D graphics to any scientific simulation. A simulation developer can use OpenFrames to specify what they want to visualize, without having to know any details of computer graphics programming. OpenFrames is currently...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    bioweb

    bioweb

    polyglot language framework to analyze genetic data

    polyglot framework using Python/C++/JavaScript to fast develop applications to analyze biological sequences
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    dsam
    The Development System for Auditory Modelling (DSAM) is a computational library designed specifically for producing simulations of the auditory system. It brings together many established auditory models within a flexible programming platform.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Tashkeela processed

    Tashkeela processed

    Tashkeela dataset cleaned and normalized.

    A version of the Tashkeela Arabic diacritized text dataset cleaned from the non-Arabic content and the undiacritized text, then divided into training, development, and testing sets. The cleaning process includes removing the XML tags and strange symbols, as well as fixing diacritics errors. After that, the tokenization is performed while focusing on the extraction of the Arabic words. The result is a space-separated tokens file, where the words and the numbers are separated, but not the...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    D0rganizer

    D0rganizer

    Program to manage data and information

    Store collected information and data properly using D0rgnizer (The organizer). Create cases, folders, files, and insert images. Compare information between files, and export data to share with your team. Useful for data collection in investigations, ethical hacking, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    GPlates

    GPlates

    Interactive visualization of plate tectonics.

    GPlates is a plate-tectonics program. Manipulate reconstructions of geological and paleo-geographic features through geological time. Interactively visualize vector, raster and volume data. PyGPlates is the GPlates Python library. Get fine-grained access to GPlates functionality in your Python scripts.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 22
    Pretty Damn Quick (PDQ) analytically solves queueing network models of computer and manufacturing systems, data networks, etc., written in conventional programming languages. Generic or customized reports of predicted performance measures are output.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    Forecasting Best Practices

    Forecasting Best Practices

    Time Series Forecasting Best Practices & Examples

    Time series forecasting is one of the most important topics in data science. Almost every business needs to predict the future in order to make better decisions and allocate resources more effectively. This repository provides examples and best practice guidelines for building forecasting solutions. The goal of this repository is to build a comprehensive set of tools and examples that leverage recent advances in forecasting algorithms to build solutions and operationalize them. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    ebfformat

    ebfformat

    An Efficient Binary data Format

    EBF, which stands for Efficient Binary Format, is a binary file format for reading and writing binary data easily. Reading writing routines are currently available in C,C++,Fortran,Java, Python, IDL, MATLAB. A program called ebftkpy which has a set of utility functions to work with the .ebf files , e.g., viewing the contents and getting a summary, is also provided. The EBF specification is designed to be concise and easy to understand to make it easier for others to write their own code...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    LIFETIMES

    LIFETIMES

    Lifetime value in Python

    LIFETIMES is a Python library for customer lifetime value and repeat purchase behavior modeling. It helps analysts estimate how frequently customers may return, how long they may remain active, and how much value they may generate over time. The library is built around probabilistic models commonly used in customer analytics, including transaction frequency and monetary value modeling. It is useful for ecommerce, subscription-adjacent businesses, retail analytics, and retention analysis. The...
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo