Showing 75 open source projects for "data science"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 1
    Kedro

    Kedro

    A Python framework for creating reproducible, maintainable code

    Kedro is an open sourced Python framework for creating maintainable and modular data science code. Provides the scaffolding to build more complex data and machine-learning pipelines. In addition, there's a focus on spending less time on the tedious "plumbing" required to maintain data science code; this means that you have more time to solve new problems. Standardises team workflows; the modular structure of Kedro facilitates a higher level of collaboration when teams solve problems together. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Nuclio

    Nuclio

    High-Performance Serverless event and data processing platform

    Nuclio is an open source and managed serverless platform used to minimize development and maintenance overhead and automate the deployment of data-science-based applications. Real-time performance running up to 400,000 function invocations per second. Portable across low laptops, edge, on-prem and multi-cloud deployments. The first serverless platform supporting GPUs for optimized utilization and sharing. Automated deployment to production in a few clicks from Jupyter notebook. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Apache Spark

    Apache Spark

    A unified analytics engine for large-scale data processing

    ...With Spark Streaming (microbatches) and Structured Streaming, it delivers low-latency event processing suitable for real-time analytics. The built-in MLlib library provides scalable machine learning algorithms, while GraphX enables graph computations integrated with data pipelines. Spark supports multiple languages—Scala, Java, Python, R—and connects with many storage systems like HDFS, S3, Cassandra, and streaming platforms like Kafka, making it a versatile choice for big data workloads in analytics, ETL, and data science.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 4
    GeoStats.jl

    GeoStats.jl

    An extensible framework for geospatial data science

    GeoStats.jl is a Julia framework for geospatial data science and geostatistical modeling. It’s fully implemented in Julia and designed to provide an extensible, high-performance stack that handles spatial domains, interpolation, simulation, learning, and visualization. The package is modular: it breaks out geometry, spatial domains, transforms, variograms, covariance models, and modeling into subpackages (e.g., GeoStatsBase, GeoStatsModels, GeoStatsTransforms).
    Downloads: 3 This Week
    Last Update:
    See Project
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 5
    gdbgui

    gdbgui

    Browser-based frontend to gdb (gnu debugger)

    Browser-based frontend to gdb (gnu debugger). Add breakpoints, view the stack, visualize data structures, and more in C, C++, Go, Rust, and Fortran. Run gdbgui from the terminal and a new tab will open in your browser. gdbgui is a browser-based frontend to gdb, the gnu debugger. You can add breakpoints, view stack traces, and more in C, C++, Go, and Rust! It's perfect for beginners and experts. Simply run gdbgui from the terminal to start the gdbgui server, and a new tab will open in your browser. gdbgui is used by thousands of developers around the world including engineers at Google and college computer science course instructions. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    Biosphere3D

    Biosphere3D

    Interactive landscape rendering based on a virtual globe.

    Biosphere3D targets interactive landscape rendering based on a virtual globe. It supports DEM, satellite and aerial images, 3D models (Collada), 3D plant models, and Shapefiles. Biosphere3D was initially developed by the landscape visualization group of the Zuse Institute Berlin by Malte Clasen and is now developed further by Lenné3D GmbH. For more information about the used concepts have a look at the thesis of Malte Clasen: Towards Interactive Landscape Visualization Doctoral...
    Downloads: 61 This Week
    Last Update:
    See Project
  • 7
    Hibernate

    Hibernate

    An object relational-mapping (ORM) library for Java

    The Hibernate projects offer a suite of powerful Java libraries to work with data. It is best known for Hibernate ORM, which provides relational persistence for Java models and is an implementation of the Jakarta Persistence specification. Hibernate projects do not consistently release binaries or documentation to SourceForge anymore. For up-to-date information, refer to the Hibernate website: * Hibernate ORM: https://hibernate.org/orm/ * Hibernate Validator:...
    Leader badge
    Downloads: 3,446 This Week
    Last Update:
    See Project
  • 8
    Grassroots DICOM

    Grassroots DICOM

    Cross-platform DICOM implementation

    Grassroots DiCoM is a C++ library for DICOM medical files. It is accessible from Python, C#, Java and PHP. It supports RAW, JPEG, JPEG 2000, JPEG-LS, RLE and deflated transfer syntax. It comes with a super fast scanner implementation to quickly scan hundreds of DICOM files. It supports SCU network operations (C-ECHO, C-FIND, C-STORE, C-MOVE). PS 3.3 & 3.6 are distributed as XML files. It also provides PS 3.15 certificates and password based mecanism to anonymize and de-identify DICOM datasets.
    Leader badge
    Downloads: 96 This Week
    Last Update:
    See Project
  • 9

    Pytente

    Uma Ferramenta Computacional para Análise e Recuperação de Patentes

    O Pytente é uma solução avançada para automatizar o processo de coleta, armazenamento e tratamento de dados bibliográficos de patentes. A ferramenta foi projetada para simplificar a coleta de grandes volumes de dados em repositórios de acesso aberto. O Pytente garante o armazenamento estruturado das informações, além da validação e eliminação de registros duplicados. Dentre as diversas funcionalidades disponibilizadas pela ferramenta, destacam-se a extração personalizada de subconjuntos de...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 10
    Fuzzy machine learning framework

    Fuzzy machine learning framework

    A library and a GUI front-end for fuzzy machine learning

    Fuzzy machine learning framework is a library and a GUI front-end for machine learning using intuitionistic fuzzy data. The approach is based on the intuitionistic fuzzy sets and the possibility theory. Further characteristics are fuzzy features and classes; numeric, enumeration features and features based on linguistic variables; user-defined features; derived and evaluated features; classifiers as features for building hierarchical systems; automatic refinement in case of dependent...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    stkpp

    stkpp

    C++ Statistical ToolKit

    STK++ (http://www.stkpp.org) is a versatile, fast, reliable and elegant collection of C++ classes for statistics, clustering, linear algebra, arrays (with an Eigen-like API), regression, dimension reduction, etc. Some functionalities provided by the library are available in the R environment as R functions (http://cran.at.r-project.org/web/packages/rtkore/index.html). At a convenience, we propose the source packages on sourceforge. The library offers a dense set of (mostly) template...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    ARWES

    ARWES

    Futuristic Sci-Fi UI Web Framework

    ...A node represents and interactive and stateful UI component with design, animations, and/or sound support. The nodes in a tree are interconnected with each other by inheriting data and communicating dynamically.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    UnionML

    UnionML

    Build and deploy machine learning microservices

    ...UnionML is an open-source Python framework built on top of Flyte™, unifying the complex ecosystem of ML tools into a single interface. Combine the tools that you love using a simple, standardized API so you can stop writing so much boilerplate and focus on what matters: the data and the models that learn from them. Fit the rich ecosystem of tools and frameworks into a common protocol for machine learning. Using industry-standard machine learning methods, implement endpoints for fetching data, training models, serving predictions (and much more) to write a complete ML stack in one place. Data science, ML engineering, and MLOps practitioners can all gather around UnionML apps as a way of defining a single source of truth about your ML system’s behavior. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    gophernotes

    gophernotes

    The Go kernel for Jupyter notebooks and nteract

    ...Use gophernotes to create and share documents that contain live Go code, equations, visualizations and explanatory text. These notebooks, with the live Go code, can then be shared with others via email, Dropbox, GitHub and the Jupyter Notebook Viewer. Go forth and do data science, or anything else interesting, with Go notebooks! This project utilizes a Go interpreter called gomacro under the hood to evaluate Go code interactively. The gophernotes logo was designed by the brilliant Marcus Olsson and was inspired by Renee French's original Go Gopher design. If you have the JUPYTER_PATH environmental variable set or if you are using an older version of Jupyter, you may need to copy this kernel config to another directory.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    bioweb

    bioweb

    polyglot language framework to analyze genetic data

    polyglot framework using Python/C++/JavaScript to fast develop applications to analyze biological sequences
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Happy Java Library

    Happy Java Library

    Multilock, Collections, Controllers, Delegates, Generators, Streams

    Helps to develop and test event-based multi-threaded Java application. Because of method called as API-Evolution the Happy Java Library is fully downward compatible. The library contains following functionality: MultiLock, Parallel loops, Collections, Controllers, Generators, Delegates, Streams.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    JPPF

    JPPF

    The open source grid computing solution

    JPPF makes it easy to parallelize computationally intensive tasks and execute them on a Grid.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Jillion
    Java bio-informatics library to analyze and convert genomic sequence and assembly data. This library was created and used by the J. Craig Venter Institute (JCVI)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    GeOxygene
    GeOxygene aims at providing an open framework which implements OGC/ISO specifications for the development and deployment of geographic (GIS) applications. It is a open source contribution of the COGIT lab. at the IGN (the French National Mapping Agency).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    JCLTP

    A Java Class Library for Text Processing

    JCLTP is a class library designed for processing text. JCLTP is free, open source and developed with the Java programming language. JCLTP is distributed under the GNU license. It incorporates several technologies that enable process information while applying AI techniques, in order to build predictive models for text classification. Through a flexible structure of interfaces and classes, the opportunity to extend, adapt and add functionality JCLTP is provided. Thus, analysis of new types...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    An open source workbench for chemo- and bioinformatics built on the Eclipse Rich Client Platform (RCP).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22

    Coherent Logic FRED Client

    Java client API for accessing the FRED web services

    ***** WARNING ***** The delivery of the 1.0.3-RELEASE of this framework will be the final source code commit that will be made on SourceForge; all further updates can be found at the following address: https://bitbucket.org/CoherentLogic/coherent-logic-fred-client/ ***** WARNING ***** The FRED Client is a Java-based internal domain-specific language (DSL) for accessing the Federal Reserve Economic Data (FRED) web services complements of the Federal Reserve Bank of St....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    Coherent Logic World Bank Client

    Java client API for accessing economic data from the World Bank

    ***** WARNING ***** The delivery of 1.0.2-RELEASE of this framework will be the final source code commit that will be made on SourceForge; all further updates can be found at the following address: https://bitbucket.org/CoherentLogic/coherent-logic-world-bank-client/ ***** WARNING ***** The World Bank Client is a Java-based internal domain-specific language (DSL) for accessing Economic Data from the World Bank web services complements of the World Bank. The API is designed to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Makumba

    Makumba

    Moved to GitHub: https://github.com/makumba

    Makumba helps you rapidly develop data driven web applications. Provides a custom JSP taglib as a main interface, but leaves API open for advanced access. It is implemented in Java. Moved to GitHub: https://github.com/makumba
    Downloads: 6 This Week
    Last Update:
    See Project
  • 25
    Mangrove TDS Library

    Mangrove TDS Library

    A C++ Tool for the Fast Prototyping of Topological Data Structures

    The Mangrove Topological Data Structure (Mangrove TDS) Library is a C++ template tool for the fast prototyping of topological data structures under the same API: - generic representation of any topological data structure, i.e., mangrove; - extensible through dynamic plugins at run-time; - agnostic wrt the representation in use; - any representation is supported without restrictions; - dimension-independent and multi-platform; - efficient representations of...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
Auth0 Logo