Showing 176 open source projects for "data"

  • 1
    Zipkin

    Distributed tracing system to gather timing data

    Zipkin is a distributed tracing system. It helps gather timing data needed to troubleshoot latency problems in service architectures. Features include both the collection and lookup of this data. If you have a trace ID in a log file, you can jump directly to it. Otherwise, you can query based on attributes such as service, operation name, tags and duration. Some interesting data will be summarized for you, such as the percentage of time spent in a service, and whether or not operations failed. ...
    Downloads: 26 This Week
  • 2
    Alluxio

    Open Source Data Orchestration for the Cloud

    Alluxio is the world’s first open source data orchestration technology for analytics and AI for the cloud. It bridges the gap between computation frameworks and storage systems, bringing data from the storage tier closer to the data-driven applications. This enables applications to connect to numerous storage systems through a common interface. It makes data local, more accessible, and as elastic as compute.
    Downloads: 1 This Week
  • 3
    JupyterLab

    JupyterLab computational environment

    ...Documents and activities integrate with each other, enabling new workflows for interactive computing. JupyterLab also offers a unified model for viewing and handling data formats. JupyterLab understands many file formats (images, CSV, JSON, Markdown, PDF, Vega, Vega-Lite, etc.) and can also display rich kernel output in these formats. See File and Output Formats for more information. To navigate the user interface, JupyterLab offers customizable keyboard shortcuts and the ability to use key maps from vim, emacs, and Sublime Text in the text editor.
    Downloads: 115 This Week
  • 4
    Kraken

    P2P Docker registry capable of distributing TBs of data in seconds

    ...Kraken is capable of distributing Docker images at more than 50% of the max download speed limit on every host. Cluster size and image size do not have a significant impact on download speed. It supports uploader authentication and data integrity protection through TLS. Other than pluggable storage, Kraken only has an optional dependency on DNS.
    Downloads: 1 This Week
  • 5
    Software to support the development of distributed data acquisition and analysis systems, used primarily in the implementation of instrumentation for pulsar astronomy.
    Downloads: 0 This Week
  • 6
    HPCC Systems

    End-to-end big data in a massively scalable supercomputing platform.

    HPCC Systems® (www.hpccsystems.com) from LexisNexis® Risk Solutions is a proven, open source solution for Big Data insights that can be implemented by businesses of all sizes. With HPCC Systems, developers can design applications with Big Data at their core, enabling businesses to better analyze and understand data at scale, improving business time to results and decisions. HPCC Systems offers a consistent data-centric programming language, two processing platforms and a single, complete end-to-end architecture for efficient processing. ...
    Downloads: 2 This Week
  • 7
    UNICORE

    UNiform Interface to COmputing and data REsources

    UNICORE is a software suite for building federated systems, providing secure and seamless access to heterogeneous resources such as compute clusters and file systems. UNICORE deals with authentication, user mapping and authorization, and provides a comprehensive set of RESTful APIs for HPC access and workflows. Contributors: visit https://github.com/UNICORE-EU
    Downloads: 9 This Week
  • 8
    DelphiWebStart
    DelphiWebStart (DWS) is an Application Loader with TCP Sockets based on a ThinClient first spread over the Web, VPN or Intranet, so that a user can download data (exes, maps, files) from a list and start it. DWS supports OpenSSL including a PKI HTTPS-Server.
    Downloads: 61 This Week
  • 9
    Java Multiprocessing

    Java explicit multiprocessing, SSI and cluster management tool

    ... ▪ JMP also contains a simple cluster management / load balancing tool called JMP node. ▪ JMP = SSI + load balancing + multiprocessing API. Two of the most obvious JMP use cases are: (a) all kinds of data parallelism (exhaustive search algorithms, optimization problems); (b) server-side load balancing between available CPUs. See the project Wiki for more.
    Downloads: 135 This Week
  • 10
    The OpenHMS Project
    OpenHMS brands a collection of projects and many subprojects developed by Health Market Science. This is a robust collection of development libraries, APIs, and tools oriented around data manipulation and professional software development.
    Downloads: 0 This Week
  • 11
    openQRM - Cloud Computing Platform

    Get latest Edition here: http://www.openqrm-enterprise.com.

    openQRM is a web-based open source datacenter management and hybrid cloud computing platform that integrates flexibly with existing components in enterprise data centers. Releases are available at https://github.com/openQRM/openqrm-community. Please visit http://www.openqrm-enterprise.com for the latest free 5.3 Community Release, which includes important security updates, bug fixes, and enhancements, especially for the KVM and Cloud plugins. openQRM abstracts virtualization and supports the following virtualization technologies: KVM, Xen, Citrix XenServer, VMware ESX, and LXC. ...
    Downloads: 0 This Week
  • 12
    TurboVNC

    High-speed, 3D-friendly, TightVNC-compatible remote desktop software

    TurboVNC is a high-performance, enterprise-quality version of VNC based on TightVNC, TigerVNC, and X.org. It contains a variant of Tight encoding that is tuned for maximum performance and compression with 3D applications (VirtualGL), video, and other image-intensive workloads. TurboVNC, in combination with VirtualGL, provides a complete solution for remotely displaying 3D applications with interactive performance. TurboVNC's high-speed encoding methods have been adopted by TigerVNC and...
    Downloads: 132,923 This Week
  • 13
    OpenDDS

    OpenDDS is open source publish/subscribe middleware

    OpenDDS is an open source implementation of the Object Management Group (OMG) Data Distribution Service (DDS), providing a publish/subscribe middleware solution for real-time distributed systems. OpenDDS includes development and run-time tools. Full product information, source code, documentation, build instructions, and license information are available from http://www.opendds.org. Commercial consulting, support, and training for OpenDDS are available.
    Downloads: 0 This Week
  • 14
    CloudI: A Cloud at the lowest level
    CloudI is an open-source private cloud computing framework for efficient, secure, and internal data processing. CloudI provides scaling for previously unscalable source code with efficient fault-tolerant execution of ATS, C/C++, Erlang/Elixir, Go, Haskell, Java, JavaScript/node.js, OCaml, Perl, PHP, Python, Ruby, or Rust services. The bare essentials for efficient fault-tolerant processing on a cloud!
    Downloads: 7 This Week
  • 15
    NuPIC

    Numenta platform for intelligent computing

    ...At the core of HTM are time-based continuous learning algorithms that store and recall spatial and temporal patterns. NuPIC is suited to a variety of problems, particularly anomaly detection and prediction of streaming data sources. For more information, see numenta.org or the NuPIC Forum. If you want to build the dependent nupic.bindings from source, you should build and install from nupic.core prior to installing nupic (since a PyPI release will be installed if nupic.bindings isn't yet installed). To install from local source code, run from the repository root. ...
    Downloads: 1 This Week
  • 16

    dispy

    Distributed and Parallel Computing with/for Python.

    dispy is a generic and comprehensive, yet easy-to-use framework for creating and using compute clusters to execute computations in parallel across multiple processors in a single machine (SMP), or among many machines in a cluster, grid, or cloud. dispy is well suited to the data-parallel (SIMD) paradigm, where a computation (a Python function or standalone program) is evaluated independently with different (large) datasets. dispy supports public / private / hybrid cloud computing and fog / edge computing.
    Downloads: 2 This Week
  • 17
    dimtree

    Data Browser for DIM (Distributed Information Manager from CERN)

    ...DIM uses to create distributed DAQ and experiment control systems. DIMTree is a browser for DIM that provides a tree view for browsing DIM servers, their published services, and their data content. It is a useful test/monitor/debug tool for DIM users.
    Downloads: 0 This Week
  • 18
    Talend Spatial Module (aka Spatial Data Integrator or SDI) is an ETL tool for geospatial data. It is based on Talend Open Studio and provides input, output, and transform geocomponents. IO components read/write GIS formats (e.g. PostGIS, GeoRSS). Transformers all
    Downloads: 6 This Week
  • 19
    jDHTUQ
    ATTENTION: For more info and the latest version, refer to https://github.com/estigma88/jdhtuq. jDHTUQ is a peer-to-peer DHT system based on the Chord algorithm, but built as an API to generalize the implementation of peer-to-peer DHT systems. jDHTUQ has two fundamental services: put and get of resources.
    Downloads: 0 This Week
  • 20
    JPPF

    The open source grid computing solution

    JPPF makes it easy to parallelize computationally intensive tasks and execute them on a Grid.
    Downloads: 0 This Week
  • 21

    Open Event Machine

    An event driven processing runtime for multicore

    Deprecated: see the new repository on GitHub instead (https://github.com/openeventmachine/em-odp). Open Event Machine (OpenEM or EM) is an architectural abstraction and framework for an event-driven, multicore-optimized processing concept originally developed for networking. It offers an easy programming concept for scalable and dynamically load-balanced multicore data plane applications with a very low-overhead run-to-completion principle. EM can run on bare metal for best performance or under an operating system with special arrangements. This release of the Open Event Machine contains the EM API as well as an example implementation for Intel multicore CPUs. NOTE: The release version numbering scheme changed: after v1.5 follows v1.1.0, where 1.1 reflects the OpenEM API version used.
    Downloads: 1 This Week
  • 22
    NetProg is an API in the C language that allows homogeneous programming of communicating applications on many operating systems. It covers the needs of applications that require complex ways of exchanging data in synchronous or asynchronous mode. It is based on messages for exchanging data between clients and servers, and it can also access databases or files with the same message model.
    Downloads: 1 This Week
  • 23
    Vappio is a framework for building virtual appliances that support distributed data processing in cloud computing environments using Sun Grid Engine or Hadoop. The primary target application of Vappio is bioinformatics.
    Downloads: 0 This Week
  • 24
    GloVe

    GloVe model for distributed word representation

    ...The demo.sh script downloads a small corpus consisting of the first 100M characters of Wikipedia. It collects unigram counts, constructs and shuffles cooccurrence data, and trains a simple version of the GloVe model. It also runs a word analogy evaluation script in Python to verify word vector quality.
    Downloads: 1 This Week
  • 25
    THIS CODE IS DEPRECATED. For updated information about MTSA, visit http://mtsa.dc.uba.ar
    Downloads: 0 This Week