Showing 21 open source projects for "data science"

View related business solutions
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 1
    MooseFS

    MooseFS

    Fault tolerant, POSIX-compliant, Net Distributed Storage / File System

    MooseFS (MFS) is a fault tolerant, highly performing, scaling-out, network distributed file system. It spreads data over several physical servers which are visible to the user as one resource. For standard file operations MooseFS mounted with FUSE acts as other Unix-alike file systems: * A hierarchical structure (directory tree) * Stores POSIX file attributes (permissions, last access and modification times) Supports special files (block and character devices, pipes and sockets) *...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    DataSophon

    DataSophon

    The next generation of cloud-native big data management expert

    Aiming at quickly deploying, managing, monitoring and automating the operation and maintenance of Big Data service components and nodes, helping you quickly build stable, efficient Big Data cluster services. The Three-Body Problem, a Hugo Award-winning work of the world's highest science fiction literature, is known for its stunning "hard science fiction" style, and its author Liu Cixin is credited with "single-handedly raising Chinese science fiction to a world-class level". ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    dispy

    Distributed and Parallel Computing with/for Python.

    dispy is a generic and comprehensive, yet easy to use framework for creating and using compute clusters to execute computations in parallel across multiple processors in a single machine (SMP), among many machines in a cluster, grid or cloud. dispy is well suited for data parallel (SIMD) paradigm where a computation (Python function or standalone program) is evaluated with different (large) datasets independently. dispy supports public / private / hybrid cloud computing, fog / edge computing.
    Leader badge
    Downloads: 36 This Week
    Last Update:
    See Project
  • 4
    JPPF

    JPPF

    The open source grid computing solution

    JPPF makes it easy to parallelize computationally intensive tasks and execute them on a Grid.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    Genetic Programming Classifier is a distributed evolutionary data classification program. It uses the ensemble method implemented under a parallel co-evolutionary Genetic Programming technique.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    PARUS

    PARUS

    Dataflow parallel programming language for clusters

    PARUS is a data-flow parallel programming language that allows to build parallel programs for clusters and MPP multiprocessors. The data-flow graph is automatically converted to the C++/MPI source and linked with the libparus runtime library. Also there are available tools for benchmarking cluster interconnect and visualize it.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    PBS Cluster Viz
    PBS Cluster Viz is a project to display information useful to admins and users about a computing cluster managed by a PBS-compatible resource manager. Information includes load and job distribution. Interactive as well as static output is available.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    XDAQ is a software platform designed specifically for the development of distributed data acquisition systems. The development is carried out at CERN, the European Organization for Nuclear Research. Please visit http://xdaq.web.cern.ch
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Creation of workflows that will be executed on clusters. Must be able to easily create task sequences (like Automator) and to exchange data with the cluster. Monitoring may be web-based.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    A MATLAB spectral clustering package to handle large data sets (200,000 RCV1 data) on a 4GB memory general machine. We implement various ways of approximating the dense similarity matrix, including nearest neighbors and the Nystrom method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Construct and management virtual clusters which the nodes of the clusters running on the virtual machine rather than native machine. Fast install and switch virtual clusters. Efficiently storage vm images. support variant VMMs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    This is an opensource project for the calculation of the mathematical constant Pi. If you are a mathematician or programer please contribute ideas or code for this project. Non professionals are also welcome to contribute.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    A data parallel scientific programming model. Compiles efficiently to different platforms like distributed memory (MPI), shared memory multi-processor (pthreads), Cell BE processor, Nvidia Cuda, SIMD vectorization (SSE, Altivec), and sequential C++ code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Hubble in a Bottle!
    A high performance visualization tool for stars and particles able to run on Beowulfs. Rendering step is implemented separately and optimized for SSE, compatible with OpenGL. Navigation mechanism with quaternions. Ten color maps, sobel contour detect
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    NOD-MP stands for not another data-mining project. It is educational and scientific software to utilize data mining clustering algorithms through a user-friendly interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A Middleware for Distrubted Data Stream Processing
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Rackview is a Perl module that assists in visualizing a Data Center\'s computer rack layouts by creating a web page with HTML tables to show a frontview of the racks. Included are brief commandline and CGI scripts to demonstrate its implementation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    NIUS allows reservation of nodes under it's control for exclusive interactive use by a group of users. This system is in use by National Center for Data Mining at University of Illinois in Chicago to reserve workstations for researchers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    JCGrid is an easy-to-use tools for real-world grid-computing. It requires only to install Java Runtime and you can easy split your cpu-intensive task among several Workstations/PC/Mac/etc. It includes support for POVRay and MandelGrid as usage examples.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Circus is a jMonkeyEngine based cluster-solution for multi-screen VR-systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    The message passing interface (MPI) standard is a library specification for message passing on parallel computers. This project develops an extension to MPI in C++ such that STL objects can be transfered just as easily as fundamental data types.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo