Showing 11 open source projects for "parallel computing datamaning"

  • 1

    MarDRe

    MapReduce-based tool to remove duplicate DNA reads

    ...Instead, MarDRe takes advantage of the MapReduce programming model to significantly improve ParDRe performance on distributed systems, especially on cloud-based infrastructures. Written in pure Java to maximize cross-platform compatibility, MarDRe is built upon the open-source Apache Hadoop project, the most popular distributed computing framework for Big Data processing.
    Downloads: 0 This Week
  • 2

    HSRA

    Hadoop spliced read aligner for RNA-seq data

    HSRA is a MapReduce-based parallel tool for mapping reads from RNA sequencing (RNA-seq) experiments. RNA-seq analyses typically begin by mapping reads to a reference genome to determine the location from which each read originated, which is a very time-consuming step. This tool allows bioinformatics researchers to efficiently distribute their mapping tasks over the nodes of a cluster by combining a fast multithreaded spliced aligner (HISAT2) with Apache Hadoop, a distributed computing framework for scalable Big Data processing. ...
    Downloads: 0 This Week
  • 3

    SAT-Assembler

    Scalable and accurate targeted gene assembly for large-scale NGS data

    ...It recovers genes from gene families of particular interest to biologists with high coverage, low chimera rate, and extremely low memory usage compared with existing gene assembly tools. Moreover, it is naturally compatible with parallel computing platforms.
    Downloads: 0 This Week
  • 4

    parSRA

    Portable Framework for the Parallel Execution of Short Read Aligners

    ...It uses the following techniques to improve scalability: 1) fast splitting of the input reads using the FUSE kernel module available in most current Linux distributions; 2) balanced on-demand distribution of the reads based on the shared locks of UPC++, an extension of C++ for parallel computing that follows the Partitioned Global Address Space (PGAS) paradigm. parSRA is portable: its configuration file allows users to parallelize the execution of existing SRA tools without modifying the source code of parSRA or the aligner.
    Downloads: 0 This Week
  • 5

    Ezys

    Ezys 3D medical image registration program

    Ezys is a non-linear 3D medical image registration program. Ezys fully exploits the parallel computing power of inexpensive commercial graphics processing units (GPUs), resulting in a very fast and accurate program capable of running on desktop PCs and even some laptops. On these systems, non-linear image registrations take less than a minute to complete. Ezys implements a diffeomorphic, inverse-consistent image registration algorithm with a demons-style regularization based on a non-parametric free-form deformation model. ...
    Downloads: 0 This Week
  • 6

    PPSeq: Parallel NGS Analysis

    Parallel Processing for Next-Generation Sequencing (NGS) Analysis

    High-throughput next-generation sequencing (NGS) technology has quickly emerged as a powerful tool in many aspects of biomedical research. However, along with its rapid development, the data volume and analysis complexity of NGS far exceed the capacity and capability of traditional small-scale computing facilities, such as multithreaded algorithms on standalone workstations. To address this issue, we present a solution that uses the ever-increasing processing power of massively parallel processing (MPP) systems.
    Downloads: 0 This Week
  • 7

    GENIE (GEne-geNe IntEraction)

    GPU-based Parallel Gene-Gene Interaction Analysis

    Gene-gene interaction analysis in genetic association studies is computationally intensive when a large number of SNPs are involved. Most recent Central Processing Units (CPUs) have multiple cores, while Graphics Processing Units (GPUs) have hundreds of cores and have recently been used to implement faster scientific software. However, there are currently no genetic analysis software packages that allow users to fully utilize the computing power of these multi-core devices for genetic...
    Downloads: 0 This Week
  • 8

    PASHA: Parallelized Short Read Assembly

    One of the fastest parallel short read assemblers for large genomes.

    PASHA is a parallel short read assembler for large genomes using de Bruijn graphs. Taking advantage of both shared-memory multi-core CPUs and distributed-memory compute clusters, PASHA has demonstrated its potential to perform high-quality de novo assembly of large genomes in reasonable time with modest computing resources. Our evaluation using three small real paired-end datasets shows that PASHA produces better assemblies, with comparable genome coverage and mis-assembly rates, than three leading assemblers: Velvet, ABySS and SOAPdenovo. ...
    Downloads: 0 This Week
  • 9
    Platform for parallel computation in the Amazon cloud, including machine learning ensembles written in R for computational biology and other areas of scientific research. Home to MR-Tandem, a Hadoop-enabled fork of the X!Tandem peptide search engine.
    Downloads: 0 This Week
  • 10
    Microbase is a Grid infrastructure for deploying massively parallel applications. Our notification-driven task scheduler supports the enactment of complex data processing workflows, such as those commonly used in bioinformatics.
    Downloads: 0 This Week
  • 11
    ProteinFinder is a C-language parallel computing engine for tandem protein mass spectrometry database search. ProteinFinder interfaces with MassSpec, a MySQL relational database that hosts the experimental data, predicted databases, and search results.
    Downloads: 0 This Week