Showing 24 open source projects for "fastq-join"

View related business solutions
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • Go From Idea to Deployed AI App Fast Icon
    Go From Idea to Deployed AI App Fast

    One platform to build, fine-tune, and deploy. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    FunSQL.jl

    FunSQL.jl

    Julia library for compositional construction of SQL queries

    FunSQL is a Julia library for the compositional construction of SQL queries. Julia programmers sometimes need to interrogate data with the Structured Query Language (SQL). But SQL is notoriously hard to write in a modular fashion. FunSQL exposes full expressive power of SQL with compositional semantics. FunSQL allows you to build queries incrementally from small independent fragments. This approach is particularly useful for building applications that programmatically construct SQL...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    DataFrames.jl

    DataFrames.jl

    In-memory tabular data in Julia

    DataFrames.jl is a powerful Julia package for working with in-memory tabular data. It provides a familiar, flexible, and efficient interface for handling datasets, making it easy to load, manipulate, join, and analyze structured data. With syntax inspired by data frames in R and pandas in Python, it offers intuitive tools while taking advantage of Julia’s speed and type system. The package is actively maintained by the JuliaData community, with contributions from over 200 developers worldwide. It is widely used for data science, research, and production applications, supported by extensive documentation, tutorials, and a free Julia Academy course. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    OrientDB

    OrientDB

    DBMS supporting graph, document, full-text and geospatial models

    OrientDB is an Open Source Multi-Model NoSQL DBMS with the support of Native Graphs, Documents, Full-Text search, Reactivity, Geo-Spatial and Object Oriented concepts. It's written in Java and it's amazingly fast. No expensive run-time JOINs, connections are managed as persistent pointers between records. You can traverse thousands of records in no time. Supports schema-less, schema-full and schema-mixed modes. Has a strong security profiling system based on user, roles and predicate...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Foxglove Studio

    Foxglove Studio

    Robotics visualization and debugging

    ...Upload recordings to your private data lake for easy storage, searching, and analysis. Stream recorded data directly into Foxglove Studio to get insights into your robots' behavior. We're long-time fans and beneficiaries of open source software. Join our community on Github and Slack to contribute bug reports, feature requests, or pull requests.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Push Code. Get a Production URL. Done. Icon
    Push Code. Get a Production URL. Done.

    Cloud Run deploys any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try Cloud Run Free
  • 5
    Teem

    Teem

    Tools to process and visualize scientific data and images

    ...Includes N-dimensional image I/O and processing, volume rendering, diffusion tensor processing, and more. Latest code via git clone http://git.code.sf.net/p/teem/teem.git . Join us at https://discord.gg/xBBqZGXkF7
    Downloads: 29 This Week
    Last Update:
    See Project
  • 6
    OSHMI - Open Substation HMI

    OSHMI - Open Substation HMI

    SCADA HMI for substations, IoT and automation applications

    ...This project combines existing open source projects and tools to create a very capable, mobile and cloud-friendly HMI system that can rival proprietary software. This approach makes it possible to join forces of each project (Chromium, SVG/HTML5, PHP, Lua, SQLite, Inkscape, Lib61850, OpenDNP3, Nginx, Vega, PostgreSQL, Grafana,…) to achieve a great set of open, evergreen, modular and customizable tools for building great HMIs for automation projects. This is not a toy project! It's been actually used in dozens of substations up to 230kV level and also in control centers with configurations up to 70.000 tags. ...
    Leader badge
    Downloads: 34 This Week
    Last Update:
    See Project
  • 7
    PANDORA

    PANDORA

    Revolutionizing Biomedical Research with Advanced Machine Learning

    ...In order to run a test instance of PANDORA, users will first need to prepare their local environment by downloading, installing, and configuring Docker. genular is a community behind SIMON an open-source Machine Learning KnowledgeDiscovery software, built by a vibrant community of people just like you! Join us and make SIMON even cooler! Exploratory analysis of machine learning results with the help of many different visualization techniques will give you instant insights into models and data.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Javis

    Javis

    Julia Animations and Visualizations

    ...Javis makes generating simple animations a breeze! Want to learn more? Check out our documentation for tutorials, our contributing guidelines, and the mission of Javis.jl. We have a live Zulip stream that you can join to discuss Javis with other Javis users.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    DataStation Community Edition

    DataStation Community Edition

    App to easily query, script, and visualize data from every database

    ...It allows you to easily build graphs and tables with data pulled from SQL databases, logging databases, metrics databases, HTTP servers, and all kinds of text and binary files. Need to join or munge data? Write embedded scripts as needed in languages like Python, JavaScript, R or SQL. All in one application. Build reports with graphs, charts and tables. Script against data. Cross-platform: Windows, macOS, and Linux. Easily fetch your data, wherever it is: 18 SQL and non-SQL databases, files, HTTP server. Over 20 supported databases out of the box including: PostgreSQL, MySQL, MariaDB, SQLite, SQL Server, Oracle, ClickHouse, Influx 1 (InfluxQL), Influx 2 (Flux), Prometheus, Elasticsearch, AWS Athena, Google BigQuery, Snowflake, Cassandra, Scylla, TimescaleDB, YugabyteDB, CockroachDB, CrateDB, QuestDB, and Airtable.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Cut Your Data Warehouse Bill by 54% Icon
    Cut Your Data Warehouse Bill by 54%

    Migrate from Snowflake, Redshift, or Databricks with free tools. No SQL rewrites.

    BigQuery delivers 54% lower TCO with serverless scale and flexible pricing. Free migration tools handle the SQL translation automatically.
    Try Free
  • 10
    JuliaDB.jl

    JuliaDB.jl

    Parallel analytical database in pure Julia

    JuliaDB is a package for working with large persistent data set. JuliaDB provides distributed table and array datastructures with convenient functions to load data from CSV. JuliaDB is Julia all the way down. This means queries can be composed with Julia code that may use a vast ecosystem of packages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Query.jl

    Query.jl

    Query almost anything in julia

    Query is a package for querying julia data sources. It can filter, project, join and group data from any iterable data source, including all the sources supported in IterableTables.jl. One can for example query any of the following data sources: any array, DataFrames, DataStreams (including CSV, Feather, SQLite, ODBC), DataTables, IndexedTables, TimeSeries, Temporal, TypedTables and DifferentialEquations (any DESolution).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    JAETL - Just Another ETL Tool

    JAETL - Just Another ETL Tool

    Just Another ETL Tool

    JAETL - Just Another ETL tool is a tiny and fast ETL tool to develop data warehouse. JAETL allows to Extract data from ARFF (Weka), CSV, and SQL, Transform the data with join, replace missing values, remove duplicates, mapping filtering, variable selection, and Load the data into SQL server and export to CSV and ARFF.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    MarDRe

    MapReduce-based tool to remove duplicate DNA reads

    MarDRe is a de novo MapReduce-based parallel tool to remove duplicate and near-duplicate DNA reads through the clustering of single-end and paired-end sequences from FASTQ/FASTA datasets. This tool allows bioinformatics to avoid the analysis of not necessary reads, reducing the time of subsequent procedures with the dataset. MarDRe is the Big Data counterpart of ParDRe (link above), which employs HPC technologies (i.e., hybrid MPI/multithreading) to reduce runtime on multicore systems. Instead, MarDRe takes advantage of the MapReduce programming model to significantly improve ParDRe performance on distributed systems, especially on cloud-based infrastructures. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    apache spark data pipeline osDQ

    apache spark data pipeline osDQ

    osDQ dedicated to create apache spark based data pipeline using JSON

    This is an offshoot project of open source data quality (osDQ) project https://sourceforge.net/projects/dataquality/ This sub project will create apache spark based data pipeline where JSON based metadata (file) will be used to run data processing , data pipeline , data quality and data preparation and data modeling features for big data. This uses java API of apache spark. It can run in local mode also. Get json example at https://github.com/arrahtech/osdq-spark How to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    HSRA

    Hadoop spliced read aligner for RNA-seq data

    ...This tool allows bioinformatics researchers to efficiently distribute their mapping tasks over the nodes of a cluster by combining a fast multithreaded spliced aligner (HISAT2) with Apache Hadoop, which is a distributed computing framework for scalable Big Data processing. HSRA currently supports single-end and paired-end read alignments from FASTQ/FASTA datasets. Moreover, our tool uses the Hadoop Sequence Parser (HSP) library (link above) to efficiently read the input datasets stored on the Hadoop Distributed File System (HDFS), being able to process datasets compressed with Gzip and BZip2 codecs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    X10

    Performance and Productivity at Scale

    ...This model introduces two key concepts -- places and asynchronous tasks -- and a few mechanisms for coordination. With these, APGAS can express both regular and irregular parallelism, message-passing-style and active-message-style computations, fork-join and bulk-synchronous parallelism. Both its modern, type-safe sequential core and simple programming model for concurrency and distribution contribute to making X10 a high-productivity language in the HPC and Big Data spaces. User productivity is further enhanced by providing tools such as an Eclipse-based IDE (X10DT). Implementations of X10 are available for a wide variety of hardware and software platforms ranging from laptops, to commodity clusters, to supercomputers.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 17
    An integrated pipeline for forensic analysis from SNP panel data. 1. SNP caller takes a FASTQ file and reference SNP panel as input and generates SNP calls. 2. Kinship analysis 3. Ancestry prediction 4. Data quality check 5. Replicate analysis 6. Mixture analysis module available by request
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    BAIO

    BAIO

    Bioinformatics Artificial Intelligence Order

    A smart interface of AI that will interrogate and complete your bioinformatics data analysis for you. Download and start your instance of BAIO to join the network of great bioinformatics Minds.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    di-history-join-plugin

    Plugin for Pentaho Data Integration used to supply a method to join tw

    This plugin supply a method to join two tables using the date-from and date-to history. It use the two dates that indicate the life of the record and join using a query (like the database join plugin) to resolve the record's story of the two entities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Metis for Oracle
    A tool to support data-analysis on a Oracle-DB. In Greek mythology, Metis (Μῆτις) was of the Titan generation ... The Stoic commentators allegorized Metis as the embodiment of "wisdom" or "wise counsel" ... (from Wikipadia)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Toolsverse ETL Framework

    Toolsverse ETL Framework

    Open source Extract Transform Load engine written in Java

    ETL Framework is a standalone Extract Transform Load engine written in Java. It includes executables for all major platforms and can be easily integrated into other applications. Key Features: * embeddable, open source and free * fast and scalable * uses target database features to do transformations and loads * manual and automatic data mapping * data streaming * bulk data loads * data quality features using SQL, JavaScript? and regex * data transformations Requirements *...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    OpenSAP
    ...Free plug-ins for SAP Business One (SAP B1), that I developed privately. Since upgrading to 8.8.1, I have to convert them and in doing so, I am open sourcing them. I am hoping others will join with their own stable add-ons :-)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Tata is a Qt project aiming at developing a data analysis program for scientific use. It contains wave form display and other data analysis functions, you can easily add your own code to fulfill your specific need. Welcome to join us!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Dark Beginnings is a MUD server made in C++ using Qt4.3 and MySQL, it is aiming at having a higher number of stock deployment options by use of scripted modules. Anyone is free to join and help out.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB