Showing 33 open source projects for "fastq"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Cloud tools for web scraping and data extraction Icon
    Cloud tools for web scraping and data extraction

    Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

    Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
    Explore 10,000+ tools
  • 1
    123FASTQ

    123FASTQ

    An intuitive and efficient tool for preprocessing Illumina FASTQ reads

    123FASTQ performs all the pre-processes of Illumina next-generation sequencing reads (FASTQ files) easier than ever.  Download the quick user manual for the latest version: https://dl.adbioinformatics.net/NGSNeeds/myTools/123Fastq_v1.3_Manual.pdf Authors: Milad Eidi, Samaneh Abdolalizadeh, Mohammad Hossein Nassirpour Supervisors: Javad Zahiri, PhD University of California San Diego  Masoud Garshasbi, PhD Tarbiat Modares University, Tehran, Iran If you use 123FASTQ, please cite this preprint: 123FASTQ: an intuitive and efficient tool for preprocessing Illumina FASTQ reads https://www.biorxiv.org/content/10.1101/2024.03.08.584032v1 ########################################################## Take care of the details and ensure you use the latest version. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2

    BBMap

    BBMap short read aligner, and other bioinformatic tools.

    ...BBNorm: Kmer-based error-correction and normalization tool. Dedupe: Simplifies assemblies by removing duplicate or contained subsequences that share a target percent identity. Reformat: Reformats reads between fasta/fastq/scarf/fasta+qual/sam, interleaved/paired, and ASCII-33/64, at over 500 MB/s. BBDuk: Filters, trims, or masks reads with kmer matches to an artifact/contaminant file. ...and more!
    Leader badge
    Downloads: 315 This Week
    Last Update:
    See Project
  • 3
    miRDeep*

    miRDeep*

    MiRDeep*

    Please cite: An, J., Lai, J., Lehman, M.L. and Nelson, C.C. (2013) miRDeep*: an integrated application tool for miRNA identification from RNA sequencing data. Nucleic Acids Res, 41, 727-737. We will create index for you if you tell us your interested species (j.an@qut.edu.au). download command line version "MDS_command_line_Vxx.zip" clicking "Browse All Files" please find miRPlant in sourceforge for plant miRNA prediction.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    FastQC

    FastQC

    A quality control analysis tool for high throughput sequencing data

    ...Its goal is to provide a simple way by which to check the quality of raw sequence data coming from high throughput sequencing pipelines. It does this by running a modular set of analyses on one or more raw sequence files in fastq or bam format. It then produces a report summarizing the results, and highlighting any areas where the library may appear unusual. This should then direct you to where your data may have problems and allow you to take necessary steps to correct it before doing any further analysis. FastQC is not tied to any specific type of sequencing technique, so it can be used to look at libraries of various experiment types (Genomic Sequencing, ChIP-Seq, RNA-Seq, BS-Seq etc etc).
    Downloads: 49 This Week
    Last Update:
    See Project
  • Scalable restaurant tech for stellar guest experiences Icon
    Scalable restaurant tech for stellar guest experiences

    For Pizza, Delivery, Takeout, Quick Serve, Fast casual, and Full Service Restaurants with as little as one store to 100 or more.

    HungerRush helps restaurants compete in the toughest business on earth. We offer a fully integrated restaurant management system that’s easy to use and can be configured to engage your guests better, streamline your operations, master your own marketing, or all of the above. Want to offer online ordering? It’s built in. Want to get the latest performance data on your operations and marketing? No problem. Want to make customers for life by creating personalized experiences you know they’ll love? Order up. And since our system is backed by a dedicated and US-based support team, you’ll always be ready for the rush.
    Learn More
  • 5

    slimfastq

    An efficient lossless compression for fastq files.

    slimfastq is a cli application that compresses/decompresses fastq files. It features: * High compression ratio * Relatively low cpu/memory usage * Truly lossless compression/decompression
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    MPI-dot2dot

    A Parallel Tool to Find DNA Tandem Repeats on Multicore Clusters

    MPI-dot2dot is a parallel tool to accelerate the identification of Tandem Repeats on multisequence datasetes. This tool receives as input a multisequence file with FASTQ or FASTA formats. It uses MPI processes and OpenMP threads to exploit the compute capabilities of multicore clusters.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    GapFiller

    A de novo local assembler for paired reads

    GapFiller is a seed-and-extend local assembler to fill the gap within paired reads. It can be used for both DNA and RNA and it has been tested on Illumina data. GapFiller can be used whenever a sequence is to be assembled starting from reads lying on its ends, provided a loose estimate of sequence length.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8

    CusVarDB

    CusVarDB generated variant protein database from NGS-datasets

    ...Create the variant protein database Apart from the main modules, the program also supports additional functions such as 1. Download the SRA 2. Convert the SRA file to fastq file format 3. Download the annotation (ANNOVAR) database and Dry-run concept to customize the commands Executables are available at http://bioinfo-tools.com/Downloads/CusVarDB/V1.0.0/ Test dataset is available at http://bioinfo-tools.com/Downloads/CusVarDB/V1.0.0/test_dataset.rar
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    MoPAC

    The Modular Pipeline for the Analysis of CRISPR screens

    To facilitate the comparison of gene essentialities in two or more cell samples, we propose MoPAC (Modular Pipeline for Analysis of CRISPR screens), a Shiny-driven interactive tool for differential essentiality analysis in CRISPR/Cas9 screens. For installation and usage instructions please refer to the wiki page.
    Downloads: 4 This Week
    Last Update:
    See Project
  • AI Powered Global HCM for the Evolving World of Work Icon
    AI Powered Global HCM for the Evolving World of Work

    For Start-ups, SME's, Large Enterprise

    Darwinbox is a new-age & disruptive mobile-first, cloud-based HRMS platform built for the large enterprises to attract, engage and nurture their most critical resource - talent. It is an end-to-end integrated HR system that aids in streamlining activities across the employee lifecycle (Hire to Retire). Our powerful enterprise product features are built with a clear focus on intuitiveness and scalability, with standards of best in class consumer apps. Darwinbox’s motto is to engage, empower, and inspire employees on one side in addition to automating and simplifying all HR processes for the enterprise on the other. Over 350+ leading enterprises with 850k users manage their entire employee lifecycle on this unified platform.
    Learn More
  • 10

    selectseq

    Get specific sequences from a FASTA or FASTQ file.

    A command-line utility to manipulate biological sequences from a FASTA or FASTQ file. It can, given a list of identifiers, get only a subset of the sequences (or their complement, i.e., sequences NOT in the list). Can also get sequence number N only. Compressed sequences files are supported if readable by zcat.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    MarDRe

    MapReduce-based tool to remove duplicate DNA reads

    MarDRe is a de novo MapReduce-based parallel tool to remove duplicate and near-duplicate DNA reads through the clustering of single-end and paired-end sequences from FASTQ/FASTA datasets. This tool allows bioinformatics to avoid the analysis of not necessary reads, reducing the time of subsequent procedures with the dataset. MarDRe is the Big Data counterpart of ParDRe (link above), which employs HPC technologies (i.e., hybrid MPI/multithreading) to reduce runtime on multicore systems. Instead, MarDRe takes advantage of the MapReduce programming model to significantly improve ParDRe performance on distributed systems, especially on cloud-based infrastructures. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    HSRA

    Hadoop spliced read aligner for RNA-seq data

    ...This tool allows bioinformatics researchers to efficiently distribute their mapping tasks over the nodes of a cluster by combining a fast multithreaded spliced aligner (HISAT2) with Apache Hadoop, which is a distributed computing framework for scalable Big Data processing. HSRA currently supports single-end and paired-end read alignments from FASTQ/FASTA datasets. Moreover, our tool uses the Hadoop Sequence Parser (HSP) library (link above) to efficiently read the input datasets stored on the Hadoop Distributed File System (HDFS), being able to process datasets compressed with Gzip and BZip2 codecs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    bio-cargo

    CARGO - Compressed ARchival for GenOmics

    CARGO is a high-level framework that can semi-automatically generate software systems optimized for the compressed storage of arbitrary types of large genomic data collections. Straightforward applications of CARGO methods to compress FASTQ and SAM format archives require only a few lines of code, produce solutions that match and sometimes outperform specialized format-tailored compressors, and scale well to multi-TB datasets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Flexbar

    Flexbar

    flexible barcode and adapter removal for sequencing platforms

    ...It demultiplexes barcoded runs and removes adapter sequences. Moreover, trimming and filtering features are provided. Flexbar supports next-generation sequencing data in fasta and fastq format, e.g. from the Illumina platform. Reference: Matthias Dodt, Johannes T. Roehr, Rina Ahmed, Christoph Dieterich: Flexbar — flexible barcode and adapter processing for next-generation sequencing platforms. Biology 2012, 1(3):895-905.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework with the Picard SAM JDK, and command line tools similar to SAMtools. The file formats currently supported are BAM, SAM, FASTQ, FASTA, QSEQ, BCF, and VCF. For a longer high-level description of Hadoop-BAM, refer to the article "Hadoop-BAM: directly manipulating next generation sequencing data in the cloud" in Bioinformatics Volume 28 Issue 6 pp. 876-877, available online at: http://dx.doi.org/10.1093/bioinformatics/bts054 Note that the library part of Hadoop-BAM is mainly for developers with experience in using Hadoop. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    SuRankCo

    Supervised Ranking of Contigs in de novo Assemblies

    ...Renard (http://www.biomedcentral.com/1471-2105/16/240/abstract) PLEASE NOTE, it is recommended to read the paper and the readme.txt file before using SuRankCo. Update Jun2015: * Minor changes to enable BAM support. Update Feb2014: * Added support for FASTA/SAM assemblies in addition to ACE/FASTQ(QUAL). NOTE: features of FASTA/SAM assemblies do not include BaseCount, BaseSeqmentCount and ContigQualities yet.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    HBSAnalyzer

    Aligner and methylation caller for hairpin bisulfite sequencing data

    HBS Analyzer is a methylation calling tool for hairpin-bisulfite sequencing data. HBS Analyzer can accept raw hairpin-bisulfite sequencing data in FASTA or FASTQ format, align the paired end reads to the reference genome and call methylation. Given the double stranded nature of the data that is used, HBS analyzer can identify errors caused by PCR and sequencing along with identifying hemi-methylated sites. This error information is also considered while making the methylation call and hence methylation sites introduced by SNPs are also identified along with the known methylation sites in the genome.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    HBSAnalyzer

    Aligner and methylation caller for hairpin bisulfite sequencing data

    HBS Analyzer is a methylation calling tool for hairpin-bisulfite sequencing data. HBS Analyzer can accept raw hairpin-bisulfite sequencing data in FASTA or FASTQ format, align the paired end reads to the reference genome and call methylation. Given the double stranded nature of the data that is used, HBS analyzer can identify errors caused by PCR and sequencing along with identifying hemi-methylated sites. This error information is also considered while making the methylation call and hence methylation sites introduced by SNPs are also identified along with the known methylation sites in the genome.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    TriageTools

    Tools for partitioning and prioritizing fastq data

    TriageTools is a collection of tools for partitioning raw data (fastq reads) from high-throughput sequencing projects. The tools are designed for basic data management as well for prioritizing analysis of certain subsets. The project wiki contains usage information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    fqzcomp

    A fastq compression program

    Fqzcomp is a basic fastq compressor, designed primarily for high performance. Despite that it is comparable to bzip2 for compression levels.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    caplib

    caplib

    Correct, translate and analyze combinatorial library sequencing data

    Originally developped to handle PacBio CCS data for an AAV capsid library. This program will extract, correct, translate and analyze the sequencng data, starting from the CCS fastq file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    DimerRemover

    Remove adapter dimers from NGS data

    This program can be used to count or remove adapter dimers in fastq files. Using a provided adapter sequence, it generates variations of this sequence and stores them in a hash table. The reads can then be directly matched against the hash. It is far more time efficient than doing alignment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    trimMate is a tool to remove junction adapters as well as sequencing adapters from mate pair libraries and trim the sequences accordingly. It works on fastq files generated by next generation sequencing (NGS) machines. The release is source code only, please download from version control.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    GenOO-HTS

    A Modern Perl Framework for High Throughput Sequencing analysis

    GenOO-HTS [jee-noo] is an open-source; object-oriented Perl framework specifically developed for the design of High Throughput Sequencing (HTS) analysis tools. The primary aim of GenOO-HTS is to make simple HTS analyses easy and complicated analyses possible. GenOO-HTS models biological entities into Perl objects and provides relevant attributes and methods that allow for the manipulation of high throughput sequencing data. Using GenOO-HTS as a core development module reduces the overhead...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    NeedlemanWunsch

    NeedlemanWunsch

    Fast global sequence alignment for the masses!

    MOVED TO GITHUB: https://github.com/noporpoise/seq-align Global optimal sequence alignment using the Needleman-Wunsch algorithm. Aligns DNA, RNA, protein sequence and more! See our sister project local alignment using Smith-Waterman: http://sourceforge.net/projects/smithwaterman/
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next