Showing 7 open source projects for "duplicates"

View related business solutions
  • Make Your Observability Stack Effortless Icon
    Make Your Observability Stack Effortless

    For Software Engineers, DevOps, Data Architects, and IT Leaders

    The progression to modern application stacks and microservices architectures has resulted in orders of magnitude more logs, metrics, events, and traces. Like gravity, data attracts more data, making it increasingly difficult to move and process as it accumulates over time. More than ever, there is a need to be able to stream-process, filter, mask, transform, aggregate, analyze, and route that data to various data tier destinations optimized for specific usage.
  • Contract Automation Made Easy Icon
    Contract Automation Made Easy

    Use Docubee to easily gather data, generate contracts, share them your way, and collect secure eSignatures

    Docubee is an intelligent contract automation platform that allows you to quickly and painlessly generate, manage, share, and sign contracts. Featuring powerful conditional logic-based workflows, generative AI technology, and an easily adaptable interface, Docubee makes it easy to automate your most complex contracts and agreements.
  • 1

    WellMeth

    Genome-Wide DNA Methylation Analysis with RRBS

    WellMeth is a integrated framework for Reduced Representation Bisulfite-Seq analysis
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    DataCleaner

    DataCleaner

    Data quality analysis, profiling, cleansing, duplicate detection +more

    DataCleaner is a data quality analysis application and a solution platform for DQ solutions. It's core is a strong data profiling engine, which is extensible and thereby adds data cleansing, transformations, enrichment, deduplication, matching and merging. Website: http://datacleaner.github.io
    Leader badge
    Downloads: 95 This Week
    Last Update:
    See Project
  • 3

    ParDRe

    Parallel tool to remove duplicate DNA reads

    ... number of cores and, thanks to the message-passing technology, it can be executed on clusters. There also exists a MapReduce counterpart of ParDRe, called MarDRe (see the link above). UPDATE: From version 2.0.5 ParDRe also provides support to remove only optical duplicates (and leave biologically interesting duplicates) as well as to work with compressed input/output with .gz format.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 4

    CIMS

    Crosslinking induced mutation site analysis

    This package includes the scripts to detect statistically reproducible crosslinking induced mutation sites (CIMS) and cross linking induced truncation sites (CITS) from HITS-CLIP data. References: Moore, M.*, Zhang, C.*, Gantman, E.C., Mele, A., Darnell, J.C., Darnell, R.B. 2014. Mapping Argonaute and conventional RNA-binding protein interactions with RNA at single-nucleotide resolution using HITS-CLIP and CIMS analysis. Nat Protocols, 9:263-293. Zhang,C.†, Darnell, R.B.† 2011....
    Downloads: 0 This Week
    Last Update:
    See Project
  • Let your volunteer coordinators do their best work. Icon
    Let your volunteer coordinators do their best work.

    For non-profit organizations requiring a software solution to keep track of volunteers

    Stop messing with tools that aren’t designed to amplify volunteer programs. With VolunteerMatters, it’s a delight to manage everything in one place.
  • 5
    MethylExtract

    MethylExtract

    High-Quality methylation maps and SNV calling from BS-Seq experiments

    MethylExtract is a user friendly tool to generate i) high quality, whole genome methylation maps and ii) to detect sequence variation within the same sample preparation. The program is implemented into a single script and takes into account all major error sources: sequencing errors, bisulfite failure, clonal reads and single nucleotide variants. MethylExtract detects variation (SNVs – Single Nucleotide Variation) in a similar way than VarScan, a very sensitive method extensively used in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    A small command line tool to process streams of numbers and spit out the running total, bin the results or merge duplicates. Supports arbitrary precision numbers. ocdf is named for the cumulative distribution function in probability.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    This utility can process multiple POI (Points Of Interest) files (format CSV, comma separated), merge POI lists, find and eliminate (or mark) duplicates (POIs with similar coordinates, e.g. POIs having < 10m distance between them). See docs & screenshots
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next