Clone-by-clone sequencing, as a means of achieving high quality assemblies for large and complex genomes, continues to be of great relevance in the era of high throughput sequencing. However, assemblies obtained using current whole genome assemblers are often fragmented and sometimes have issues of genome completeness owing to different data characteristics introduced by multiplexed sequencing.

With iCAS the data filtering process is based on a novel kmer frequency algorithm, resulting in near perfect pre-assembly reads. Contigs are generated using different assembly algorithms and then merged together to achieve longer continuity. Re-aligning all the reads back to the draft contigs and recalibrating each sequence base achieves a final consensus. Using finished clones for QC, the pipeline is able to obtain assemblies with clone coverage of 99.7% and consensus base quality of Q39.

Developed at the Wellcome Trust Sanger Institute.

Project Activity

See All Activity >

Categories

Bio-Informatics

License

GNU General Public License version 3.0 (GPLv3)

Follow iCAS - An Illumina Clone Assembly System

iCAS - An Illumina Clone Assembly System Web Site

Other Useful Business Software
Full-stack observability with actually useful AI | Grafana Cloud Icon
Full-stack observability with actually useful AI | Grafana Cloud

Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Create free account

Additional Project Details

Intended Audience

Science/Research

Programming Language

C, Perl

Related Categories

Perl Bio-Informatics Software, C Bio-Informatics Software

Registered

2014-07-14