This project deals with de novo assembled contigs to cluster them into two groups forming the lengthwise clusters. These two groups are correctly assembled contigs and wrongly assembled contigs. For non-model organisms, without reference genome, it is very hard to state that which contig is correct and which is not. With this project, the proposed unsupervised clustering method (K-means) based clustering is done utilizing the depth of reads mapping based coverage information. The coverage based features are used for clustering.
The same can also be found at SCBB software page (http://scbb.ihbt.res.in/SCBB_dept/Software.php).
Features
- Clustering
- de novo assembly identification
- mis-assembly detection
Follow de novo assembly validation
You Might Also Like
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of de novo assembly validation!