From: Aylwyn S. <as...@sa...> - 2009-03-25 09:35:35
|
On Wed, 25 Mar 2009, Heng Li wrote: > Hello Thon, > > I think you are talking about local de novo assembly around structural > variations. SAM is designed with this application in mind. It puts unmapped > mate close to the mapped mate and therefore retrieving data for local > assembly is convenient and highly efficient. > > Aylwyn from Richard's group has a pipeline to do such local de novo assembly > based on maq's output. I guess it is straightforward to adapt the pipeline to > SAM when this is really needed in future. So far as I know, all the standard > de novo assemblers take a fastq or a fasta file as the input. You have to > develop a pipeline to feed data to them. That's correct. I have considered adapting to use the SAM format as it would actually be somewhat more convenient than the Maq map format. So I would say SAM is a good format for use in assembly. Aylwyn > Heng > > On 24 Mar 2009, at 23:33, Complete Genomics Developer Support wrote: > >> Hi, >> >> Are there any assemblers that use the SAM/BAM output as input yet? >> As I see it, the SAM file is an intermediate step between mapping and >> assembly, since assembly of the complete sequence to facilitate proper >> variation calling requires the de-novo assembly of reads that did not >> initially map to the reference, but which have a consistent mate pair some >> distance away?I understand that samtools is not attempting to do any of >> these types of assembly, but maybe others do? >> >> We are looking for a format that can represent the mapping of our read >> structure, but also that would be able to be used as the input format for >> downstream assembly. We expect that many of our customers would want to >> independently verify our assembly to other assemblies. >> >> Thanks, >> >> Thon >> __________________________________ >> Thon de Boer, Ph.D. >> Sr. Product Manager Bioinformatics >> Complete Genomics, Inc. >> 2071 Stierlin Court >> Mountain View, CA, 94043 >> +1-650-943-2735 >> mailto:td...@co... >> Powering large-scale human genome studies >> _______________________________________ >> >> ____ >> >> The contents of this e-mail and any attachments are confidential and only >> for >> use by the intended recipient. Any unauthorised use, distribution or >> copying >> of this message is strictly prohibited. If you are not the intended >> recipient >> please inform the sender immediately by reply e-mail and delete this >> message >> from your system. Thank you for your co-operation. >> ------------------------------------------------------------------------------ >> Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are >> powering Web 2.0 with engaging, cross-platform capabilities. Quickly and >> easily build your RIAs with Flex Builder, the Eclipse(TM)based development >> software that enables intelligent coding and step-through debugging. >> Download the free 60 day trial. >> http://p.sf.net/sfu/www-adobe-com_______________________________________________ >> Samtools-devel mailing list >> Sam...@li... >> https://lists.sourceforge.net/lists/listinfo/samtools-devel > -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. |