|
From: Hung Chih-M. <ym...@gm...> - 2012-12-21 16:09:55
|
Hi Dr. Li, Thank you for the feedback. For my data, I used BWA for mapping and GATK for SNP calling. What will be the best practice for applying psmc to my data? Should I just use BWA to produce a SAM and convert it to a BAM, and then use samtools to convert the BAM to fatsq for psmc? Thanks, Chih-Ming On Fri, Dec 21, 2012 at 9:31 PM, Heng Li <lh...@sa...> wrote: > Firstly, there is no way to convert a SNP-only VCF to consensus sequence > as you do not know which bases are callable. Secondly, even if you have an > all-site VCF, you cannot use GATK as it does not compute consensus quality > which is necessary for generating consensus sequence. > > Heng > > On Dec 21, 2012, at 1:07 AM, Hung Chih-Ming wrote: > > Hi > > From the website I understand the commend line below converts the input > BAM to a BCF format*, *then converts BCF to VCF, and finally to fastq. > $ samtools mpileup -uf ref.fa aln.bam | bcftools view -cg - | vcfutils.pl vcf2fq > cns.fq > > > I am wondering whether there is a way I can just input a VCF file > (produced by GATK) instead of a BAM file and convert it to a fastq file? > What I really want to do is based on SNP data to infer population history > using psmc (by Heng Li). Psmc requires fastq input, not cvf, files. So, I > am looking for a way to convert the vcf file (SNP calling data from GATK) > to fastq. > > I appreciate any comments, > > Thanks, > > Chih-Ming > > > ------------------------------------------------------------------------------ > LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial > Remotely access PCs and mobile devices and provide instant support > Improve your efficiency, and focus on delivering more value-add services > Discover what IT Professionals Know. Rescue delivers > > http://p.sf.net/sfu/logmein_12329d2d_______________________________________________ > Samtools-help mailing list > Sam...@li... > https://lists.sourceforge.net/lists/listinfo/samtools-help > > > > -- The Wellcome Trust Sanger Institute is operated by Genome Research > Limited, a charity registered in England with number 1021457 and a company > registered in England with number 2742969, whose registered office is 215 > Euston Road, London, NW1 2BE. > |