From: Harry N. <ha...@li...> - 2013-06-27 10:32:36
I have written a Picard module for the picard.analysis package that parses a BAM file using the acceptRead method, extracts contigs, and builds scaffolds of adjacent contigs based on user-supplied criteria for the gap length between contigs. It prints out a GFF3 file of scaffold co-ordinates.

This is the same functionality as the samtools targetcut programme, except that it uses a different model for joining contigs into scaffolds, and users can supply the parameters of the model. It runs slower than targetcut and generates shorter scaffolds when using default parameters. However, it does have a few bells and whistles that are not available with targetcut. Users can supply a set of co-ordinates where they do not want scaffolds reported, e.g. within repeats. A lot of metadata is also generated: mean and N50 lengths of contigs, scaffolds and gaps; % of reference covered; mean coverage of contigs; and the distribution of contig lengths.

Is there any interest in incorporating it into the Picard tools project? If so, are there systematic tests that need to be run on it?

The jar and Java files for the class are available from our website at:
http://www.genomics.liv.ac.uk/tryps/HaploSeq.html

Cheers
Harry

Harry Noyes
Room 231
BioSciences Building
University of Liverpool
Crown Street
Liverpool L69 7ZB
0151 795 4512
www.genomics.liv.ac.uk/tryps
ha...@li...
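For readers unfamiliar with gap-based scaffolding, here is a minimal Java sketch of the general idea described in the message. It is not the module's actual code: the message does not spell out the joining model, so the sketch assumes the simplest possible rule, a single fixed maximum gap. The names ScaffoldSketch, Interval, buildScaffolds, n50 and maxGap, and the seqid "chr1", are all hypothetical. Contigs are assumed to lie on one reference sequence and to be sorted by start position, with 1-based inclusive co-ordinates as in GFF3.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ScaffoldSketch {

    /** A 1-based, inclusive interval on one reference sequence (GFF3 convention). */
    static final class Interval {
        final int start;
        final int end;

        Interval(int start, int end) {
            this.start = start;
            this.end = end;
        }

        int length() {
            return end - start + 1;
        }
    }

    /**
     * Joins position-sorted contigs into scaffolds whenever the gap to the
     * previous contig is at most maxGap bases. This fixed threshold is an
     * assumption made for illustration, not the model used by the module.
     */
    static List<Interval> buildScaffolds(List<Interval> contigs, int maxGap) {
        List<Interval> scaffolds = new ArrayList<>();
        Interval current = null;
        for (Interval contig : contigs) {
            if (current != null && contig.start - current.end - 1 <= maxGap) {
                // Close enough: extend the open scaffold to cover this contig.
                current = new Interval(current.start, Math.max(current.end, contig.end));
            } else {
                // Gap too large (or first contig): close the old scaffold, open a new one.
                if (current != null) {
                    scaffolds.add(current);
                }
                current = contig;
            }
        }
        if (current != null) {
            scaffolds.add(current);
        }
        return scaffolds;
    }

    /**
     * N50: the largest length L such that intervals of length >= L together
     * account for at least half of the summed interval length.
     */
    static int n50(List<Interval> intervals) {
        int[] lengths = intervals.stream().mapToInt(Interval::length).sorted().toArray();
        long total = 0;
        for (int len : lengths) {
            total += len;
        }
        long running = 0;
        for (int i = lengths.length - 1; i >= 0; i--) {
            running += lengths[i];
            if (2 * running >= total) {
                return lengths[i];
            }
        }
        return 0; // empty input
    }

    public static void main(String[] args) {
        List<Interval> contigs = Arrays.asList(
                new Interval(100, 199), new Interval(250, 399), new Interval(1000, 1099));
        // Print one tab-separated, nine-column GFF3-style line per scaffold.
        for (Interval s : buildScaffolds(contigs, 100)) {
            System.out.println("chr1\tScaffoldSketch\tregion\t" + s.start + "\t" + s.end
                    + "\t.\t.\t.\t.");
        }
    }
}
```

With a maximum gap of 100, the first two example contigs (50 bases apart) are joined into one scaffold, while the third (600 bases further on) starts a new one. A real implementation would also need to handle multiple reference sequences and the excluded regions mentioned in the message.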