Hi there,
Here's a way to speed up Scalpel, especially when there are fewer regions to consider. This is by way of samtools and an indexed genome. Should also lower memory consumption.
In FindVariants.pl remove lines:
:::perl
Add these lines to replace the above my $seq
:::perl
samtools faidx $REF $chr:$left-$right
, 2);(this would be easier on Github with pull requests!)
Thank you for your feedback!
Yes, this edits to the code will produce a speed up when working on very few regions. However, this might not be advisable for a very large number of regions (~millions).
Also it requires "samtools" to be installed and available at command line by all the users.
I might add this patch in the future as an optional feature/parameter...
Thanks Giuseppe,
Bcbio-nextgen parallelises variant calling by splitting the bam files and bed regions and then only submits small bits and pieces to the individual callers (multiple times in multiple threads) so I'll keep this edit in my fork for now.