Menu

Variant calling parallelization

Anonymous
2022-04-28
2022-05-25
  • Anonymous

    Anonymous - 2022-04-28

    Hi NGSEP devs,
    I was wondering if the multiple variant caller can be parallelized? I found the option (choosing the number of threads) in the alignement method, but not in the variant caller.
    Best regards

     
  • Jorge Duitama

    Jorge Duitama - 2022-04-28

    Hi

    Thanks for your interest in NGSEP. Unfortunately we are still working on the paralellization of the variant caller. We hope to make progress in this direction in the next few months.

    Best regards

    Jorge

     
  • Anonymous

    Anonymous - 2022-04-29

    Thanks Jorge.

    I've seen on the web that one can at least treat chromosomes in parallel, for instance, using gnu parallel:

    parallel 'bcftools mpileup -Ou -f ref.fa -r {} in.bam | bcftools call -m -v -Oz -o {}.vcf.gz' ::: {1..22} X Y

    (from https://github.com/samtools/bcftools/issues/949)
    Is it the way you are thinking in?

    Best regards

     
  • Jorge Duitama

    Jorge Duitama - 2022-04-29

    Hi

    Now that you mention it, parallel can be a way to run in parallel chromosomes. The command in our case would be something like this:

    parallel 'java -Xmx8g -jar /path/to/NGSEPcore_4.2.0.jar MultisampleVariantsDetector -r ref.fa -maxBaseQS 30 -minQuality 40 -querySeq {} -o {}.vcf in.bam' ::: {1..22} X Y

    We plan to implement parallelization within the software to distribute the genome by regions instead of chromosomes, which should work better if you have more than 24 threads. By now, parallel can be a good solution.

    Thanks

     
  • Anonymous

    Anonymous - 2022-05-25

    Thanks Jorge. I will try with the command you suggested. Have a nice day.

     
    👍
    1

Anonymous
Anonymous

Add attachments
Cancel





Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.