Hello, we are using Scalpel to call indels on DREAM dataset 4 as a test and parallelized it by chromosome. While jobs for majority of the chromosomes run fine, few give errors and do no produce an output VCF. I've tried rerunning the jobs multiple times and it's not the same set of chromosomes that produce errors. I was hoping to get some suggestions on figuring out what the issue is.
Localdateandtime:FriMar2309:08:032018Program:scalpel-discovery(micro-assemblyvariantdetection)Version:0.5.4(beta),January112018Contact:GiuseppeNarzisi<gnarzisi@nygenome.org>***MAINANALYSIS***--Printparameterstochr1/main/parameters.txt--Detectmutationsonnormal--Printparameterstochr1/main/normal/parameters.txtLoadingtargetsregion:1:1-249250621LoadinggenomefromFASTAfile...85sequences.AssemblyExonsstartassemblyof2492503regions.stepSize:249251**0started,pid:597491.[0..249250]**249251started,pid:597512.[249251..498501]**498502started,pid:597533.[498502..747752]**747753started,pid:597564.[747753..997003]**997004started,pid:597595.[997004..1246254]**1246255started,pid:597626.[1246255..1495505]**1495506started,pid:597657.[1495506..1744756]**1744757started,pid:597688.[1744757..1994007]**1994008started,pid:597709.[1994008..2243258]**2243259started,pid:5977210.[2243259..2492502]Commandfailure:deletefile(rmscalpel/chr1/main/normal/logs/regions-747753-997003.fa.log.gz)...Commandfailure:deletefile(rmscalpel/chr1/main/normal/logs/regions-249251-498501.fa.log.gz)...Commandfailure:deletefile(rmscalpel/chr1/main/normal/logs/regions-1495506-1744756.fa.log.gz)...Commandfailure:deletefile(rmscalpel/chr1/main/normal/regions-997004-1246254.fa)...Commandfailure:deletefile(rmscalpel/chr1/main/normal/logs/regions-1246255-1495505.fa.log.gz)...**249251justgotoutofthepool[PID:59751|exitcode:0|exitsignal:9|coredump:0]**747753justgotoutofthepool[PID:59756|exitcode:0|exitsignal:9|coredump:0]Commandfailure:deletefile(rmscalpel/chr1/main/normal/regions-1246255-1495505.fa)...Commandfailure:deletefile(rmscalpel/chr1/main/normal/logs/regions-1994008-2243258.fa.log.gz)...**997004justgotoutofthepool[PID:59759|exitcode:0|exitsignal:9|coredump:0]Commandfailure:deletefile(rmscalpel/chr1/main/normal/regions-2243259-2492502.fa)...Commandfailure:deletefile(rmscalpel/chr1/main/normal/regions-1994008-2243258.fa)...Commandfailure:deletefile(rmscalpel/chr1/main/normal/regions-0-249250.fa)...**1994008justgotoutofthepool[PID:59770|exitcode:24|exitsignal:0|coredump:0]**2243259justgotoutofthepool[PID:59772|exitcode:187|exitsignal:0|coredump:0]**0justgotoutofthepool[PID:59749|exitcode:0|exitsignal:0|coredump:0]**1246255justgotoutofthepool[PID:59762|exitcode:47|exitsignal:0|coredump:0]**1744757justgotoutofthepool[PID:59768|exitcode:117|exitsignal:0|coredump:0]**1495506justgotoutofthepool[PID:59765|exitcode:210|exitsignal:0|coredump:0]**498502justgotoutofthepool[PID:59753|exitcode:70|exitsignal:0|coredump:0]MergeDBs...Loadreferencecoverage...Argument""isn't numeric in numeric lt (<) at scalpel/scalpel-0.5.4/HashesIO.pm line 146, <FILE> line 96648269.Loadloc2keysDB...Can't open scalpel/chr1/main/normal/loc2keys.2.txt (No such file or directory)Commandfailure:findVariants(scalpel/scalpel-0.5.4/FindVariants.pl--bamsynthetic.challenge.set4.normal.bam--bed1:1-249250621--refhuman_g1k_v37_no_decoy.fasta--kmer25--mincov5--covthr5--lowcov2--covratio0.01--radius100--window600--maxregcov10000--step100--mapscore1--mismatches3--pathlimit1000000--dirscalpel/chr1/main/normal--sampleALL--numprocs10--coordsnull--formatvcf--seed0--cov2file)...Discovery:elapsedtime:0day(s)11hour(s)31minute(s)55second(s)
Thanks,
Prachi
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hello, we are using Scalpel to call indels on DREAM dataset 4 as a test and parallelized it by chromosome. While jobs for majority of the chromosomes run fine, few give errors and do no produce an output VCF. I've tried rerunning the jobs multiple times and it's not the same set of chromosomes that produce errors. I was hoping to get some suggestions on figuring out what the issue is.
Here's the command:
scalpel-discovery --somatic --normal synthetic.challenge.set4.normal.bam --tumor synthetic.challenge.set4.tumour.bam --ref human_g1k_v37_no_decoy.fasta --dir chr$SGE_TASK_ID --bed $region --window 600 --numprocs 10
where $region is defined to span each chromosome.
Here's a sample error log:
Thanks,
Prachi