You can subscribe to this list here.
2012 |
Jan
(1) |
Feb
(2) |
Mar
|
Apr
(29) |
May
(8) |
Jun
(5) |
Jul
(46) |
Aug
(16) |
Sep
(5) |
Oct
(6) |
Nov
(17) |
Dec
(7) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2013 |
Jan
(5) |
Feb
(2) |
Mar
(10) |
Apr
(13) |
May
(20) |
Jun
(7) |
Jul
(6) |
Aug
(14) |
Sep
(9) |
Oct
(19) |
Nov
(17) |
Dec
(3) |
2014 |
Jan
(3) |
Feb
|
Mar
(7) |
Apr
(1) |
May
(1) |
Jun
(30) |
Jul
(10) |
Aug
(2) |
Sep
(18) |
Oct
(3) |
Nov
(4) |
Dec
(13) |
2015 |
Jan
(27) |
Feb
|
Mar
(19) |
Apr
(12) |
May
(10) |
Jun
(18) |
Jul
(4) |
Aug
(2) |
Sep
(2) |
Oct
|
Nov
(1) |
Dec
(9) |
2016 |
Jan
(6) |
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
(1) |
Aug
(1) |
Sep
(1) |
Oct
|
Nov
|
Dec
|
From: Walenz, B. <bw...@jc...> - 2012-04-11 18:54:23
|
Hi, Ole- Yes, I overlooked a step. In the contig you insert to the latest version, update the 'data.contig_status' with what the second to last version has. FYI, the tigStore should have versions such as: seqDB.v014.ctg seqDB.v014.dat seqDB.v014.utg seqDB.v015.ctg seqDB.v015.p001.ctg seqDB.v015.p001.dat (etc) seqDB.v015.utg seqDB.v016.ctg seqDB.v016.p001.ctg seqDB.v016.p001.dat (etc) seqDB.v016.utg (the v numbers will of course be different in your assembly) v015 contains the output of scaffolder, which is the input to consensus. Contigs here have no consensus sequence, but otherwise all the data is present. It is largely just rewriting the data from v014 into partitions (p###), so each consensus job can load a single file instead of randomly accessing a large file. The status flag on each unitig/contig is also set. This flag tells if the unitig/contig was placed in a scaffold, is a surrogate, degenerate, etc. v016 is the output of consensus, the input to terminator. All terminator does is to repackage this into ASCII files. To summarize: grab the contig from v014 (the last with a consensus sequence), the status flag from v015, change the status flag in the contig you grabbed, and then insert the contig into v016. by doing this, you'll lose VAR records for this contig, but otherwise the consensus sequence is the same (or largely the same; variant detection can change it a bit). b On 4/11/12 6:23 AM, "Ole Kristian Tørresen" <o.k...@bi...> wrote: > Hi Brian, > ctgcns completed now, but I got an error with asmOutputFasta. From > 9-terminator/asmOutputFasta.err: > ERROR: Illegal unitigpos type type value 'X' (CCO) at line 1676575956 > > Is this connected with the procedure I did with inserting the contig > from an older tigStore? > > Thank you for your help so far. > > Ole > > On 11 April 2012 08:13, Ole Kristian Tørresen <o.k...@bi...> wrote: >> Hi Brian. >> >> I've done this, and rerunning ctgcns on that last partition. I'll send >> the layout and log in a separate email. >> >> Ole >> >> On 10 April 2012 21:37, Walenz, Brian <bw...@jc...> wrote: >>> Hi Ole- >>> >>> I don't see anything that looks like an error in the log, so I'll have to >>> assume it crashed. You report it runs for 20 hours, which is odd for contig >>> consensus, unless that contig is very very deep. If so, the ctgcns process >>> will also be large. Do you know how big the process was? >>> >>> Can you make the full log available? >>> >>> It is possible to force the contig to have a consensus sequence. If the job >>> did crash, the other contigs will still need to have consensus generated. >>> >>> The process is the same as editing a unitig in the tigStore: dump the contig >>> in question, edit the file to have a consensus sequence, then load that >>> contig back into the tigStore. A consensus sequence for this contig can be >>> found in one of the earlier tigStore versions; the version just before this >>> one will probably have it. That makes our process even easier: dump the >>> version with a consensus sequence, and load it back into the latest version. >>> >>> A sketch of the steps: >>> >>> 1) Dump the previous version of the contig. check that 'file' does contain >>> a consensus sequence. >>> >>> tigStore -g *gkpStore -t *tigStore <vers-1> -c <ctgID> -d layout > file >>> >>> 2) Load that pervious version into the tigStore as the latest version >>> >>> tigStore -g *gkpStore -t *tigStore <vers> <part> -c <ctgID> -R file >>> >>> Notice that this tigStore command specifies both a version and a partition >>> for the tigStore. >>> >>> 3) Rerun consensus.sh on that partition. It will not attempt to compute the >>> consensus for that contig. >>> >>> I'd be interested in seeing the contig you dump, if only to verify that it >>> is deep. >>> >>> b >>> >>> >>> >>> On 4/10/12 4:05 AM, "Ole Kristian Tørresen" <o.k...@bi...> wrote: >>> >>>> Hi, >>>> I'm having some problems while doing some low coverage sequencing >>>> assembly testing. I've tried to assemble about 10x coverage of 150 nt >>>> paired Illumina reads of 500 bp fragment size. These are from the >>>> parrot used in the Assemblathon 2 >>>> (http://assemblathon.org/pages/download-data). Everything seems to run >>>> fine, until contig consensus, where 1 partition just don't succeed. It >>>> seems to run for quite some time (20 hours or something) before >>>> failing. These are the last 20 lines from the output of the ctgcns >>>> partition that fails: >>>> Alignment params: 297 333 200 200 0 0.12 1e-06 30 1 >>>> -- e/l = 7/112 = 6.25% >>>> A -----+------+----> [] >>>> B 332 -------> 40 [] >>>> GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 >>>> lScore=0.026087 (112 vs 115) aScore=0.160000 (332 vs 316) >>>> bScore=0.150000 (-42 vs -27). (CONTIGF) >>>> GetAlignmentTrace()-- Overlap found between 1076523 (U) and 25763657 >>>> (R) expected hangs: a=316 b=-27 erate=0.060000 aligner=Local_Overlap >>>> GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 >>>> lScore=0.026087 (112 vs 115) aScore=0.160000 (332 vs 316) >>>> bScore=0.150000 (-42 vs -27). (CONTIGF) >>>> Local_Overlap_AS_forCNS found overlap between 1076523 (U) and 25763657 >>>> (R) ahang: 332, bhang: -42 (expected hang was 316) >>>> Alignment params: 298 334 200 200 0 0.12 1e-06 30 1 >>>> -- e/l = 6/112 = 5.36% >>>> A -----+------+----> [] >>>> B 332 -------> 42 [] >>>> GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 >>>> lScore=0.009009 (110 vs 111) aScore=0.140000 (332 vs 318) >>>> bScore=0.130000 (-42 vs -29). (CONTIGF) >>>> GetAlignmentTrace()-- Overlap found between 1076523 (U) and 57537697 >>>> (R) expected hangs: a=318 b=-29 erate=0.060000 aligner=Local_Overlap >>>> GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 >>>> lScore=0.009009 (110 vs 111) aScore=0.140000 (332 vs 318) >>>> bScore=0.130000 (-42 vs -29). (CONTIGF) >>>> Local_Overlap_AS_forCNS found overlap between 1076523 (U) and 57537697 >>>> (R) ahang: 332, bhang: -42 (expected hang was 318) >>>> Alignment params: 300 336 200 200 0 0.12 1e-06 30 1 >>>> -- e/l = 6/110 = 5.45% >>>> A -----+------+----> [] >>>> B 332 -------> 42 [] >>>> >>>> This is the error message: >>>> at /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 1237 >>>> main::caFailure('1 consensusAfterScaffolder jobs failed; remove >>>> 8-consensus/co...', undef) called at >>>> /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 5142 >>>> main::postScaffolderConsensus() called at >>>> /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 5885 >>>> >>>> ---------------------------------------- >>>> Failure message: >>>> >>>> 1 consensusAfterScaffolder jobs failed; remove >>>> 8-consensus/consensus.sh to try again >>>> >>>> I've tried removing consensus.sh and running again, but get the same error. >>>> >>>> This is the spec file: >>>> utgErrorRate=0.03 >>>> utgErrorLimit=2.5 >>>> ovlErrorRate=0.06 >>>> cnsErrorRate=0.06 >>>> cgwErrorRate=0.10 >>>> merSize = 22 >>>> overlapper=ovl >>>> unitigger = bogart >>>> merylMemory = 128000 >>>> merylThreads = 16 >>>> merOverlapperThreads = 2 >>>> merOverlapperExtendConcurrency = 8 >>>> merOverlapperSeedConcurrency = 8 >>>> ovlThreads = 2 >>>> mbtThreads = 2 >>>> mbtConcurrency = 8 >>>> ovlConcurrency = 8 >>>> ovlCorrConcurrency = 16 >>>> ovlRefBlockSize = 32000000 >>>> ovlHashBits = 24 >>>> ovlHashBlockLength = 800000000 >>>> ovlStoreMemory = 128000 >>>> frgCorrThreads = 2 >>>> frgCorrConcurrency = 8 >>>> ovlCorrBatchSize = 1000000 >>>> ovlCorrConcurrency = 16 >>>> cnsConcurrency = 16 >>>> doExtendClearRanges = 0 >>>> >>>> I don't need to have that unitig (1076523 (U)) in my finished >>>> assembly, so it's possible to just remove it as long as I get a >>>> finished assembly. I've also tried to just create the .success file, >>>> but then terminator fails. >>>> >>>> Does anyone have any ideas of what I might do different? Can I just >>>> remove that unitig and proceed? How do I do that? >>>> >>>> Sincerely, >>>> Ole Kristian Tørresen >>>> PhD student >>>> University of Oslo >>>> >>>> --------------------------------------------------------------------------- >>>> --- >>>> Better than sec? Nothing is better than sec when it comes to >>>> monitoring Big Data applications. Try Boundary one-second >>>> resolution app monitoring today. Free. >>>> http://p.sf.net/sfu/Boundary-dev2dev >>>> _______________________________________________ >>>> wgs-assembler-users mailing list >>>> wgs...@li... >>>> https://lists.sourceforge.net/lists/listinfo/wgs-assembler-users >>> |
From: Ole K. T. <o.k...@bi...> - 2012-04-11 10:23:38
|
Hi Brian, ctgcns completed now, but I got an error with asmOutputFasta. From 9-terminator/asmOutputFasta.err: ERROR: Illegal unitigpos type type value 'X' (CCO) at line 1676575956 Is this connected with the procedure I did with inserting the contig from an older tigStore? Thank you for your help so far. Ole On 11 April 2012 08:13, Ole Kristian Tørresen <o.k...@bi...> wrote: > Hi Brian. > > I've done this, and rerunning ctgcns on that last partition. I'll send > the layout and log in a separate email. > > Ole > > On 10 April 2012 21:37, Walenz, Brian <bw...@jc...> wrote: >> Hi Ole- >> >> I don't see anything that looks like an error in the log, so I'll have to >> assume it crashed. You report it runs for 20 hours, which is odd for contig >> consensus, unless that contig is very very deep. If so, the ctgcns process >> will also be large. Do you know how big the process was? >> >> Can you make the full log available? >> >> It is possible to force the contig to have a consensus sequence. If the job >> did crash, the other contigs will still need to have consensus generated. >> >> The process is the same as editing a unitig in the tigStore: dump the contig >> in question, edit the file to have a consensus sequence, then load that >> contig back into the tigStore. A consensus sequence for this contig can be >> found in one of the earlier tigStore versions; the version just before this >> one will probably have it. That makes our process even easier: dump the >> version with a consensus sequence, and load it back into the latest version. >> >> A sketch of the steps: >> >> 1) Dump the previous version of the contig. check that 'file' does contain >> a consensus sequence. >> >> tigStore -g *gkpStore -t *tigStore <vers-1> -c <ctgID> -d layout > file >> >> 2) Load that pervious version into the tigStore as the latest version >> >> tigStore -g *gkpStore -t *tigStore <vers> <part> -c <ctgID> -R file >> >> Notice that this tigStore command specifies both a version and a partition >> for the tigStore. >> >> 3) Rerun consensus.sh on that partition. It will not attempt to compute the >> consensus for that contig. >> >> I'd be interested in seeing the contig you dump, if only to verify that it >> is deep. >> >> b >> >> >> >> On 4/10/12 4:05 AM, "Ole Kristian Tørresen" <o.k...@bi...> wrote: >> >>> Hi, >>> I'm having some problems while doing some low coverage sequencing >>> assembly testing. I've tried to assemble about 10x coverage of 150 nt >>> paired Illumina reads of 500 bp fragment size. These are from the >>> parrot used in the Assemblathon 2 >>> (http://assemblathon.org/pages/download-data). Everything seems to run >>> fine, until contig consensus, where 1 partition just don't succeed. It >>> seems to run for quite some time (20 hours or something) before >>> failing. These are the last 20 lines from the output of the ctgcns >>> partition that fails: >>> Alignment params: 297 333 200 200 0 0.12 1e-06 30 1 >>> -- e/l = 7/112 = 6.25% >>> A -----+------+----> [] >>> B 332 -------> 40 [] >>> GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 >>> lScore=0.026087 (112 vs 115) aScore=0.160000 (332 vs 316) >>> bScore=0.150000 (-42 vs -27). (CONTIGF) >>> GetAlignmentTrace()-- Overlap found between 1076523 (U) and 25763657 >>> (R) expected hangs: a=316 b=-27 erate=0.060000 aligner=Local_Overlap >>> GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 >>> lScore=0.026087 (112 vs 115) aScore=0.160000 (332 vs 316) >>> bScore=0.150000 (-42 vs -27). (CONTIGF) >>> Local_Overlap_AS_forCNS found overlap between 1076523 (U) and 25763657 >>> (R) ahang: 332, bhang: -42 (expected hang was 316) >>> Alignment params: 298 334 200 200 0 0.12 1e-06 30 1 >>> -- e/l = 6/112 = 5.36% >>> A -----+------+----> [] >>> B 332 -------> 42 [] >>> GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 >>> lScore=0.009009 (110 vs 111) aScore=0.140000 (332 vs 318) >>> bScore=0.130000 (-42 vs -29). (CONTIGF) >>> GetAlignmentTrace()-- Overlap found between 1076523 (U) and 57537697 >>> (R) expected hangs: a=318 b=-29 erate=0.060000 aligner=Local_Overlap >>> GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 >>> lScore=0.009009 (110 vs 111) aScore=0.140000 (332 vs 318) >>> bScore=0.130000 (-42 vs -29). (CONTIGF) >>> Local_Overlap_AS_forCNS found overlap between 1076523 (U) and 57537697 >>> (R) ahang: 332, bhang: -42 (expected hang was 318) >>> Alignment params: 300 336 200 200 0 0.12 1e-06 30 1 >>> -- e/l = 6/110 = 5.45% >>> A -----+------+----> [] >>> B 332 -------> 42 [] >>> >>> This is the error message: >>> at /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 1237 >>> main::caFailure('1 consensusAfterScaffolder jobs failed; remove >>> 8-consensus/co...', undef) called at >>> /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 5142 >>> main::postScaffolderConsensus() called at >>> /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 5885 >>> >>> ---------------------------------------- >>> Failure message: >>> >>> 1 consensusAfterScaffolder jobs failed; remove >>> 8-consensus/consensus.sh to try again >>> >>> I've tried removing consensus.sh and running again, but get the same error. >>> >>> This is the spec file: >>> utgErrorRate=0.03 >>> utgErrorLimit=2.5 >>> ovlErrorRate=0.06 >>> cnsErrorRate=0.06 >>> cgwErrorRate=0.10 >>> merSize = 22 >>> overlapper=ovl >>> unitigger = bogart >>> merylMemory = 128000 >>> merylThreads = 16 >>> merOverlapperThreads = 2 >>> merOverlapperExtendConcurrency = 8 >>> merOverlapperSeedConcurrency = 8 >>> ovlThreads = 2 >>> mbtThreads = 2 >>> mbtConcurrency = 8 >>> ovlConcurrency = 8 >>> ovlCorrConcurrency = 16 >>> ovlRefBlockSize = 32000000 >>> ovlHashBits = 24 >>> ovlHashBlockLength = 800000000 >>> ovlStoreMemory = 128000 >>> frgCorrThreads = 2 >>> frgCorrConcurrency = 8 >>> ovlCorrBatchSize = 1000000 >>> ovlCorrConcurrency = 16 >>> cnsConcurrency = 16 >>> doExtendClearRanges = 0 >>> >>> I don't need to have that unitig (1076523 (U)) in my finished >>> assembly, so it's possible to just remove it as long as I get a >>> finished assembly. I've also tried to just create the .success file, >>> but then terminator fails. >>> >>> Does anyone have any ideas of what I might do different? Can I just >>> remove that unitig and proceed? How do I do that? >>> >>> Sincerely, >>> Ole Kristian Tørresen >>> PhD student >>> University of Oslo >>> >>> ------------------------------------------------------------------------------ >>> Better than sec? Nothing is better than sec when it comes to >>> monitoring Big Data applications. Try Boundary one-second >>> resolution app monitoring today. Free. >>> http://p.sf.net/sfu/Boundary-dev2dev >>> _______________________________________________ >>> wgs-assembler-users mailing list >>> wgs...@li... >>> https://lists.sourceforge.net/lists/listinfo/wgs-assembler-users >> |
From: Ole K. T. <o.k...@bi...> - 2012-04-11 06:13:46
|
Hi Brian. I've done this, and rerunning ctgcns on that last partition. I'll send the layout and log in a separate email. Ole On 10 April 2012 21:37, Walenz, Brian <bw...@jc...> wrote: > Hi Ole- > > I don't see anything that looks like an error in the log, so I'll have to > assume it crashed. You report it runs for 20 hours, which is odd for contig > consensus, unless that contig is very very deep. If so, the ctgcns process > will also be large. Do you know how big the process was? > > Can you make the full log available? > > It is possible to force the contig to have a consensus sequence. If the job > did crash, the other contigs will still need to have consensus generated. > > The process is the same as editing a unitig in the tigStore: dump the contig > in question, edit the file to have a consensus sequence, then load that > contig back into the tigStore. A consensus sequence for this contig can be > found in one of the earlier tigStore versions; the version just before this > one will probably have it. That makes our process even easier: dump the > version with a consensus sequence, and load it back into the latest version. > > A sketch of the steps: > > 1) Dump the previous version of the contig. check that 'file' does contain > a consensus sequence. > > tigStore -g *gkpStore -t *tigStore <vers-1> -c <ctgID> -d layout > file > > 2) Load that pervious version into the tigStore as the latest version > > tigStore -g *gkpStore -t *tigStore <vers> <part> -c <ctgID> -R file > > Notice that this tigStore command specifies both a version and a partition > for the tigStore. > > 3) Rerun consensus.sh on that partition. It will not attempt to compute the > consensus for that contig. > > I'd be interested in seeing the contig you dump, if only to verify that it > is deep. > > b > > > > On 4/10/12 4:05 AM, "Ole Kristian Tørresen" <o.k...@bi...> wrote: > >> Hi, >> I'm having some problems while doing some low coverage sequencing >> assembly testing. I've tried to assemble about 10x coverage of 150 nt >> paired Illumina reads of 500 bp fragment size. These are from the >> parrot used in the Assemblathon 2 >> (http://assemblathon.org/pages/download-data). Everything seems to run >> fine, until contig consensus, where 1 partition just don't succeed. It >> seems to run for quite some time (20 hours or something) before >> failing. These are the last 20 lines from the output of the ctgcns >> partition that fails: >> Alignment params: 297 333 200 200 0 0.12 1e-06 30 1 >> -- e/l = 7/112 = 6.25% >> A -----+------+----> [] >> B 332 -------> 40 [] >> GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 >> lScore=0.026087 (112 vs 115) aScore=0.160000 (332 vs 316) >> bScore=0.150000 (-42 vs -27). (CONTIGF) >> GetAlignmentTrace()-- Overlap found between 1076523 (U) and 25763657 >> (R) expected hangs: a=316 b=-27 erate=0.060000 aligner=Local_Overlap >> GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 >> lScore=0.026087 (112 vs 115) aScore=0.160000 (332 vs 316) >> bScore=0.150000 (-42 vs -27). (CONTIGF) >> Local_Overlap_AS_forCNS found overlap between 1076523 (U) and 25763657 >> (R) ahang: 332, bhang: -42 (expected hang was 316) >> Alignment params: 298 334 200 200 0 0.12 1e-06 30 1 >> -- e/l = 6/112 = 5.36% >> A -----+------+----> [] >> B 332 -------> 42 [] >> GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 >> lScore=0.009009 (110 vs 111) aScore=0.140000 (332 vs 318) >> bScore=0.130000 (-42 vs -29). (CONTIGF) >> GetAlignmentTrace()-- Overlap found between 1076523 (U) and 57537697 >> (R) expected hangs: a=318 b=-29 erate=0.060000 aligner=Local_Overlap >> GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 >> lScore=0.009009 (110 vs 111) aScore=0.140000 (332 vs 318) >> bScore=0.130000 (-42 vs -29). (CONTIGF) >> Local_Overlap_AS_forCNS found overlap between 1076523 (U) and 57537697 >> (R) ahang: 332, bhang: -42 (expected hang was 318) >> Alignment params: 300 336 200 200 0 0.12 1e-06 30 1 >> -- e/l = 6/110 = 5.45% >> A -----+------+----> [] >> B 332 -------> 42 [] >> >> This is the error message: >> at /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 1237 >> main::caFailure('1 consensusAfterScaffolder jobs failed; remove >> 8-consensus/co...', undef) called at >> /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 5142 >> main::postScaffolderConsensus() called at >> /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 5885 >> >> ---------------------------------------- >> Failure message: >> >> 1 consensusAfterScaffolder jobs failed; remove >> 8-consensus/consensus.sh to try again >> >> I've tried removing consensus.sh and running again, but get the same error. >> >> This is the spec file: >> utgErrorRate=0.03 >> utgErrorLimit=2.5 >> ovlErrorRate=0.06 >> cnsErrorRate=0.06 >> cgwErrorRate=0.10 >> merSize = 22 >> overlapper=ovl >> unitigger = bogart >> merylMemory = 128000 >> merylThreads = 16 >> merOverlapperThreads = 2 >> merOverlapperExtendConcurrency = 8 >> merOverlapperSeedConcurrency = 8 >> ovlThreads = 2 >> mbtThreads = 2 >> mbtConcurrency = 8 >> ovlConcurrency = 8 >> ovlCorrConcurrency = 16 >> ovlRefBlockSize = 32000000 >> ovlHashBits = 24 >> ovlHashBlockLength = 800000000 >> ovlStoreMemory = 128000 >> frgCorrThreads = 2 >> frgCorrConcurrency = 8 >> ovlCorrBatchSize = 1000000 >> ovlCorrConcurrency = 16 >> cnsConcurrency = 16 >> doExtendClearRanges = 0 >> >> I don't need to have that unitig (1076523 (U)) in my finished >> assembly, so it's possible to just remove it as long as I get a >> finished assembly. I've also tried to just create the .success file, >> but then terminator fails. >> >> Does anyone have any ideas of what I might do different? Can I just >> remove that unitig and proceed? How do I do that? >> >> Sincerely, >> Ole Kristian Tørresen >> PhD student >> University of Oslo >> >> ------------------------------------------------------------------------------ >> Better than sec? Nothing is better than sec when it comes to >> monitoring Big Data applications. Try Boundary one-second >> resolution app monitoring today. Free. >> http://p.sf.net/sfu/Boundary-dev2dev >> _______________________________________________ >> wgs-assembler-users mailing list >> wgs...@li... >> https://lists.sourceforge.net/lists/listinfo/wgs-assembler-users > |
From: Walenz, B. <bw...@jc...> - 2012-04-10 19:38:18
|
Hi Ole- I don't see anything that looks like an error in the log, so I'll have to assume it crashed. You report it runs for 20 hours, which is odd for contig consensus, unless that contig is very very deep. If so, the ctgcns process will also be large. Do you know how big the process was? Can you make the full log available? It is possible to force the contig to have a consensus sequence. If the job did crash, the other contigs will still need to have consensus generated. The process is the same as editing a unitig in the tigStore: dump the contig in question, edit the file to have a consensus sequence, then load that contig back into the tigStore. A consensus sequence for this contig can be found in one of the earlier tigStore versions; the version just before this one will probably have it. That makes our process even easier: dump the version with a consensus sequence, and load it back into the latest version. A sketch of the steps: 1) Dump the previous version of the contig. check that 'file' does contain a consensus sequence. tigStore -g *gkpStore -t *tigStore <vers-1> -c <ctgID> -d layout > file 2) Load that pervious version into the tigStore as the latest version tigStore -g *gkpStore -t *tigStore <vers> <part> -c <ctgID> -R file Notice that this tigStore command specifies both a version and a partition for the tigStore. 3) Rerun consensus.sh on that partition. It will not attempt to compute the consensus for that contig. I'd be interested in seeing the contig you dump, if only to verify that it is deep. b On 4/10/12 4:05 AM, "Ole Kristian Tørresen" <o.k...@bi...> wrote: > Hi, > I'm having some problems while doing some low coverage sequencing > assembly testing. I've tried to assemble about 10x coverage of 150 nt > paired Illumina reads of 500 bp fragment size. These are from the > parrot used in the Assemblathon 2 > (http://assemblathon.org/pages/download-data). Everything seems to run > fine, until contig consensus, where 1 partition just don't succeed. It > seems to run for quite some time (20 hours or something) before > failing. These are the last 20 lines from the output of the ctgcns > partition that fails: > Alignment params: 297 333 200 200 0 0.12 1e-06 30 1 > -- e/l = 7/112 = 6.25% > A -----+------+----> [] > B 332 -------> 40 [] > GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 > lScore=0.026087 (112 vs 115) aScore=0.160000 (332 vs 316) > bScore=0.150000 (-42 vs -27). (CONTIGF) > GetAlignmentTrace()-- Overlap found between 1076523 (U) and 25763657 > (R) expected hangs: a=316 b=-27 erate=0.060000 aligner=Local_Overlap > GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 > lScore=0.026087 (112 vs 115) aScore=0.160000 (332 vs 316) > bScore=0.150000 (-42 vs -27). (CONTIGF) > Local_Overlap_AS_forCNS found overlap between 1076523 (U) and 25763657 > (R) ahang: 332, bhang: -42 (expected hang was 316) > Alignment params: 298 334 200 200 0 0.12 1e-06 30 1 > -- e/l = 6/112 = 5.36% > A -----+------+----> [] > B 332 -------> 42 [] > GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 > lScore=0.009009 (110 vs 111) aScore=0.140000 (332 vs 318) > bScore=0.130000 (-42 vs -29). (CONTIGF) > GetAlignmentTrace()-- Overlap found between 1076523 (U) and 57537697 > (R) expected hangs: a=318 b=-29 erate=0.060000 aligner=Local_Overlap > GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 > lScore=0.009009 (110 vs 111) aScore=0.140000 (332 vs 318) > bScore=0.130000 (-42 vs -29). (CONTIGF) > Local_Overlap_AS_forCNS found overlap between 1076523 (U) and 57537697 > (R) ahang: 332, bhang: -42 (expected hang was 318) > Alignment params: 300 336 200 200 0 0.12 1e-06 30 1 > -- e/l = 6/110 = 5.45% > A -----+------+----> [] > B 332 -------> 42 [] > > This is the error message: > at /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 1237 > main::caFailure('1 consensusAfterScaffolder jobs failed; remove > 8-consensus/co...', undef) called at > /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 5142 > main::postScaffolderConsensus() called at > /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 5885 > > ---------------------------------------- > Failure message: > > 1 consensusAfterScaffolder jobs failed; remove > 8-consensus/consensus.sh to try again > > I've tried removing consensus.sh and running again, but get the same error. > > This is the spec file: > utgErrorRate=0.03 > utgErrorLimit=2.5 > ovlErrorRate=0.06 > cnsErrorRate=0.06 > cgwErrorRate=0.10 > merSize = 22 > overlapper=ovl > unitigger = bogart > merylMemory = 128000 > merylThreads = 16 > merOverlapperThreads = 2 > merOverlapperExtendConcurrency = 8 > merOverlapperSeedConcurrency = 8 > ovlThreads = 2 > mbtThreads = 2 > mbtConcurrency = 8 > ovlConcurrency = 8 > ovlCorrConcurrency = 16 > ovlRefBlockSize = 32000000 > ovlHashBits = 24 > ovlHashBlockLength = 800000000 > ovlStoreMemory = 128000 > frgCorrThreads = 2 > frgCorrConcurrency = 8 > ovlCorrBatchSize = 1000000 > ovlCorrConcurrency = 16 > cnsConcurrency = 16 > doExtendClearRanges = 0 > > I don't need to have that unitig (1076523 (U)) in my finished > assembly, so it's possible to just remove it as long as I get a > finished assembly. I've also tried to just create the .success file, > but then terminator fails. > > Does anyone have any ideas of what I might do different? Can I just > remove that unitig and proceed? How do I do that? > > Sincerely, > Ole Kristian Tørresen > PhD student > University of Oslo > > ------------------------------------------------------------------------------ > Better than sec? Nothing is better than sec when it comes to > monitoring Big Data applications. Try Boundary one-second > resolution app monitoring today. Free. > http://p.sf.net/sfu/Boundary-dev2dev > _______________________________________________ > wgs-assembler-users mailing list > wgs...@li... > https://lists.sourceforge.net/lists/listinfo/wgs-assembler-users |
From: Ole K. T. <o.k...@bi...> - 2012-04-10 08:05:38
|
Hi, I'm having some problems while doing some low coverage sequencing assembly testing. I've tried to assemble about 10x coverage of 150 nt paired Illumina reads of 500 bp fragment size. These are from the parrot used in the Assemblathon 2 (http://assemblathon.org/pages/download-data). Everything seems to run fine, until contig consensus, where 1 partition just don't succeed. It seems to run for quite some time (20 hours or something) before failing. These are the last 20 lines from the output of the ctgcns partition that fails: Alignment params: 297 333 200 200 0 0.12 1e-06 30 1 -- e/l = 7/112 = 6.25% A -----+------+----> [] B 332 -------> 40 [] GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 lScore=0.026087 (112 vs 115) aScore=0.160000 (332 vs 316) bScore=0.150000 (-42 vs -27). (CONTIGF) GetAlignmentTrace()-- Overlap found between 1076523 (U) and 25763657 (R) expected hangs: a=316 b=-27 erate=0.060000 aligner=Local_Overlap GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 lScore=0.026087 (112 vs 115) aScore=0.160000 (332 vs 316) bScore=0.150000 (-42 vs -27). (CONTIGF) Local_Overlap_AS_forCNS found overlap between 1076523 (U) and 25763657 (R) ahang: 332, bhang: -42 (expected hang was 316) Alignment params: 298 334 200 200 0 0.12 1e-06 30 1 -- e/l = 6/112 = 5.36% A -----+------+----> [] B 332 -------> 42 [] GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 lScore=0.009009 (110 vs 111) aScore=0.140000 (332 vs 318) bScore=0.130000 (-42 vs -29). (CONTIGF) GetAlignmentTrace()-- Overlap found between 1076523 (U) and 57537697 (R) expected hangs: a=318 b=-29 erate=0.060000 aligner=Local_Overlap GetAlignmentTrace()-- Overlap ACCEPTED! accept=1000.000000 lScore=0.009009 (110 vs 111) aScore=0.140000 (332 vs 318) bScore=0.130000 (-42 vs -29). (CONTIGF) Local_Overlap_AS_forCNS found overlap between 1076523 (U) and 57537697 (R) ahang: 332, bhang: -42 (expected hang was 318) Alignment params: 300 336 200 200 0 0.12 1e-06 30 1 -- e/l = 6/110 = 5.45% A -----+------+----> [] B 332 -------> 42 [] This is the error message: at /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 1237 main::caFailure('1 consensusAfterScaffolder jobs failed; remove 8-consensus/co...', undef) called at /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 5142 main::postScaffolderConsensus() called at /usit/titan/u1/olekto/src/wgs-7.0/Linux-amd64/bin/runCA line 5885 ---------------------------------------- Failure message: 1 consensusAfterScaffolder jobs failed; remove 8-consensus/consensus.sh to try again I've tried removing consensus.sh and running again, but get the same error. This is the spec file: utgErrorRate=0.03 utgErrorLimit=2.5 ovlErrorRate=0.06 cnsErrorRate=0.06 cgwErrorRate=0.10 merSize = 22 overlapper=ovl unitigger = bogart merylMemory = 128000 merylThreads = 16 merOverlapperThreads = 2 merOverlapperExtendConcurrency = 8 merOverlapperSeedConcurrency = 8 ovlThreads = 2 mbtThreads = 2 mbtConcurrency = 8 ovlConcurrency = 8 ovlCorrConcurrency = 16 ovlRefBlockSize = 32000000 ovlHashBits = 24 ovlHashBlockLength = 800000000 ovlStoreMemory = 128000 frgCorrThreads = 2 frgCorrConcurrency = 8 ovlCorrBatchSize = 1000000 ovlCorrConcurrency = 16 cnsConcurrency = 16 doExtendClearRanges = 0 I don't need to have that unitig (1076523 (U)) in my finished assembly, so it's possible to just remove it as long as I get a finished assembly. I've also tried to just create the .success file, but then terminator fails. Does anyone have any ideas of what I might do different? Can I just remove that unitig and proceed? How do I do that? Sincerely, Ole Kristian Tørresen PhD student University of Oslo |
From: Walenz, B. <bw...@jc...> - 2012-04-09 19:32:38
|
Hi- I’ll look into the crash; it’s a strange one for sure. Without being able to reproduce it here, I don’t predict much success. To move the assembly forward, you can switch to the ‘bogart’ unitigger. Remove the 4-unitigger and *tigStore directories before restarting. I’d guess that your disk usage is from using 25% error. The defaults of ovl=0.06 utg=0.03 work well for bacteria. You can also try sampling down the reads to 50x-100x coverage with ‘fastqSample’. 46m reads * 100bp = 4.6g sequence = 1000x coverage? b -- Brian Walenz Senior Software Engineer J. Craig Venter Institute On 4/6/12 12:17 PM, "Ishwor Thapa" <ish...@pk...> wrote: Hi, I am trying to use celera7 for an assembly of paired ends reads from a Bacterial Genome. It ran upto Unitigger step and it aborted due to assertion fail in buildUnitigs step. Below is the content of 4-unitigger/unitigger.err Bubble popping = on Intersection breaking = on Bad mate threshold = -7 Error threshold = 0.250 (25.000%) Error limit = 4.500 errors sizeof(ufPath) = 24 FragmentInfo()-- Loading fragment information deleted: 55232 active: 9944768 FragmentInfo()-- Loading fragment information deleted: 113524 active: 19886476 FragmentInfo()-- Loading fragment information deleted: 188235 active: 29811765 FragmentInfo()-- Loading fragment information deleted: 363003 active: 39636997 FragmentInfo()-- Loaded 45918788 alive fragments, skipped 401422 dead fragments. FragmentInfo()-- Saving fragment information to cache '/scratch/ithapa/celera/first/4-unitigger/UNK-2.fragmentInfo ' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.001.bestoverlapgraph-containments.lo g' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.002.bestoverlapgraph-dovetails.log' BestOverlapGraph()-- Saving overlap graph to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.bog'. setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.003.ChunkGraph.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.004.buildUnitigs.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.005.bubblePopping.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.006.intersectionBreaking.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.007.placeContains.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.008.placeZombies.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.009.bubblePopping.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.010.libraryStats.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.011.evaluateMates.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.012.moveContains1.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.013.splitDiscontinuous1.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.014.splitBadMates.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.015.splitDiscontinuous2.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.016.moveContains2.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.017.splitDiscontinuous3.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.018.output.log' buildUnitigs: AS_BOG_Outputs.cc:116: void UnitigGraph::writeIUMtoFile(char*, char*, uint32, bool): Assertion `utg- >getLength() > 0' failed. Aborted I have the spec file as follows: utgErrorRate=0.25 utgErrorLimit=4.5 cnsErrorRate=0.25 cgwErrorRate=0.25 ovlErrorRate=0.25 ovlHashBits=23 ovlHashBlockLength=300000000 ovlThreads = 2 ovlConcurrency = 2 Can you please let me know what might have gone wrong? Another question I had is, if its normal to take more than 800GB disk space to reach to Unitigger step. Originally I had 12.4GB fastq file. Thanks, Ishwor |
From: Ishwor T. <ish...@pk...> - 2012-04-06 16:47:26
|
Hi, I am trying to use celera7 for an assembly of paired ends reads from a Bacterial Genome. It ran upto Unitigger step and it aborted due to assertion fail in buildUnitigs step. Below is the content of 4-unitigger/unitigger.err Bubble popping = on Intersection breaking = on Bad mate threshold = -7 Error threshold = 0.250 (25.000%) Error limit = 4.500 errors sizeof(ufPath) = 24 FragmentInfo()-- Loading fragment information deleted: 55232 active: 9944768 FragmentInfo()-- Loading fragment information deleted: 113524 active: 19886476 FragmentInfo()-- Loading fragment information deleted: 188235 active: 29811765 FragmentInfo()-- Loading fragment information deleted: 363003 active: 39636997 FragmentInfo()-- Loaded 45918788 alive fragments, skipped 401422 dead fragments. FragmentInfo()-- Saving fragment information to cache '/scratch/ithapa/celera/first/4-unitigger/UNK-2.fragmentInfo ' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.001.bestoverlapgraph-containments.lo g' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.002.bestoverlapgraph-dovetails.log' BestOverlapGraph()-- Saving overlap graph to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.bog'. setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.003.ChunkGraph.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.004.buildUnitigs.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.005.bubblePopping.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.006.intersectionBreaking.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.007.placeContains.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.008.placeZombies.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.009.bubblePopping.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.010.libraryStats.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.011.evaluateMates.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.012.moveContains1.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.013.splitDiscontinuous1.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.014.splitBadMates.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.015.splitDiscontinuous2.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.016.moveContains2.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.017.splitDiscontinuous3.log' setLogFile()-- Now logging to '/scratch/ithapa/celera/first/4-unitigger/UNK-2.018.output.log' buildUnitigs: AS_BOG_Outputs.cc:116: void UnitigGraph::writeIUMtoFile(char*, char*, uint32, bool): Assertion `utg- >getLength() > 0' failed. Aborted I have the spec file as follows: utgErrorRate=0.25 utgErrorLimit=4.5 cnsErrorRate=0.25 cgwErrorRate=0.25 ovlErrorRate=0.25 ovlHashBits=23 ovlHashBlockLength=300000000 ovlThreads = 2 ovlConcurrency = 2 Can you please let me know what might have gone wrong? Another question I had is, if its normal to take more than 800GB disk space to reach to Unitigger step. Originally I had 12.4GB fastq file. Thanks, Ishwor |
From: Santiago R. <san...@gm...> - 2012-04-06 16:14:15
|
Hi, Brian. Thanks for the tips. I completely forgot to mention the organism: I'm trying to assemble a Xanthomonas axonopodis bacteria. It has a 5.2Mb genome with a ~65% GC content and two plasmids (64Kb and 33Kb). The problem is that despite being different in size, the plasmids are very similar because they share similar genes. They share, not consecutively, almost 20-25Kbp. I have two sets of data: a) the pool of plasmids sequeced single-ended, and b) the whole genome (including the plasmids) sequenced paired-ended. Though I managed to obtain a good number of scaffolds (18) using Newbler v2.6 I was unable to close the plasmids. That's why I'm now trying with Celera and what I'm trying to accomplish. So, any further advice would be really helpful. Best, Santino On Thu, Apr 5, 2012 at 2:39 PM, Walenz, Brian <bw...@jc...> wrote: > Hi, Santino- > > Without knowing what you’re assembling, I’d say you should put nothing in > the spec file. The biggest knob to change is utgErrorRate (default 0.03). > Lower can separate repeats, or fail to assemble anything. Higher can > overcome polymorphism, or over collapse repeats and duplications. The > second biggest knob would be unitigger=bog vs unitigger=bogart. Bogart is > still ‘experimental’ in the 7.0 release; better in CVS tip. > > You might get better compute performance by setting the various > “concurrency” options (especially cnsConcurrency) to the number of CPUs you > have, and tuning overlapper for your hardware. Generally, the defaults > work well for something this size. > > b > -- > Brian Walenz > Senior Software Engineer > J. Craig Venter Institute > > > > On 4/5/12 1:14 PM, "Santiago Revale" <san...@gm...> wrote: > > Hello everybody! > > I'm new to paired-ends assembly, so I'm hoping you could help me. > I have two 454 FLX Titanium libraries: one single-end (common wgs) and one > pair-end (8Kb). > > I ran sffToCA two times, one for each library, using "-trim chop" for the > pair-end library (amongst other paired-end parameters). > > Now, what parameters you suggest should I put in the specFile to perform > this assembly for better results? > > Thank you very much in advance. > > Best, > > Santino > > > |
From: Walenz, B. <bw...@jc...> - 2012-04-05 17:56:28
|
Hi, Santino- Without knowing what you’re assembling, I’d say you should put nothing in the spec file. The biggest knob to change is utgErrorRate (default 0.03). Lower can separate repeats, or fail to assemble anything. Higher can overcome polymorphism, or over collapse repeats and duplications. The second biggest knob would be unitigger=bog vs unitigger=bogart. Bogart is still ‘experimental’ in the 7.0 release; better in CVS tip. You might get better compute performance by setting the various “concurrency” options (especially cnsConcurrency) to the number of CPUs you have, and tuning overlapper for your hardware. Generally, the defaults work well for something this size. b -- Brian Walenz Senior Software Engineer J. Craig Venter Institute On 4/5/12 1:14 PM, "Santiago Revale" <san...@gm...> wrote: Hello everybody! I'm new to paired-ends assembly, so I'm hoping you could help me. I have two 454 FLX Titanium libraries: one single-end (common wgs) and one pair-end (8Kb). I ran sffToCA two times, one for each library, using "-trim chop" for the pair-end library (amongst other paired-end parameters). Now, what parameters you suggest should I put in the specFile to perform this assembly for better results? Thank you very much in advance. Best, Santino |
From: Santiago R. <san...@gm...> - 2012-04-05 17:15:03
|
Hello everybody! I'm new to paired-ends assembly, so I'm hoping you could help me. I have two 454 FLX Titanium libraries: one single-end (common wgs) and one pair-end (8Kb). I ran sffToCA two times, one for each library, using "-trim chop" for the pair-end library (amongst other paired-end parameters). Now, what parameters you suggest should I put in the specFile to perform this assembly for better results? Thank you very much in advance. Best, Santino |
From: Ole K. T. <o.k...@bi...> - 2012-02-16 19:52:02
|
Hi Sebastian. I'm not completely sure I understand what you want, but dNC is part of CA 7.0. As far as I understand, this is how you use it. Put this in your spec file: dncMPlibraries = shortjump dncBBlibraries = frag where "shortjump" is the name of your mate pair library, your Illumina library in this case, and "frag" is the name of a library it will compare with to find PE reads, I think your 454 reads will work fine here. Good luck! Ole On 16 February 2012 15:58, Sebastian Jaenicke <sja...@ce...> wrote: > Hi, > > I'm trying to assemble a bacterial genome based on 454 (WGS) and > Illumina (PE) data with WGS 7.0; however, I'm seeing a large number > of degenerated mate pairs ('ReadsWithBothDegenMate', up to 85% > of the Illumina data). > > I'd like to use DNC to verify the PE distance, but according to > http://sourceforge.net/apps/mediawiki/wgs-assembler/index.php?title=Pair_classification_within_Illumina_mate_pair_data > the DNC software should be part of the WGS 7.0 release, but seems > to be missing. > > Any hints? > > Regards, > > - Sebastian > > -- > A: Maybe because some people are too annoyed by top-posting. > Q: Why do I not get an answer to my question(s)? > A: Because it messes up the order in which people normally read text. > Q: Why is top-posting such a bad thing? > > ------------------------------------------------------------------------------ > Virtualization & Cloud Management Using Capacity Planning > Cloud computing makes use of virtualization - but cloud computing > also focuses on allowing computing to be delivered as a service. > http://www.accelacomm.com/jaw/sfnl/114/51521223/ > _______________________________________________ > wgs-assembler-users mailing list > wgs...@li... > https://lists.sourceforge.net/lists/listinfo/wgs-assembler-users |
From: Sebastian J. <sja...@Ce...> - 2012-02-16 14:58:23
|
Hi, I'm trying to assemble a bacterial genome based on 454 (WGS) and Illumina (PE) data with WGS 7.0; however, I'm seeing a large number of degenerated mate pairs ('ReadsWithBothDegenMate', up to 85% of the Illumina data). I'd like to use DNC to verify the PE distance, but according to http://sourceforge.net/apps/mediawiki/wgs-assembler/index.php?title=Pair_classification_within_Illumina_mate_pair_data the DNC software should be part of the WGS 7.0 release, but seems to be missing. Any hints? Regards, - Sebastian -- A: Maybe because some people are too annoyed by top-posting. Q: Why do I not get an answer to my question(s)? A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? |
From: Walenz, B. <bw...@jc...> - 2012-01-16 16:51:15
|
I just uploaded Celera Assembler 7.0 to sourceforge.net. Sorry for the delay(s). Happy assembling! http://sourceforge.net/projects/wgs-assembler/files/wgs-assembler/wgs-7.0/ http://sourceforge.net/apps/mediawiki/wgs-assembler/index.php?title=Main_Page b -- Brian Walenz Senior Software Engineer J. Craig Venter Institute |