Menu

#32 PBCR blasr failure

v1.0_(example)
open
nobody
5
2016-11-01
2016-10-31
Seigfried
No

Hello

I have been trying to run PBCR on my PacBio data. The pipeline was running smoothly and made 4 sam files as per my specifications in pacbio.spec

1.sam 2.sam 3.sam 4.sam got converted into 1.ovls 2.ovls 3.ovls 4.ovls
Currently my PBCR pipeline is running and forming 5.sam 6.sam 7.sam and 8.sam

However I see that blasr for 6.sam has failed

Running partition 000002 with options -h 659897171-660029876 -r 425206338-659897170 --hashstrings 132706 --hashdatalen 1000002482 start 425206337 end 659897170 total 234690833 zero job 1 and stride 2
Running partition 000001 with options -h 659897171-660029876 -r 1-425206337 --hashstrings 132706 --hashdatalen 1000002482 start 0 end 425206337 total 425206337 zero job 0 and stride 2
Running partition 000003 with options -h 660029877-660162937 -r 1-425206337 --hashstrings 133061 --hashdatalen 1000008291 start 0 end 425206337 total 425206337 zero job 0 and stride 2
Running partition 000004 with options -h 660029877-660162937 -r 425206338-659897170 --hashstrings 133061 --hashdatalen 1000008291 start 425206337 end 659897170 total 234690833 zero job 1 and stride 2
Blasr completed.
Blasr completed.
SamToCA conversion completed.
/home/iivr/Desktop/PBCR//tempec_pacbio/1-overlapper/overlap.sh 5
Running partition 000005 with options -h 660162938-660298518 -r 1-425206337 --hashstrings 135581 --hashdatalen 1000000244 start 0 end 425206337 total 425206337 zero job 0 and stride 2
SamToCA conversion completed.
/home/iivr/Desktop/PBCR//tempec_pacbio/1-overlapper/overlap.sh 6
Running partition 000006 with options -h 660162938-660298518 -r 425206338-659897170 --hashstrings 135581 --hashdatalen 1000000244 start 425206337 end 659897170 total 234690833 zero job 1 and stride 2
Blasr completed.
Blasr failed.
[INFO] 2016-10-30T02:36:04 [blasr] started.
blasr: ../../lib/cpp/alignment/algorithms/anchoring/MapBySuffixArrayImpl.hpp:303: int MapReadToGenome(T_RefSequence&, T_SuffixArray&, T_Sequence&, unsigned int, std::vector<T_MatchPos, std::allocator<_Tp2=""> >&, AnchorParameters&) [with T_SuffixArray = SuffixArray<unsigned char,="" std::vector<int,="" std::allocator<int=""> >, DefaultCompareStrings<unsigned char="">, DNATuple>, T_RefSequence = FASTASequence, T_Sequence = SMRTSequence, T_MatchPos = ChainedMatchPos]: Assertion `sa.index[mp] + matchLength[matchIndex] <= reference.length' failed.
Command terminated by signal 6
564420.20user 36035.95system 9:15:09elapsed 1802%CPU (0avgtext+0avgdata 10329280maxresident)k
9933936inputs+158959392outputs (3637major+4045822067minor)pagefaults 0swaps
/home/iivr/Desktop/PBCR//tempec_pacbio/1-overlapper/overlap.sh 7
Running partition 000007 with options -h 660298519-660438109 -r 1-425206337 --hashstrings 139591 --hashdatalen 1000002830 start 0 end 425206337 total 425206337 zero job 0 and stride 2
Blasr completed.

Right now PBCR is still running and I don't want to interrupt it. Is there any way that I can restart my failed blasr?

2 Attachments

Related

Support Requests: #32

Discussion

  • Sergey Koren

    Sergey Koren - 2016-10-31

    This looks like an internal blasr error so restarting won’t likely fix it (the match went out of the index bounds). The pipeline will stop when it finds a missing job from the run at which point you could try re-running blasr or trying an updated version of blasr. However, I would recommend using Canu instead as PBcR is no longer being supported/maintained.

    On Oct 31, 2016, at 1:45 AM, Seigfried seigfried@users.sf.net wrote:

    [support-requests:#32] https://sourceforge.net/p/wgs-assembler/support-requests/32/ PBCR blasr failure

    Status: open
    Group: v1.0_(example)
    Labels: blasr pbcr
    Created: Mon Oct 31, 2016 05:45 AM UTC by Seigfried
    Last Updated: Mon Oct 31, 2016 05:45 AM UTC
    Owner: nobody
    Attachments:

    pacbio.spec https://sourceforge.net/p/wgs-assembler/support-requests/32/attachment/pacbio.spec (924 Bytes; text/x-rpm-spec)
    run.out https://sourceforge.net/p/wgs-assembler/support-requests/32/attachment/run.out (70.4 kB; application/octet-stream)
    Hello

    I have been trying to run PBCR on my PacBio data. The pipeline was running smoothly and made 4 sam files as per my specifications in pacbio.spec

    1.sam 2.sam 3.sam 4.sam got converted into 1.ovls 2.ovls 3.ovls 4.ovls
    Currently my PBCR pipeline is running and forming 5.sam 6.sam 7.sam and 8.sam

    However I see that blasr for 6.sam has failed

    Running partition 000002 with options -h 659897171-660029876 -r 425206338-659897170 --hashstrings 132706 --hashdatalen 1000002482 start 425206337 end 659897170 total 234690833 zero job 1 and stride 2
    Running partition 000001 with options -h 659897171-660029876 -r 1-425206337 --hashstrings 132706 --hashdatalen 1000002482 start 0 end 425206337 total 425206337 zero job 0 and stride 2
    Running partition 000003 with options -h 660029877-660162937 -r 1-425206337 --hashstrings 133061 --hashdatalen 1000008291 start 0 end 425206337 total 425206337 zero job 0 and stride 2
    Running partition 000004 with options -h 660029877-660162937 -r 425206338-659897170 --hashstrings 133061 --hashdatalen 1000008291 start 425206337 end 659897170 total 234690833 zero job 1 and stride 2
    Blasr completed.
    Blasr completed.
    SamToCA conversion completed.
    /home/iivr/Desktop/PBCR//tempec_pacbio/1-overlapper/overlap.sh 5
    Running partition 000005 with options -h 660162938-660298518 -r 1-425206337 --hashstrings 135581 --hashdatalen 1000000244 start 0 end 425206337 total 425206337 zero job 0 and stride 2
    SamToCA conversion completed.
    /home/iivr/Desktop/PBCR//tempec_pacbio/1-overlapper/overlap.sh 6
    Running partition 000006 with options -h 660162938-660298518 -r 425206338-659897170 --hashstrings 135581 --hashdatalen 1000000244 start 425206337 end 659897170 total 234690833 zero job 1 and stride 2
    Blasr completed.
    Blasr failed.
    [INFO] 2016-10-30T02:36:04 [blasr] started.
    blasr: ../../lib/cpp/alignment/algorithms/anchoring/MapBySuffixArrayImpl.hpp:303: int MapReadToGenome(T_RefSequence&, T_SuffixArray&, T_Sequence&, unsigned int, std::vector<T_MatchPos, std::allocator<_Tp2=""> >&, AnchorParameters&) [with T_SuffixArray = SuffixArray<unsigned char,="" std::vector<int,="" std::allocator<int=""> >, DefaultCompareStrings<unsigned char="">, DNATuple>, T_RefSequence = FASTASequence, T_Sequence = SMRTSequence, T_MatchPos = ChainedMatchPos]: Assertion `sa.index[mp] + matchLength[matchIndex] <= reference.length' failed.
    Command terminated by signal 6
    564420.20user 36035.95system 9:15:09elapsed 1802%CPU (0avgtext+0avgdata 10329280maxresident)k
    9933936inputs+158959392outputs (3637major+4045822067minor)pagefaults 0swaps
    /home/iivr/Desktop/PBCR//tempec_pacbio/1-overlapper/overlap.sh 7
    Running partition 000007 with options -h 660298519-660438109 -r 1-425206337 --hashstrings 139591 --hashdatalen 1000002830 start 0 end 425206337 total 425206337 zero job 0 and stride 2
    Blasr completed.

    Right now PBCR is still running and I don't want to interrupt it. Is there any way that I can restart my failed blasr?

    Sent from sourceforge.net because you indicated interest in https://sourceforge.net/p/wgs-assembler/support-requests/32/ https://sourceforge.net/p/wgs-assembler/support-requests/32/
    To unsubscribe from further messages, please visit https://sourceforge.net/auth/subscriptions/ https://sourceforge.net/auth/subscriptions/

     

    Related

    Support Requests: #32

  • Seigfried

    Seigfried - 2016-11-01

    Hello Sergey

    'This looks like an internal blasr error so restarting won’t likely fix it (the match went out of the index bounds).'
    Does this mean that this particular step 1-overlapper is unable to be completed if I restart it again after it fails?

    I cannot use Canu since I am using a Hybrid assembly with only around 20x pacbio reads and 150x Illumina reads.
    Also I cannot use the latest version of blasr because of the recent changes and I had to downgrade to version that actually works with pbcr
    ./blasr --version
    Note : prior to Blasr version 5.1 , use ./blasr -version (single dash) incompatible with pbcr 8.3

    I understand that pbcr is not being maintained. Can you possibly advise me on how to best complete a hybrid assembly OR how I can use the latest blasr with this pbcr because that would probably solve all my problems

    Thanks!

     
    • Sergey Koren

      Sergey Koren - 2016-11-02

      20X coverage is the minimum recommended for Canu so you can give it a shot with the low-coverage parameters suggested here:
      http://canu.readthedocs.io/en/latest/quick-start.html#assembling-low-coverage-datasets http://canu.readthedocs.io/en/latest/quick-start.html#assembling-low-coverage-datasets

      If you want to get PBcR to work with the latest blasr, you would probably have to update the code. Change any reference to -version to be —version. It’s just a perl script so if you change it you don’t need to recompile. You can pass arbitrary parameters for blasr with the blasr option to PBcR, that should let you customize the parameters to match what the latest blasr expects.

      On Oct 31, 2016, at 10:43 PM, Seigfried seigfried@users.sf.net wrote:

      Hello Sergey

      'This looks like an internal blasr error so restarting won’t likely fix it (the match went out of the index bounds).'
      Does this mean that this particular step 1-overlapper is unable to be completed if I restart it again after it fails?

      I cannot use Canu since I am using a Hybrid assembly with only around 20x pacbio reads and 150x Illumina reads.
      Also I cannot use the latest version of blasr because of the recent changes and I had to downgrade to version that actually works with pbcr
      ./blasr --version
      Note : prior to Blasr version 5.1 , use ./blasr -version (single dash) incompatible with pbcr 8.3

      I understand that pbcr is not being maintained. Can you possibly advise me on how to best complete a hybrid assembly OR how I can use the latest blasr with this pbcr because that would probably solve all my problems

      Thanks!

      [support-requests:#32] https://sourceforge.net/p/wgs-assembler/support-requests/32/ PBCR blasr failure

      Status: open
      Group: v1.0_(example)
      Labels: blasr pbcr
      Created: Mon Oct 31, 2016 05:45 AM UTC by Seigfried
      Last Updated: Mon Oct 31, 2016 05:45 AM UTC
      Owner: nobody
      Attachments:

      pacbio.spec https://sourceforge.net/p/wgs-assembler/support-requests/32/attachment/pacbio.spec (924 Bytes; text/x-rpm-spec)
      run.out https://sourceforge.net/p/wgs-assembler/support-requests/32/attachment/run.out (70.4 kB; application/octet-stream)
      Hello

      I have been trying to run PBCR on my PacBio data. The pipeline was running smoothly and made 4 sam files as per my specifications in pacbio.spec

      1.sam 2.sam 3.sam 4.sam got converted into 1.ovls 2.ovls 3.ovls 4.ovls
      Currently my PBCR pipeline is running and forming 5.sam 6.sam 7.sam and 8.sam

      However I see that blasr for 6.sam has failed

      Running partition 000002 with options -h 659897171-660029876 -r 425206338-659897170 --hashstrings 132706 --hashdatalen 1000002482 start 425206337 end 659897170 total 234690833 zero job 1 and stride 2
      Running partition 000001 with options -h 659897171-660029876 -r 1-425206337 --hashstrings 132706 --hashdatalen 1000002482 start 0 end 425206337 total 425206337 zero job 0 and stride 2
      Running partition 000003 with options -h 660029877-660162937 -r 1-425206337 --hashstrings 133061 --hashdatalen 1000008291 start 0 end 425206337 total 425206337 zero job 0 and stride 2
      Running partition 000004 with options -h 660029877-660162937 -r 425206338-659897170 --hashstrings 133061 --hashdatalen 1000008291 start 425206337 end 659897170 total 234690833 zero job 1 and stride 2
      Blasr completed.
      Blasr completed.
      SamToCA conversion completed.
      /home/iivr/Desktop/PBCR//tempec_pacbio/1-overlapper/overlap.sh 5
      Running partition 000005 with options -h 660162938-660298518 -r 1-425206337 --hashstrings 135581 --hashdatalen 1000000244 start 0 end 425206337 total 425206337 zero job 0 and stride 2
      SamToCA conversion completed.
      /home/iivr/Desktop/PBCR//tempec_pacbio/1-overlapper/overlap.sh 6
      Running partition 000006 with options -h 660162938-660298518 -r 425206338-659897170 --hashstrings 135581 --hashdatalen 1000000244 start 425206337 end 659897170 total 234690833 zero job 1 and stride 2
      Blasr completed.
      Blasr failed.
      [INFO] 2016-10-30T02:36:04 [blasr] started.
      blasr: ../../lib/cpp/alignment/algorithms/anchoring/MapBySuffixArrayImpl.hpp:303: int MapReadToGenome(T_RefSequence&, T_SuffixArray&, T_Sequence&, unsigned int, std::vector<T_MatchPos, std::allocator<_Tp2=""> >&, AnchorParameters&) [with T_SuffixArray = SuffixArray<unsigned char,="" std::vector<int,="" std::allocator<int=""> >, DefaultCompareStrings<unsigned char="">, DNATuple>, T_RefSequence = FASTASequence, T_Sequence = SMRTSequence, T_MatchPos = ChainedMatchPos]: Assertion `sa.index[mp] + matchLength[matchIndex] <= reference.length' failed.
      Command terminated by signal 6
      564420.20user 36035.95system 9:15:09elapsed 1802%CPU (0avgtext+0avgdata 10329280maxresident)k
      9933936inputs+158959392outputs (3637major+4045822067minor)pagefaults 0swaps
      /home/iivr/Desktop/PBCR//tempec_pacbio/1-overlapper/overlap.sh 7
      Running partition 000007 with options -h 660298519-660438109 -r 1-425206337 --hashstrings 139591 --hashdatalen 1000002830 start 0 end 425206337 total 425206337 zero job 0 and stride 2
      Blasr completed.

      Right now PBCR is still running and I don't want to interrupt it. Is there any way that I can restart my failed blasr?

      Sent from sourceforge.net because you indicated interest in https://sourceforge.net/p/wgs-assembler/support-requests/32/ https://sourceforge.net/p/wgs-assembler/support-requests/32/
      To unsubscribe from further messages, please visit https://sourceforge.net/auth/subscriptions/ https://sourceforge.net/auth/subscriptions/

       

      Related

      Support Requests: #32


Log in to post a comment.