detectMITE / Tickets / #3 Error when running

JuanC - 2017-03-30

Content of one of the clstr file.

>Cluster 0 0 149nt, >129|252|400|2... * >Cluster 1 0 149nt, >143|242|390|4... * 1 149nt, >579|242|390|4... at +/95.97% 2 149nt, >581|194|342|4... at +/91.95% >Cluster 2 0 149nt, >882|170|318|3... * >Cluster 3 >.....
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

JuanC - 2017-03-30

It seems like it is iterating through 8 candidates but from 4th on they are empty files

Last edit: JuanC 2017-03-30

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Congting Ye - 2017-03-31
  
  Hello JuanC,
  
  The error is produced when some temporary file is empty (means no candidate was found in that stage), this bug is fixed in the new version, please use the new one.
  
  Thanks for your bug report!
  
  Congting Ye
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Congting Ye - 2017-05-19

labels: --> Fixed

status: open --> closed
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Virginia Markham - 2019-01-29

Hi,
I'm still getting this error even though I'm quite sure I'm using the newest version of detectMITE. It gives results for only the first 8 out of 15 iterations. Any idea what the problem might be? Thanks!!

I'm running this on our server which is SUSE Linux Enterprise Server 12 SP3, and I'm using Matlab version R2016a and cd-hit v4.6.1-2012-08-27.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Congting Ye - 2019-02-02
  
  Hi Virginia Markham,
  
  According to the error info you post, I figure out that you were using an old version of detectMITE. The code "Cluster = fastaread(['./result/' genome '.clusteredCandidateTE' num2str(count) '.fasta.clstr']);" is at line 60 of the old verion (such as detectMITE.20160818.tar.gz), but at line 62 of the newest version. Please go the link (https://sourceforge.net/projects/detectmite/files/detectMITE.20170425.tar.gz/download) to download the newest version.
  
  Error using fastaread (line 158)
  Input does not exist or is not a valid FASTA file.
  
  Error in detectMITE (line 60)
  Cluster = fastaread(['./result/' genome '.clusteredCandidateTE' num2str(count) '.fasta.clstr']);
  
  Error in do_MITE_detection (line 141)
  detectMITE(data_file,p_TIR_Len,p_TSD_minLen ,p_TSD_maxLen ,p_MITE_minLen ,p_MITE_maxLen
  ,p_FlankSeq_Len,genome,p_cpu);
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
  - Virginia Markham - 2019-02-02
    
    Thanks so much for your response! Sorry, I should have been more clear. This is my first time posting on sourceforge, apologies for my mistakes. That error you mention ( "Cluster = fastaread(['./result/' ...") is from Juan C. I did not get that error. In fact, I did not get any error message at all. But like Juan C., only my first few iterations are giving clusters. I have 15 iterations and the first 8 look good (for example: "80597 finished 51831 clusters"), but the last 7 are not giving results ("0 finished 0 clusters").
    
    These output files are empty:
    my_genome.mite.fasta
    my_genome.miteSet.fasta
    result/my_genome.candidateTE*_*.fasta
    result/my_genome.clusteredCandidateTE*_*.fasta
    result/my_genome.clusteredCandidateTE*_*.fasta.clstr
    result/my_genome.miteCandidate*.fasta
    result/my_genome.miteCandidate*_*.fasta
    
    These output files are not empty:
    result/my_genome.candidateTE*.fasta
    result/my_genome.clusteredCandidateTE*.fasta
    result/my_genome.clusteredCandidateTE*.fasta.clstr
    
    Do you have any ideas about what I'm doing wrong? Thanks so much in advance, I really appreciate it. My output is very long so I am only posting the beginning and the end, with an "etc." in the middle:
    < M A T L A B (R) >
    Copyright 1984-2016 The MathWorks, Inc.
    R2016a (9.0.0.341360) 64-bit (glnxa64)
    February 11, 2016
    
    To get started, type one of these: helpwin, helpdesk, or demo.
    For product information, visit www.mathworks.com.
    
    Starting parallel pool (parpool) using the 'local' profile ... connected to 16 workers. ===============================================================
    Program: CD-HIT, V4.6, Jan 23 2019, 14:52:54
    Command: ./cd-hit/cd-hit-est -i
    ./result/Bd21.candidateTE1.fasta -o
    ./result/Bd21.clusteredCandidateTE1.fasta -c 0.80 -n 5
    -d 0 -T 0 -aL 0.99 -s 0.8 -M 0
    
    Started: Mon Jan 28 13:12:47 2019 ================================================================
    Output
    
    Option -T is ignored: multi-threading with OpenMP is NOT enabled!
    total seq: 80597
    longest and shortest : 149 and 50
    Total letters: 7739197
    Sequences have been sorted
    
    Approximated minimal memory consumption:
    Sequence : 18M
    Buffer : 1 X 13M = 13M
    Table : 1 X 1M = 1M
    Miscellaneous : 1M
    Total : 33M
    
    Table limit with the given memory limit:
    Max number of representatives: 4000000
    Max number of word counting entries: 104574500
    
    comparing sequences from 0 to 80597
    .......... 10000 finished 7548 clusters
    .......... 20000 finished 14234 clusters
    .......... 30000 finished 18927 clusters
    .......... 40000 finished 26529 clusters
    .......... 50000 finished 33365 clusters
    .......... 60000 finished 38920 clusters
    .......... 70000 finished 45481 clusters
    .......... 80000 finished 51523 clusters
    
    80597 finished 51831 clusters
    
    Apprixmated maximum memory consumption: 67M
    writing new database
    writing clustering information
    program completed !
    
    ================================================================
    
    ....[etc.]...
    
    ================================================================
    Output
    ----------------------------------------------------------------
    Option -T is ignored: multi-threading with OpenMP is NOT enabled!
    total seq: 0
    longest and shortest : 0 and 18446744073709551615
    Total letters: 0
    Sequences have been sorted
    
    Approximated minimal memory consumption:
    Sequence : 0M
    Buffer : 1 X 12M = 12M
    Table : 1 X 0M = 0M
    Miscellaneous : 0M
    Total : 12M
    
    Table limit with the given memory limit:
    Max number of representatives: 4000000
    Max number of word counting entries: 50000000
    
    0 finished 0 clusters
    
    Apprixmated maximum memory consumption: 12M
    writing new database
    writing clustering information
    program completed !
    
    Total CPU time 0.01
    IdleTimeout has been reached.
    Parallel pool using the 'local' profile is shutting down.
    
    Last edit: Virginia Markham 2019-02-02
    
    If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
    - Congting Ye - 2019-02-03
      
      In your case, this indicates that no candidate MITEs was detected from your input. I guess that you may run detectMITE using a small fasta file for test, a small input often results in nothing detected. You could run detectMITE with Arabidopsis genome to test whether it could run correctly.
      
      Thanks for your interest in detectMITE!
      
      If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
      - Virginia Markham - 2019-02-03
        
        Hm okay, I'll try that. Thanks!!
        
        If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Error when running

Detection of Miniature Inverted Repeat Transposable Elements

Milestone

Searches

Help

#3 Error when running

Started: Thu Mar 30 17:40:41 2017

Discussion