transdecoder-users Mailing List for TranscriptDecoder (Page 3)

Extracting likely coding regions from transcript sequences

Brought to you by: bhaas

transdecoder-users — Technical support and project announcements


[Transdecoder-users] Announcement: Transdecoder release r20140704

From: Brian H. <bh...@br...> - 2014-07-04 11:58:00

Greetings all,

The latest release of TransDecoder is now available:

http://sourceforge.net/projects/transdecoder/files/TransDecoder_r20140704.tar.gz/download

It includes minor changes from the previous release to ensure better
compatibility with other projects, including Trinity, PASA, and Trinotate.

Release notes:

- Added 'make simple' to build just the essential components involving
  parafly and cdhit.

- Removed the 'cds.' prefix from the pep and cds sequence accessions.




-- 
--
Brian J. Haas
The Broad Institute
http://broad.mit.edu/~bhaas
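
A note on the 'cds.' prefix change in the release notes above: files produced by earlier releases may still carry the old-style accessions, which can break ID matching against newer output. The following is a minimal sketch of normalizing such headers, not part of TransDecoder itself. The function names and the example header are hypothetical, and it assumes the prefix sits at the very start of the accession.

```python
def strip_cds_prefix(header: str, prefix: str = "cds.") -> str:
    """Return a FASTA header line with a leading accession prefix removed.

    Assumption: the accession is the first whitespace-free token after '>'.
    Lines that are not headers are returned unchanged.
    """
    if not header.startswith(">"):
        return header  # sequence line, leave untouched
    accession, sep, rest = header[1:].partition(" ")
    if accession.startswith(prefix):
        accession = accession[len(prefix):]
    return ">" + accession + sep + rest


def normalize_fasta(lines):
    """Yield FASTA lines with old-style 'cds.' accessions rewritten."""
    for line in lines:
        yield strip_cds_prefix(line)


# Hypothetical old-style record, for illustration only.
old = [">cds.comp10_c0_seq1|m.1 hypothetical description", "MKTAYIAKQR"]
new = list(normalize_fasta(old))
```

Headers that never had the prefix pass through unchanged, so the same function can be applied to files from either release.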

Re: [Transdecoder-users] Licensing of TransDecoder

From: Brian H. <bh...@br...> - 2014-07-02 15:56:42

Hi Scott.  We use the very liberal BSD Open Source license:

Copyright (c) 2012, The Broad Institute, Inc. All rights reserved.

Redistribution and use in source and binary forms, with or without
modification, are permitted provided that the following conditions are met:

· Redistributions of source code must retain the above copyright notice,
this list of conditions and the following disclaimer.

· Redistributions in binary form must reproduce the above copyright notice,
this list of conditions and the following disclaimer in the documentation
and/or other materials provided with the distribution.

· Neither the name of the Broad Institute nor the names of its contributors
may be used to endorse or promote products derived from this software
without specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE BROAD INSTITUTE ''AS IS'' AND ANY EXPRESS
OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES
OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN
NO EVENT SHALL THE BROAD INSTITUTE BE LIABLE FOR ANY DIRECT, INDIRECT,
INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA,
OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF
LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING
NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE,
EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

On Wed, Jul 2, 2014 at 11:50 AM, Shorkey, Scott <Sco...@ag...>
wrote:

> Good afternoon,
>
> I'm a system administrator from Agriculture and Agri-Food Canada, a
> department of the Government of Canada. Our scientists are interested in
> using TransDecoder, but our department policy requires that we determine
> the licensing of a software package before it may be used on our systems.
> What license is TransDecoder released under, or, if it is not released
> under any formal license, can you verify that it is free to utilize for
> research purposes?
>
> Scott Shorkey
> Bioinformatics System Administrator
> Agriculture and Agri-Food Canada | Agriculture et Agroalimentaire Canada
> KW Neatby Bldg | éd. KW Neatby
> 960 Carling Ave| 960, avenue Carling
> Ottawa, ON | Ottawa (ON) K1A 0C6
> E-mail Address / Adresse courriel: Sco...@ag...
> Telephone | Téléphone 613-759-6409
> Government of Canada | Gouvernement du Canada
>
>
> ------------------------------------------------------------------------------
> Open source business process management suite built on Java and Eclipse
> Turn processes into business applications with Bonita BPM Community Edition
> Quickly connect people, data, and systems into organized workflows
> Winner of BOSSIE, CODIE, OW2 and Gartner awards
> http://p.sf.net/sfu/Bonitasoft
> _______________________________________________
> Transdecoder-users mailing list
> Tra...@li...
> https://lists.sourceforge.net/lists/listinfo/transdecoder-users
>

-- 
--
Brian J. Haas
The Broad Institute
http://broad.mit.edu/~bhaas

[Transdecoder-users] Licensing of TransDecoder

From: Shorkey, S. <Sco...@AG...> - 2014-07-02 15:50:22

Good afternoon,

I'm a system administrator from Agriculture and Agri-Food Canada, a department of the Government of Canada. Our scientists are interested in using TransDecoder, but our department policy requires that we determine the licensing of a software package before it may be used on our systems. What license is TransDecoder released under, or, if it is not released under any formal license, can you verify that it is free to utilize for research purposes?

Scott Shorkey
Bioinformatics System Administrator
Agriculture and Agri-Food Canada | Agriculture et Agroalimentaire Canada
KW Neatby Bldg | éd. KW Neatby
960 Carling Ave| 960, avenue Carling
Ottawa, ON | Ottawa (ON) K1A 0C6
E-mail Address / Adresse courriel: Sco...@ag...
Telephone | Téléphone 613-759-6409
Government of Canada | Gouvernement du Canada

Re: [Transdecoder-users] Using BLASTN evidence for CDS predictions

From: Martin M. <mmo...@gm...> - 2014-06-25 07:23:34

Hi Alexie,
  Hmm, I thought it was calling BLASTP already. I don't think it is worth coming up with yet another
package, but I understand your reasoning.
Best,
Martin

Ale...@cs... wrote:
> Hi Martin
> 
> 
>> Yes, I don't mind if it mistakenly resurrects truly dead genes (pseudogenes).
> I think that might be a bit out of the scope of the TransDecoder program, as a lot of people would mind. Creating a second piece of software that uses the TransDecoder output and the transcriptome assembly sounds like a better way to do it.
> 
> a
> 
> ________________________________________
> From: Martin MOKREJŠ [mmo...@gm...]
> Sent: Tuesday, 24 June 2014 11:11 PM
> To: tra...@li...
> Subject: Re: [Transdecoder-users] Using BLASTN evidence for CDS predictions
> 
> Martin MOKREJŠ wrote:
>> Hi,
>>   I wonder whether TransDecoder could also run blastn against a somewhat related genome/transcriptome
>> and consider the matching regions. They will be matching exons, and the UTRs won't be conserved.
>>
>>   Further, could TransDecoder ignore a STOP codon and merge two ORFs if the matches are in adjacent
>> frames on the same target strand?
> 
> Aha, I forgot to explain a bit more: the alignment to DNA will reveal gaps in either strand, which is quite a helpful
> hint for an artificial insertion/deletion. It should be easy for TransDecoder to realize that there is likely
> an error at that position. Yes, I don't mind if it mistakenly resurrects truly dead genes (pseudogenes).
> 
> Martin

Re: [Transdecoder-users] Using BLASTN evidence for CDS predictions

From: <Ale...@cs...> - 2014-06-25 01:39:53

Hi Martin


>Yes, I don't mind if it mistakenly resurrects truly dead genes (pseudogenes).
I think that might be a bit out of the scope of the TransDecoder program, as a lot of people would mind. Creating a second piece of software that uses the TransDecoder output and the transcriptome assembly sounds like a better way to do it.

a

________________________________________
From: Martin MOKREJŠ [mmo...@gm...]
Sent: Tuesday, 24 June 2014 11:11 PM
To: tra...@li...
Subject: Re: [Transdecoder-users] Using BLASTN evidence for CDS predictions

Martin MOKREJŠ wrote:
> Hi,
>   I wonder whether TransDecoder could also run blastn against a somewhat related genome/transcriptome
> and consider the matching regions. They will be matching exons, and the UTRs won't be conserved.
>
>   Further, could TransDecoder ignore a STOP codon and merge two ORFs if the matches are in adjacent
> frames on the same target strand?

Aha, I forgot to explain a bit more: the alignment to DNA will reveal gaps in either strand, which is quite a helpful
hint for an artificial insertion/deletion. It should be easy for TransDecoder to realize that there is likely
an error at that position. Yes, I don't mind if it mistakenly resurrects truly dead genes (pseudogenes).

Martin


Re: [Transdecoder-users] Using BLASTN evidence for CDS predictions

From: Martin M. <mmo...@gm...> - 2014-06-24 13:12:28

Martin MOKREJŠ wrote:
> Hi,
>   I wonder whether TransDecoder could also run blastn against a somewhat related genome/transcriptome
> and consider the matching regions. They will be matching exons, and the UTRs won't be conserved.
> 
>   Further, could TransDecoder ignore a STOP codon and merge two ORFs if the matches are in adjacent
> frames on the same target strand?

Aha, I forgot to explain a bit more: the alignment to DNA will reveal gaps in either strand, which is quite a helpful
hint for an artificial insertion/deletion. It should be easy for TransDecoder to realize that there is likely
an error at that position. Yes, I don't mind if it mistakenly resurrects truly dead genes (pseudogenes).

Martin

[Transdecoder-users] Using BLASTN evidence for CDS predictions

From: Martin M. <mmo...@gm...> - 2014-06-24 13:08:38

Hi,
  I wonder whether TransDecoder could also run blastn against a somewhat related genome/transcriptome
and consider the matching regions. They will be matching exons, and the UTRs won't be conserved.

  Further, could TransDecoder ignore a STOP codon and merge two ORFs if the matches are in adjacent
frames on the same target strand?

At least ideas for the future,
Martin
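
To make the idea in this thread concrete, here is a rough sketch of how a separate post-processing script might flag two ORFs as a possible frameshift pair. It is not a TransDecoder feature, and the replies in this thread suggest it belongs in a separate tool. The `Orf` record, its field names, and the `max_gap` threshold are all hypothetical, and it assumes each ORF has already been linked to a blastn hit on some target.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    """Hypothetical ORF record annotated with its best blastn hit."""
    start: int        # 1-based leftmost position on the transcript
    end: int          # 1-based inclusive rightmost position
    strand: str       # '+' or '-' on the transcript
    hit_id: str       # identifier of the matched target sequence
    hit_strand: str   # strand of the match on the target

    @property
    def frame(self) -> int:
        # ORF lengths are assumed to be multiples of three, so comparing
        # start offsets distinguishes frames on either strand.
        return (self.start - 1) % 3


def frameshift_candidate(a: Orf, b: Orf, max_gap: int = 30) -> bool:
    """Return True if two ORFs look like one ORF split by an indel.

    Criteria (all assumptions of this sketch): same transcript strand,
    same target and target strand, different reading frames, and at most
    max_gap bases of gap or overlap between them.
    """
    first, second = sorted((a, b), key=lambda o: o.start)
    same_context = (
        a.strand == b.strand
        and a.hit_id == b.hit_id
        and a.hit_strand == b.hit_strand
    )
    gap = second.start - first.end - 1  # negative when the ORFs overlap
    return same_context and first.frame != second.frame and abs(gap) <= max_gap


upstream = Orf(1, 300, "+", "target1", "+")
shifted = Orf(305, 604, "+", "target1", "+")     # frame 1, 4 bases downstream
in_frame = Orf(304, 603, "+", "target1", "+")    # frame 0, same as upstream
is_candidate = frameshift_candidate(upstream, shifted)
```

A pair flagged this way is only a hint: as noted in the thread, merging across the stop codon could also resurrect a genuine pseudogene.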

Re: [Transdecoder-users] Protein found in domtbl file miss in .pep file

From: Brian H. <bh...@br...> - 2014-06-11 01:58:01

Hi David,

All orfs found to have pfam hits should be included in the final
transdecoder.pep output file. If this is not the case, then there could be
a bug that we weren't aware of.

If you take your 'missing' transcripts and run them through transdecoder
separately, is it not picking them up and reporting them in the final
output?  If there's a bug, we'll need some example data to help
troubleshoot it.

many thanks,

~brian
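
For the re-run suggested above, the 'missing' transcripts first need to be pulled into their own FASTA file. A minimal sketch follows, assuming the transcript ID is the part of the ORF ID before "|m." (as in "Unigene0069328|m.26647" from the quoted message below). The function names and the toy sequences are hypothetical.

```python
def transcript_id(orf_id: str) -> str:
    """Map an ORF ID such as 'Unigene0069328|m.26647' to its transcript ID."""
    return orf_id.rsplit("|m.", 1)[0]


def subset_fasta(fasta_lines, wanted_ids):
    """Yield only the FASTA records whose accession is in wanted_ids."""
    keep = False
    for line in fasta_lines:
        if line.startswith(">"):
            fields = line[1:].split()
            keep = bool(fields) and fields[0] in wanted_ids
        if keep:
            yield line


# Toy transcript file, for illustration only.
fasta = [
    ">Unigene0069328 len=12",
    "ATGGCC",
    "TTTAAA",
    ">Unigene0000001",
    "ATGTAA",
]
missing_orfs = ["Unigene0069328|m.26647"]
wanted = {transcript_id(orf) for orf in missing_orfs}
subset = list(subset_fasta(fasta, wanted))
```

Writing `subset` to a new file gives a small input that can be run through TransDecoder on its own and, if the problem reproduces, shared as example data.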



On Tue, Jun 10, 2014 at 9:42 AM, 卢 汉斌 <lh...@gm...> wrote:

> Hello everyone,
>
> I use the following command to find coding regions in my Trinity assembly:
>
> TransDecoder -t target_transcripts.fasta --reuse  --search_pfam /path_to_transdecoder/pfam/Pfam-AB.hmm.bin --CPU 5
>
>
> It generates several output files. Next, I want to find the transcripts that contain the domain I am interested in. I search the target_transcripts.transdecoder.pfam.dat (.domtbl) file for lines that contain the name of that domain. A typical record is displayed as follows:
>
>
> DUF640               PF04852.7    133 Unigene0069328|m.26647 -            127   1.2e-50  172.2   0.1   1   1   5.6e-55     2e-50  171.6   0.1    39   133     1    95     1    95 0.99 Protein of unknown function (DUF640)
>
> I take the IDs ("Unigene0069328|m.26647" in the example line) and pick out those protein sequences from the target_transcripts.transdecoder.pep file, the output of TransDecoder. However, many records that I found in pfam.dat cannot be found in the .pep file.
>
> I selected several of these "missing sequences" and predicted their coding regions on NCBI. They all have ORFs and the domain I am interested in. So why are these sequences not translated to peptide sequences and recorded in the TransDecoder output file, target_transcripts.transdecoder.pep?
>
> Thank you for your help.
>
> Best,
> David Lu
>
>
>
>
>


-- 
--
Brian J. Haas
The Broad Institute
http://broad.mit.edu/~bhaas

[Transdecoder-users] Protein found in domtbl file miss in .pep file

From: 卢汉斌 <lh...@gm...> - 2014-06-10 13:42:21

Hello everyone,

I use the following command to find coding regions in my Trinity assembly:

TransDecoder -t target_transcripts.fasta --reuse  --search_pfam /path_to_transdecoder/pfam/Pfam-AB.hmm.bin --CPU 5

It generates several output files. Next, I want to find the transcripts that contain the domain I am interested in. I search the target_transcripts.transdecoder.pfam.dat (.domtbl) file for lines that contain the name of that domain. A typical record is displayed as follows:

DUF640               PF04852.7    133 Unigene0069328|m.26647 -            127   1.2e-50  172.2   0.1   1   1   5.6e-55     2e-50  171.6   0.1    39   133     1    95     1    95 0.99 Protein of unknown function (DUF640)

I take the IDs ("Unigene0069328|m.26647" in the example line) and pick out those protein sequences from the target_transcripts.transdecoder.pep file, the output of TransDecoder. However, many records that I found in pfam.dat cannot be found in the .pep file.

I selected several of these "missing sequences" and predicted their coding regions on NCBI. They all have ORFs and the domain I am interested in. So why are these sequences not translated to peptide sequences and recorded in the TransDecoder output file, target_transcripts.transdecoder.pep?

Thank you for your help.

Best,
David Lu
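
The comparison described in this message can be scripted so that the list of missing IDs is reproducible. The sketch below assumes the pfam.dat file follows the HMMER domain-table layout seen in the example record (domain name in column 1, ORF ID in column 4) and that the .pep file is plain FASTA. The function names and toy data are hypothetical. It also strips a leading 'cds.' from .pep accessions, since the r20140704 announcement on this page says earlier releases used that prefix; whether that explains the mismatch reported here is not established.

```python
def domtbl_query_ids(lines, domain_name=None):
    """Collect ORF IDs (column 4) from domain-table lines.

    If domain_name is given, only rows whose column 1 equals it are used.
    """
    ids = set()
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank and comment lines
        fields = line.split()
        if len(fields) < 4:
            continue  # malformed row
        if domain_name is None or fields[0] == domain_name:
            ids.add(fields[3])
    return ids


def fasta_ids(lines, strip_prefix="cds."):
    """Collect accessions from FASTA headers, dropping an optional prefix."""
    ids = set()
    for line in lines:
        if line.startswith(">"):
            fields = line[1:].split()
            if not fields:
                continue
            accession = fields[0]
            if strip_prefix and accession.startswith(strip_prefix):
                accession = accession[len(strip_prefix):]
            ids.add(accession)
    return ids


def missing_from_pep(domtbl_lines, pep_lines, domain_name=None):
    """Return sorted ORF IDs present in the domain table but not in the .pep."""
    return sorted(domtbl_query_ids(domtbl_lines, domain_name) - fasta_ids(pep_lines))


# Toy input: the first row is abbreviated from the record quoted above
# (trailing columns omitted); the second row and the .pep entry are made up.
domtbl = [
    "# target name  accession  tlen  query name ...",
    "DUF640 PF04852.7 133 Unigene0069328|m.26647 - 127 1.2e-50 172.2 0.1",
    "DUF640 PF04852.7 133 UnigeneX|m.2 - 150 3.0e-40 140.0 0.2",
]
pep = [">cds.UnigeneX|m.2 hypothetical header", "MSTK"]
missing = missing_from_pep(domtbl, pep, domain_name="DUF640")
```

Matching on exact accessions rather than with a substring search avoids false hits such as "m.2" matching "m.26647".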