From: Oler, A. (NIH/N. [C] <and...@ni...> - 2012-04-27 16:55:24
|
Hi David, I noticed something about how MergeUCSCGeneTable treats transcripts with the same gene name on different chromosomes. For example, if I have this UCSC-formatted table: cat test.ucsc.real.table OR4F16 NM_001005277 chr1 + 367658 368597 367658 368597 1 367658, 368597, OR4F16 NM_001005277 chr1 - 621095 622034 621095 622034 1 621095, 622034, OR4F16 NM_001005277 chr5 + 180794287 180795226 180794287 180795226 1 180794287, 180795226, I run this command to get a merged table: java -jar ~/USeq/MergeUCSCGeneTable -u test.ucsc.real.table Arguments: -u test.ucsc.real.table 3 transcripts collapsed to 1 genes. I get this output: cat test.real.table_Merged.ucsc OR4F16 OR4F16 chr1 + 367658 180795226 367658 180795226 3 367658,621095,180794287 368597,622034,180795226 I think it should make one merged transcript per chromosome (or else not merge them at all since they are far apart and/or don't have any shared exons, although not much harm is done by merging them within the same chromosome). Thanks, Andrew Andrew Oler, Ph.D. High-Throughput Sequencing Bioinformatics Specialist Contractor – Medical Science & Computing, Inc. Computational Biology Section Bioinformatics and Computational Biosciences Branch (BCBB) OCICB/OSMO/OD/NIAID/NIH 31 Center Drive, Room 3B62 Bethesda, MD 20892 Mobile: 240-507-3791 Office: 301-402-5685 http://bioinformatics.niaid.nih.gov<http://bioinformatics.niaid.nih.gov/> (Within NIH) http://exon.niaid.nih.gov<http://exon.niaid.nih.gov/> (Public) Disclaimer: The information in this e-mail and any of its attachments is confidential and may contain sensitive information. It should not be used by anyone who is not the original intended recipient. If you have received this e-mail in error please inform the sender and delete it from your mailbox or any other storage devices. National Institute of Allergy and Infectious Diseases shall not accept liability for any statements made that are sender's own and not expressly made on behalf of the NIAID by one of its representatives. |