From: Alec W. <al...@br...> - 2010-11-15 17:01:06
|
Picard release 1.35 15 November 2010 Nothing terribly exciting in this release... - CalculateHsMetrics.java: Added validation that bait and target files are the same reference as the reads. - ExtractSequences.java: Added a new tool to extract subsequences from a fasta file. - Provide more informative error message when countMismatches has an ArrayIndexOutOfBoundsException. -Alec |
From: Alec W. <al...@br...> - 2010-11-30 18:32:41
|
Hi Hyun, [CCing samtools-help because we try to share the knowledge of the community...] SamToFastq is not designed to be very efficient with memory. It could be implemented better, but it hasn't been a focus for us. The problem is that is writes out the two FASTQs in the same order, and if one end of the pair is not near the other in the input BAM, then the first end encountered is saved in memory until the second is found. If there are lots of pairs with one end unmapped and without a coordinate, and/or chimeric reads, or long jumping reads, then this map can cause the program to run out of memory. The solution is to feed it a queryname-sorted file so that the two mates are next to each other in the input BAM. You could sort the file (with samtools or SortSam) and save it, and then feed the queryname sorted file to SamToFastq. Alternately, you could have the sort program write to stdout, and then have SamToFastq read the sorted file from stdin. Details on stdin and stdout for Picard can be found here: http://sourceforge.net/apps/mediawiki/picard/index.php?title=Main_Page#Q:_Can_Picard_programs_read_from_stdin_and_write_to_stdout.3F -Alec On 11/30/10 1:05 PM, Hyun Min Kang wrote: > Hi Alec, > > We are using picard 1.35's SamtoFastq.jar to convert a 20GB Exome BAM file to FASTQ file, and encountered the following errors. Even if we increased the memory to 16GB, it didn't resolve the errors. Do you have an idea why these error happen? The BAM files contains quite a large number of QC failed reads and duplicates. > > Thanks, > Hyun. > > Error message is: > Runtime.totalMemory()=9090039808 > Exception in thread "main" java.lang.OutOfMemoryError: GC overhead limit exceeded > at java.lang.String.<init>(String.java:347) > at net.sf.samtools.util.StringUtil.bytesToString(StringUtil.java:259) > at net.sf.samtools.BAMRecord.decodeReadName(BAMRecord.java:325) > at net.sf.samtools.BAMRecord.getReadName(BAMRecord.java:214) > at net.sf.picard.sam.SamToFastq.doPaired(SamToFastq.java:148) > at net.sf.picard.sam.SamToFastq.doWork(SamToFastq.java:111) > at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:156) > at net.sf.picard.sam.SamToFastq.main(SamToFastq.java:96) > > samtools flagstat results are : > 199846817 in total > 25275848 QC failure > 61789693 duplicates > 170451665 mapped (85.29%) > 131246536 paired in sequencing > 65623268 read1 > 65623268 read2 > 113393300 properly paired (86.40%) > 113532490 with itself and mate mapped > 302704 singletons (0.23%) > 100912 with mate mapped to a different chr > 77082 with mate mapped to a different chr (mapQ>=5) > > > On Nov 15, 2010, at 12:00 PM, Alec Wysoker wrote: > > >> Picard release 1.35 >> 15 November 2010 >> >> Nothing terribly exciting in this release... >> >> - CalculateHsMetrics.java: Added validation that bait and target files are the same reference as the reads. >> >> - ExtractSequences.java: Added a new tool to extract subsequences from a fasta file. >> >> - Provide more informative error message when countMismatches has an ArrayIndexOutOfBoundsException. >> >> >> -Alec >> ------------------------------------------------------------------------------ >> Centralized Desktop Delivery: Dell and VMware Reference Architecture >> Simplifying enterprise desktop deployment and management using >> Dell EqualLogic storage and VMware View: A highly scalable, end-to-end >> client virtualization framework. Read more! >> http://p.sf.net/sfu/dell-eql-dev2dev_______________________________________________ >> Samtools-devel mailing list >> Sam...@li... >> https://lists.sourceforge.net/lists/listinfo/samtools-devel >> > |
From: Alec W. <al...@br...> - 2010-12-06 15:32:37
|
Picard Release 1.36 6 December 2010 - BAMIndexMetaData.java: Make public getAlignedRecordCount, getUnalignedRecordCount, getNoCoordinateCount. Improve interface to bam index metaData - public getMetaData in AbstractBamFileIndex. - Support bgzipped SAM files in SAMFileReader. - AbstractAlignmentMerger.java: Allow Sam or Bam output. - FixMateInformation.java; When merging the input files, if they are all sorted then take advantage of that to produce a sorted input iterator. - Soft-clip CIGAR operator inside of hard-clip operator is now considered valid. - Non-standard SAM header tag values are no longer typed. They are all stored as strings. Any existing files that have a non-standard header tag value with a value type encoded in it will need to be repaired. These header tags will parse, but may not yield the expected results. AbstractSAMHeaderRecord.setAttribute(String, Object) has been deprecated. Use AbstractSAMHeaderRecord(String, String) instead. AbstractSAMHeaderRecord.getAttribute and .getAttributes have been changed to return String and Map<String, String>, respectively. Existing code that called these methods may need to be changed, in the cases where other than a String return value was expected. -Alec |
From: Alec W. <al...@br...> - 2010-12-20 14:24:44
|
Picard Release 1.37 20 December 2010 - SamRecordIntervalIteratorFactory.java: When not using index, halt iteration as soon as a read is seen that is after the last interval requested to avoid uselessly reading the entire file. - Changes to IntervalList to ensure that only intervals defined against valid sequences can be loaded from file. - MetricsFile.read will not throw an exception if the EOF is reached in before any metrics are read. Instead an empty metrics file object is returned - CollectAlignmentSummaryMetrics.java: If no reads are found than an empty unpaired category metric is output - MarkDuplicates.java: The SAM Header is now parsed for libaries in order to try to pre-populate the metrics objects because otherwise there would be no duplication metrics objects for a library if no reads were found - build.xml: Fix dependencies for package-commands target. - Change behavior of RevertSam so that if REMOVE_DUPLICATE_INFORMATION=false, then the output can have records with the duplicate flag set despite being unaligned. - CollectAlignmentSummaryMetrics.java: Reads shorter than ADAPTER_MATCH_LENGTH are not considered adapters. Fixes ArrayIndexOutOfBoundsException from samtools-dev when 0-length read -Alec |
From: Alec W. <al...@br...> - 2011-01-03 18:33:05
|
Picard release 1.38 3 January 2011 - Histogram.java: More efficient implementation of Historam.getStandardDeviation() and unit tests for the same. - First version of a multi-metrics tool. Still fairly primitive. - Added IntervalTreeMap. - CommandLineProgram.java: Changed tmpdir.mkdir() to tmpdir.mkdirs() - Suppress validation for zero-length read bases if SAMRecord is not primary. -Alec |
From: Alec W. <al...@br...> - 2011-01-31 15:30:28
|
Hi Folks, SourceForge was attacked last week, and as a result has temporarily shut down various services, including file upload. Details here: http://sourceforge.net/blog/sourceforge-attack-full-report/ Picard release 1.39 is delayed until SourceForge services are restored. -Alec |
From: Alec W. <al...@br...> - 2011-02-02 14:09:46
|
Picard release 1.39 2 February 2011 - FastqToSam.java: Added options to set the sequencing center, PI, DS, DT tags in read group. - CollectGcBiasMetrics.java: Fixed a bug whereby windows from a reference sequence were ignored if the reference sequence had no reads aligned to it. - CollectGcBiasMetrics; When counting errors in GC window, add # of inserted bases and # of deleted bases to # of mismatched bases. - CommandLineParser.java: Eliminate requirement that fields annotated with @Option and @Usage be public. SourceForge appears to be partially functional now. I am not able to browse the SVN repository via HTTP, but other SVN operations seem to be working. -Alec |
From: Alec W. <al...@br...> - 2011-02-15 15:46:42
|
Picard release 1.40 15 February 2011 - CollectMultipleMetrics.java: Made sure that the list of programs to run is unique. - Faster implementation of MeanQualityByCycle that cuts off about 2/3 of the runtime. - Adds a .equals method to MetricBase that compares the values for all public fields in the class; modifies MetricFile to have separate comparison methods for header, metrics, and histogram and to call all three of them in the .equals method. FormatUtil now handles Bytes. - QualityScoreDistribution.java, MetricsFile.java: Fixed to cope with the situation when there are no valid bases in an input sam/bam file. - Hack to fix strange problem with reading BAM via HTTP from SourceForge. -Alec |
From: Alec W. <al...@br...> - 2011-03-29 14:18:44
|
Picard release 1.42 29 March 2011 - SAMRecordQueryNameComparator.java: Augment comparator to test not-primary flag and hit-index (HI) tag value, if present. Encapsulate read name comparator in case it gets changed. - Restore value of SAMFileReader.defaultValidationStringency in various places where it is changed, so that methods that are typically main programs can be called in the context of some other program, and don't mess up the original validation stringency. - SeekableHTTPStream.java: Eliminate work-around for SourceForge HTTP problem http://sourceforge.net/apps/trac/sourceforge/ticket/18367, since it no longer seems to be a problem. - Added a little progress logging to SortSam. - Changed SortingCollection to use a TreeSet in place of a PriorityQueue; this results in a speed up on the order of ~20% for the merging phase of SortingCollection when there are large numbers of temp files in use. - CommandLineProgram.java: Modified to throw the original error, even if there is an exception when writing out the final log lines. - SortingCollection.java: Make SortingCollection in-memory iteration destructive by default in order to reduce memory footprint. If you want to call iterator() on a SortingCollection more than once, call setDestructiveIteration(false) to ensure that this will be allowed. For SortingCollections that are large enough to require spilling to disk, iteration is non-destructive currently. - Changes to centralize the size of buffers use in various buffering file read/writing classes and default them to 128kb. - Add option to BAM index reading code to disable memory mapping of index file. Memory-mapping can be problematic in Java because there is no way to force the release of a memory map before the garbage collector decides to do so, and therefore if many indices are opened and then closed, the JVM can run out of space in which to memory map. By calling SAMFileReader.enableIndexMemoryMapping(false), RandomAccessFile is used rather than memory mapping, which results in slower performance but timely release of resources. Change courtesy of Bob Handsaker. -Alec |
From: Cliff Y. <mm...@ho...> - 2011-04-07 20:36:28
|
Hi, I used samtools mpileup and bcftools to get variant vcf files and noticed the vcf format is 4.1. I then tried vcftools to analyze the data but got the following error message: VCFtools - v0.1.3 (C) Adam Auton 2009 Error:VCF version must be v4.0: You are using version VCFv4.1 Is there any tool that I can use for vcf4.1? Thanks -C |
From: Heng Li <lh...@sa...> - 2011-04-07 20:55:46
|
Samtools takes the GL/PL ordering required by the upcoming VCF-4.1. Thus it labels 4.1 in the header. However, the release of VCF-4.1 is delayed, which is causing the inconsistency. For now, you may just replace "VCFv4.1" with "VCFv4.0". This will have no side effect for both samtools and the current version of vcftools. Heng On Apr 7, 2011, at 4:36 PM, Cliff Yiu wrote: > Hi, > > I used samtools mpileup and bcftools to get variant vcf files and noticed the vcf format is 4.1. I then tried vcftools to analyze the data but got the following error message: > > > VCFtools - v0.1.3 > (C) Adam Auton 2009 > > Error:VCF version must be v4.0: > You are using version VCFv4.1 > > Is there any tool that I can use for vcf4.1? > > Thanks > > -C > > > ------------------------------------------------------------------------------ > Xperia(TM) PLAY > It's a major breakthrough. An authentic gaming > smartphone on the nation's most reliable network. > And it wants your games. > http://p.sf.net/sfu/verizon-sfdev_______________________________________________ > Samtools-help mailing list > Sam...@li... > https://lists.sourceforge.net/lists/listinfo/samtools-help -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. |
From: Petr D. <pd...@sa...> - 2011-04-08 18:25:52
|
Hi, the perl script and API from the VCFtools package support VCFv4.1 Best, Petr On Thu, 2011-04-07 at 15:36 -0500, Cliff Yiu wrote: > Hi, > > I used samtools mpileup and bcftools to get variant vcf files and > noticed the vcf format is 4.1. I then tried vcftools to analyze the > data but got the following error message: > > > VCFtools - v0.1.3 > (C) Adam Auton 2009 > > Error:VCF version must be v4.0: > You are using version VCFv4.1 > > Is there any tool that I can use for vcf4.1? > > Thanks > > -C > > > ------------------------------------------------------------------------------ > Xperia(TM) PLAY > It's a major breakthrough. An authentic gaming > smartphone on the nation's most reliable network. > And it wants your games. > http://p.sf.net/sfu/verizon-sfdev > _______________________________________________ Samtools-help mailing list Sam...@li... https://lists.sourceforge.net/lists/listinfo/samtools-help -- The Wellcome Trust Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE. |
From: Alec W. <al...@br...> - 2011-04-11 14:23:14
|
Picard release 1.43 12 April 2011 - Enable MergeBamAlignment to handle multiple alignments for the same read. There must be only one primary alignment, and alignments for mate pairs must be correlated with HI (hit index) tag. -Alec |
From: Alec W. <al...@br...> - 2011-05-09 14:27:08
|
Picard release 1.45 9 May 2011 - changes to avoid NPE when calling equals() method on SAMRecord - New program CollectCDnaMetrics -- beta release - Simplified CollectAlignmentSummaryMetrics.java and added PF_ALIGNED_BASES, and PF_MISMATCH_RATE to AlignedSummaryMetrics, CollectAlignmentSummaryMetrics. - BAMRemoteFileTest.java: Move location of HTTP-accessed BAM from SourceForge web to Broad web, because SF doesn't serve this file reliably (perhaps because BAM is GZIP-encoded?) - CollectAlignmentSummaryMetrics.java, MeanQualityByCycle.java, QualityScoreDistribution.java: Ignore non-primary alignments when gathering metrics. - Refactored AbstractTextFileParser, TabbedTextFileParser, and BasicTextFileParser as AbstractInputParser, TabbedInputParser, and BasicInputParser. All now have additional constructors that take InputStreams, and the old constructors that take Files are still there. Except for the name change, they should continue to work the same as before. - ExtractIlluminaBarcodes.java: Fixed Option for BASECALLS_DIR - CollectAlignmentSummaryMetrics.java: Fixed PF_HQ_ERROR_RATE, overflow error due to previous simplification of CollectAlignmentSummaryMetrics.java - RevertSam.java: Do not copy non-primary alignments into reverted SAM. - CommandLineProgram.java: Added an output of the elapsed time when a program ends. - Changes to ensure that cleanup() is called on all instances of SortingCollection and SortingLongCollection after use, to ensure that temporary files are cleaned up ASAP. - SeekableHTTPStream.java: Allow FTP as well as HTTP URL connection (patch courtesy of Lance Frohman). -Alec |
From: Alec W. <al...@br...> - 2011-05-23 15:15:52
|
Picard release 1.46 23 May 2011 - SeekableHTTPStream.java: Revert change to support FTP URLs because they aren't seekable. - New program CollectRnaSeqMetrics.java. - SAMRecord.java: Added SAMRecord.getReferencePositionAtReadPosition. - TabbedTextFileWithHeaderParser.java: added ability to retrieve full (non-split) line from Row. - Added option of providing a header array to TabbedTextFileWithHeaderParser. - Fix bug in BasicInputParser where getCurrentLine and getCurrentLineNumber methods return information about the *next* line rather than the current one. - AbstractInputParser.java: Printing out offending line when parser fails. - HsMetricsCalculator.java: Fix for a bug in calculating the HS_PENALTY_nnX numbers that was triggered if the input sequence data provided massively more coverage than needed to cover 80% of targets at nX. In cases where the coverage was below, at or modestly above target coverage levels this wasn't a problem. - FixMateInformation.java: Fix to correctly rename bam index file along without bam output when creating index. - Changes to CollectGcBias metrics to remove a number of poorly tested and documented summary metrics and replace them with two new ones called AT_DROPOUT and GC_DROPOUT that mirror the metrics produced by Illumina. - Allow for other AdapterPairs in ClippingUtility than just the ones defined in IlluminaUtil. -Alec |
From: mei ge <rge...@ya...> - 2011-05-25 19:15:26
|
Hi Alec, I use the CollectRnaSeqMetrics in the new release Picard. I don't know where to get the Ribosomal_intervals file. Can you point a place for the file? Thanks Mei --- On Mon, 5/23/11, Alec Wysoker <al...@br...> wrote: From: Alec Wysoker <al...@br...> Subject: [Samtools-help] Picard release 1.46 To: "samtools help" <sam...@li...>, "samtools-devel" <sam...@li...>, sam...@li... Date: Monday, May 23, 2011, 11:15 AM Picard release 1.46 23 May 2011 - SeekableHTTPStream.java: Revert change to support FTP URLs because they aren't seekable. - New program CollectRnaSeqMetrics.java. - SAMRecord.java: Added SAMRecord.getReferencePositionAtReadPosition. - TabbedTextFileWithHeaderParser.java: added ability to retrieve full (non-split) line from Row. - Added option of providing a header array to TabbedTextFileWithHeaderParser. - Fix bug in BasicInputParser where getCurrentLine and getCurrentLineNumber methods return information about the *next* line rather than the current one. - AbstractInputParser.java: Printing out offending line when parser fails. - HsMetricsCalculator.java: Fix for a bug in calculating the HS_PENALTY_nnX numbers that was triggered if the input sequence data provided massively more coverage than needed to cover 80% of targets at nX. In cases where the coverage was below, at or modestly above target coverage levels this wasn't a problem. - FixMateInformation.java: Fix to correctly rename bam index file along without bam output when creating index. - Changes to CollectGcBias metrics to remove a number of poorly tested and documented summary metrics and replace them with two new ones called AT_DROPOUT and GC_DROPOUT that mirror the metrics produced by Illumina. - Allow for other AdapterPairs in ClippingUtility than just the ones defined in IlluminaUtil. -Alec ------------------------------------------------------------------------------ What Every C/C++ and Fortran developer Should Know! Read this article and learn how Intel has extended the reach of its next-generation tools to help Windows* and Linux* C/C++ and Fortran developers boost performance applications - including clusters. http://p.sf.net/sfu/intel-dev2devmay _______________________________________________ Samtools-help mailing list Sam...@li... https://lists.sourceforge.net/lists/listinfo/samtools-help |
From: Alec W. <al...@br...> - 2011-05-25 20:54:44
|
Hi Mei, Are you looking for human? If so, what genome build? -Alec On 5/25/11 3:15 PM, mei ge wrote: > Hi Alec, > I use the CollectRnaSeqMetrics in the new release Picard. I don't > know where to get the Ribosomal_intervals file. Can you point a place > for the file? > Thanks > Mei > |
From: mei ge <rge...@ya...> - 2011-05-26 13:54:08
|
Hi Alec: I am looking for human (hg19) and mouse(mm9). Thanks Robin --- On Wed, 5/25/11, Alec Wysoker <al...@br...> wrote: From: Alec Wysoker <al...@br...> Subject: Re: [Samtools-help] Picard release 1.46 To: "mei ge" <rge...@ya...> Cc: "samtools help" <sam...@li...> Date: Wednesday, May 25, 2011, 4:52 PM Hi Mei, Are you looking for human? If so, what genome build? -Alec On 5/25/11 3:15 PM, mei ge wrote: Hi Alec, I use the CollectRnaSeqMetrics in the new release Picard. I don't know where to get the Ribosomal_intervals file. Can you point a place for the file? Thanks Mei |
From: Alec W. <al...@br...> - 2011-05-26 14:58:30
|
@HD VN:1.0 SO:unsorted @SQ SN:1 LN:249250621 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:1b22b98cdeb4a9304cb5d48026a85128 SP:Homo Sapiens @SQ SN:2 LN:243199373 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:a0d9851da00400dec1098a9255ac712e SP:Homo Sapiens @SQ SN:3 LN:198022430 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:fdfd811849cc2fadebc929bb925902e5 SP:Homo Sapiens @SQ SN:4 LN:191154276 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:23dccd106897542ad87d2765d28a19a1 SP:Homo Sapiens @SQ SN:5 LN:180915260 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:0740173db9ffd264d728f32784845cd7 SP:Homo Sapiens @SQ SN:6 LN:171115067 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:1d3a93a248d92a729ee764823acbbc6b SP:Homo Sapiens @SQ SN:7 LN:159138663 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:618366e953d6aaad97dbe4777c29375e SP:Homo Sapiens @SQ SN:8 LN:146364022 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:96f514a9929e410c6651697bded59aec SP:Homo Sapiens @SQ SN:9 LN:141213431 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:3e273117f15e0a400f01055d9f393768 SP:Homo Sapiens @SQ SN:10 LN:135534747 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:988c28e000e84c26d552359af1ea2e1d SP:Homo Sapiens @SQ SN:11 LN:135006516 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:98c59049a2df285c76ffb1c6db8f8b96 SP:Homo Sapiens @SQ SN:12 LN:133851895 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:51851ac0e1a115847ad36449b0015864 SP:Homo Sapiens @SQ SN:13 LN:115169878 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:283f8d7892baa81b510a015719ca7b0b SP:Homo Sapiens @SQ SN:14 LN:107349540 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:98f3cae32b2a2e9524bc19813927542e SP:Homo Sapiens @SQ SN:15 LN:102531392 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:e5645a794a8238215b2cd77acb95a078 SP:Homo Sapiens @SQ SN:16 LN:90354753 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:fc9b1a7b42b97a864f56b348b06095e6 SP:Homo Sapiens @SQ SN:17 LN:81195210 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:351f64d4f4f9ddd45b35336ad97aa6de SP:Homo Sapiens @SQ SN:18 LN:78077248 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:b15d4b2d29dde9d3e4f93d1d0f2cbc9c SP:Homo Sapiens @SQ SN:19 LN:59128983 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:1aacd71f30db8e561810913e0b72636d SP:Homo Sapiens @SQ SN:20 LN:63025520 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:0dec9660ec1efaaf33281c0d5ea2560f SP:Homo Sapiens @SQ SN:21 LN:48129895 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:2979a6085bfe28e3ad6f552f361ed74d SP:Homo Sapiens @SQ SN:22 LN:51304566 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:a718acaa6135fdca8357d5bfe94211dd SP:Homo Sapiens @SQ SN:X LN:155270560 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:7e0e2e580297b7764e31dbc80c2540dd SP:Homo Sapiens @SQ SN:Y LN:59373566 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:1fa3474750af0948bdf97d5a0ee52e51 SP:Homo Sapiens @SQ SN:MT LN:16569 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:c68f52674c9fb33aef52dcf399755519 SP:Homo Sapiens @SQ SN:GL000207.1 LN:4262 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:f3814841f1939d3ca19072d9e89f3fd7 SP:Homo Sapiens @SQ SN:GL000226.1 LN:15008 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:1c1b2cd1fccbc0a99b6a447fa24d1504 SP:Homo Sapiens @SQ SN:GL000229.1 LN:19913 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:d0f40ec87de311d8e715b52e4c7062e1 SP:Homo Sapiens @SQ SN:GL000231.1 LN:27386 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:ba8882ce3a1efa2080e5d29b956568a4 SP:Homo Sapiens @SQ SN:GL000210.1 LN:27682 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:851106a74238044126131ce2a8e5847c SP:Homo Sapiens @SQ SN:GL000239.1 LN:33824 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:99795f15702caec4fa1c4e15f8a29c07 SP:Homo Sapiens @SQ SN:GL000235.1 LN:34474 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:118a25ca210cfbcdfb6c2ebb249f9680 SP:Homo Sapiens @SQ SN:GL000201.1 LN:36148 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:dfb7e7ec60ffdcb85cb359ea28454ee9 SP:Homo Sapiens @SQ SN:GL000247.1 LN:36422 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:7de00226bb7df1c57276ca6baabafd15 SP:Homo Sapiens @SQ SN:GL000245.1 LN:36651 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:89bc61960f37d94abf0df2d481ada0ec SP:Homo Sapiens @SQ SN:GL000197.1 LN:37175 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:6f5efdd36643a9b8c8ccad6f2f1edc7b SP:Homo Sapiens @SQ SN:GL000203.1 LN:37498 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:96358c325fe0e70bee73436e8bb14dbd SP:Homo Sapiens @SQ SN:GL000246.1 LN:38154 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:e4afcd31912af9d9c2546acf1cb23af2 SP:Homo Sapiens @SQ SN:GL000249.1 LN:38502 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:1d78abec37c15fe29a275eb08d5af236 SP:Homo Sapiens @SQ SN:GL000196.1 LN:38914 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:d92206d1bb4c3b4019c43c0875c06dc0 SP:Homo Sapiens @SQ SN:GL000248.1 LN:39786 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:5a8e43bec9be36c7b49c84d585107776 SP:Homo Sapiens @SQ SN:GL000244.1 LN:39929 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:0996b4475f353ca98bacb756ac479140 SP:Homo Sapiens @SQ SN:GL000238.1 LN:39939 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:131b1efc3270cc838686b54e7c34b17b SP:Homo Sapiens @SQ SN:GL000202.1 LN:40103 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:06cbf126247d89664a4faebad130fe9c SP:Homo Sapiens @SQ SN:GL000234.1 LN:40531 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:93f998536b61a56fd0ff47322a911d4b SP:Homo Sapiens @SQ SN:GL000232.1 LN:40652 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:3e06b6741061ad93a8587531307057d8 SP:Homo Sapiens @SQ SN:GL000206.1 LN:41001 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:43f69e423533e948bfae5ce1d45bd3f1 SP:Homo Sapiens @SQ SN:GL000240.1 LN:41933 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:445a86173da9f237d7bcf41c6cb8cc62 SP:Homo Sapiens @SQ SN:GL000236.1 LN:41934 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:fdcd739913efa1fdc64b6c0cd7016779 SP:Homo Sapiens @SQ SN:GL000241.1 LN:42152 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:ef4258cdc5a45c206cea8fc3e1d858cf SP:Homo Sapiens @SQ SN:GL000243.1 LN:43341 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:cc34279a7e353136741c9fce79bc4396 SP:Homo Sapiens @SQ SN:GL000242.1 LN:43523 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:2f8694fc47576bc81b5fe9e7de0ba49e SP:Homo Sapiens @SQ SN:GL000230.1 LN:43691 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:b4eb71ee878d3706246b7c1dbef69299 SP:Homo Sapiens @SQ SN:GL000237.1 LN:45867 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:e0c82e7751df73f4f6d0ed30cdc853c0 SP:Homo Sapiens @SQ SN:GL000233.1 LN:45941 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:7fed60298a8d62ff808b74b6ce820001 SP:Homo Sapiens @SQ SN:GL000204.1 LN:81310 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:efc49c871536fa8d79cb0a06fa739722 SP:Homo Sapiens @SQ SN:GL000198.1 LN:90085 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:868e7784040da90d900d2d1b667a1383 SP:Homo Sapiens @SQ SN:GL000208.1 LN:92689 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:aa81be49bf3fe63a79bdc6a6f279abf6 SP:Homo Sapiens @SQ SN:GL000191.1 LN:106433 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:d75b436f50a8214ee9c2a51d30b2c2cc SP:Homo Sapiens @SQ SN:GL000227.1 LN:128374 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:a4aead23f8053f2655e468bcc6ecdceb SP:Homo Sapiens @SQ SN:GL000228.1 LN:129120 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:c5a17c97e2c1a0b6a9cc5a6b064b714f SP:Homo Sapiens @SQ SN:GL000214.1 LN:137718 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:46c2032c37f2ed899eb41c0473319a69 SP:Homo Sapiens @SQ SN:GL000221.1 LN:155397 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:3238fb74ea87ae857f9c7508d315babb SP:Homo Sapiens @SQ SN:GL000209.1 LN:159169 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:f40598e2a5a6b26e84a3775e0d1e2c81 SP:Homo Sapiens @SQ SN:GL000218.1 LN:161147 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:1d708b54644c26c7e01c2dad5426d38c SP:Homo Sapiens @SQ SN:GL000220.1 LN:161802 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:fc35de963c57bf7648429e6454f1c9db SP:Homo Sapiens @SQ SN:GL000213.1 LN:164239 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:9d424fdcc98866650b58f004080a992a SP:Homo Sapiens @SQ SN:GL000211.1 LN:166566 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:7daaa45c66b288847b9b32b964e623d3 SP:Homo Sapiens @SQ SN:GL000199.1 LN:169874 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:569af3b73522fab4b40995ae4944e78e SP:Homo Sapiens @SQ SN:GL000217.1 LN:172149 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:6d243e18dea1945fb7f2517615b8f52e SP:Homo Sapiens @SQ SN:GL000216.1 LN:172294 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:642a232d91c486ac339263820aef7fe0 SP:Homo Sapiens @SQ SN:GL000215.1 LN:172545 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:5eb3b418480ae67a997957c909375a73 SP:Homo Sapiens @SQ SN:GL000205.1 LN:174588 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:d22441398d99caf673e9afb9a1908ec5 SP:Homo Sapiens @SQ SN:GL000219.1 LN:179198 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:f977edd13bac459cb2ed4a5457dba1b3 SP:Homo Sapiens @SQ SN:GL000224.1 LN:179693 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:d5b2fc04f6b41b212a4198a07f450e20 SP:Homo Sapiens @SQ SN:GL000223.1 LN:180455 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:399dfa03bf32022ab52a846f7ca35b30 SP:Homo Sapiens @SQ SN:GL000195.1 LN:182896 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:5d9ec007868d517e73543b005ba48535 SP:Homo Sapiens @SQ SN:GL000212.1 LN:186858 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:563531689f3dbd691331fd6c5730a88b SP:Homo Sapiens @SQ SN:GL000222.1 LN:186861 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:6fe9abac455169f50470f5a6b01d0f59 SP:Homo Sapiens @SQ SN:GL000200.1 LN:187035 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:75e4c8d17cd4addf3917d1703cacaf25 SP:Homo Sapiens @SQ SN:GL000193.1 LN:189789 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:dbb6e8ece0b5de29da56601613007c2a SP:Homo Sapiens @SQ SN:GL000194.1 LN:191469 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:6ac8f815bf8e845bb3031b73f812c012 SP:Homo Sapiens @SQ SN:GL000225.1 LN:211173 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:63945c3e6962f28ffd469719a747e73c SP:Homo Sapiens @SQ SN:GL000192.1 LN:547496 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:GRCh37 M5:325ba9e808f669dfeee210fdd7b470ac SP:Homo Sapiens @SQ SN:NC_007605 LN:171823 UR:http://www.broadinstitute.org/ftp/pub/seq/references/Homo_sapiens_assembly19.fasta AS:NC_007605.1 M5:6743bd63b3ff2b5b8985d8933c53290a SP:Epstein-Barr virus 1 9497728 9497837 - 5S_rRNA.493 1 13949679 13949779 - 5S_rRNA.311 1 34578550 34578664 + 5S_rRNA.116 1 37730278 37730387 - 5S_rRNA.427 1 39619836 39619968 - 5S_rRNA.277 1 41932608 41932699 - 5S_rRNA.447 1 43662088 43662207 + 5S_rRNA.2 1 45397995 45398103 + 5S_rRNA.503 1 52439082 52439207 + 5S_rRNA.100 1 63652007 63652121 - 5S_rRNA.408 1 74215200 74215306 - 5S_rRNA.85 1 78080554 78080637 + 5S_rRNA.361 1 78245589 78245684 + 5S_rRNA.306 1 78560491 78560599 - 5S_rRNA.187 1 78840849 78840953 - 5S_rRNA.224 1 86349363 86349481 - 5S_rRNA.146 1 87918923 87919056 - 5S_rRNA.6 1 93953890 93954009 - 5S_rRNA.247 1 111584456 111584547 + 5S_rRNA.471 1 117504969 117505093 - 5S_rRNA.440 1 118806995 118807112 - 5S_rRNA.263 1 146124161 146124283 + 5S_rRNA.343 1 147665993 147666115 + 5S_rRNA.341 1 148913273 148913382 - 5S_rRNA.307 1 159148263 159148349 + 5S_rRNA.292 1 162308433 162308532 + 5S_rRNA.411 1 163438286 163438395 - 5S_rRNA.74 1 163479274 163479385 + 5S_rRNA.239 1 166011481 166011587 + 5S_rRNA.208 1 166975196 166975314 + 5S_rRNA.56 1 169036502 169036602 + 5S_rRNA.500 1 173890208 173890338 + 5S_rRNA.405 1 173938456 173938574 - 5S_rRNA.97 1 178530048 178530164 + 5S_rRNA.125 1 181740702 181740780 + 5S_rRNA.499 1 182913500 182913625 - 5S_rRNA.43 1 184984083 184984199 - 5S_rRNA.123 1 189635279 189635386 - 5S_rRNA.446 1 204531541 204531649 - 5S_rRNA.76 1 204676448 204676558 - 5S_rRNA.459 1 219935131 219935217 + 5S_rRNA.393 1 227748882 227749001 + 5S_rRNA.188 1 228743494 228743602 + 5S_rRNA.176 1 228746015 228746133 - RN5S1 1 228748256 228748374 - RN5S2 1 228750497 228750615 - RN5S3 1 228752738 228752856 - RN5S4 1 228754979 228755097 - RN5S5 1 228757194 228757312 - RN5S6 1 228759414 228759532 - RN5S7 1 228761656 228761774 - RN5S8 1 228763895 228764013 - RN5S9 1 228766137 228766255 - RN5S10 1 228768378 228768496 - RN5S11 1 228770618 228770736 - RN5S12 1 228772843 228772961 - RN5S13 1 228775084 228775202 - RN5S14 1 228777315 228777433 - RN5S15 1 228779556 228779674 - RN5S16 1 228781787 228781905 - RN5S17 1 228783659 228783779 - 5S_rRNA.214 1 229685652 229685769 + 5S_rRNA.136 1 230955996 230956104 + 5S_rRNA.178 1 231417160 231417270 - 5S_rRNA.280 1 242297574 242297683 + 5S_rRNA.254 1 247367872 247367963 - 5S_rRNA.442 2 11657523 11657629 + 5S_rRNA.144 2 11701787 11701905 - 5S_rRNA.186 2 20601388 20601485 - 5S_rRNA.211 2 22561758 22561875 - 5S_rRNA.204 2 24787499 24787607 - 5S_rRNA.200 2 28906842 28906949 + 5S_rRNA.265 2 31451178 31451276 + 5S_rRNA.147 2 33510836 33510954 - 5S_rRNA.213 2 33557965 33558074 + 5S_rRNA.406 2 56462352 56462465 - 5S_rRNA.12 2 59921897 59922004 + 5S_rRNA.467 2 61225887 61226023 + 5S_rRNA.269 2 69409029 69409135 - 5S_rRNA.365 2 74195250 74195333 - 5S_rRNA.164 2 77000035 77000171 + 5S_rRNA.55 2 81723338 81723451 - 5S_rRNA.104 2 91862883 91862992 + 5S_rRNA.304 2 97622445 97622552 - 5S_rRNA.478 2 125690492 125690597 + 5S_rRNA.463 2 129202655 129202772 - 5S_rRNA.336 2 133010727 133010878 - 5_8S_rRNA.5 2 134988766 134988883 + 5S_rRNA.309 2 138269668 138269780 - 5S_rRNA.232 2 148471024 148471123 - 5S_rRNA.509 2 155446919 155447040 + 5S_rRNA.333 2 162266065 162266181 + 5S_rRNA.448 2 163353446 163353561 + 5S_rRNA.225 2 165714942 165715056 + 5S_rRNA.22 2 165752187 165752287 - 5S_rRNA.335 2 178003427 178003525 + 5S_rRNA.353 2 182913549 182913632 + 5S_rRNA.282 2 189141459 189141571 - 5S_rRNA.90 2 201713761 201713867 - 5S_rRNA.321 2 208918885 208918982 - 5S_rRNA.376 2 209785271 209785388 - 5S_rRNA.181 2 210586637 210586745 - 5S_rRNA.275 2 212400378 212400494 + 5S_rRNA.31 2 217658212 217658323 - 5S_rRNA.303 2 228627768 228627868 - 5S_rRNA.424 2 242524383 242524490 + 5S_rRNA.206 3 12552594 12552712 - 5S_rRNA.60 3 14436148 14436257 + 5S_rRNA.33 3 25066498 25066617 + 5S_rRNA.152 3 25393775 25393883 - 5S_rRNA.170 3 31270207 31270339 + 5S_rRNA.314 3 33533475 33533590 - 5S_rRNA.182 3 37383140 37383257 + 5S_rRNA.262 3 49504548 49504661 - 5S_rRNA.122 3 50457212 50457330 + 5S_rRNA.191 3 51728481 51728598 - 5S_rRNA.143 3 56219132 56219250 + 5S_rRNA.65 3 63531116 63531211 + 5S_rRNA.417 3 68932118 68932242 + 5S_rRNA.476 3 72740856 72740962 - 5S_rRNA.305 3 124777148 124777282 - 5S_rRNA.456 3 126283408 126283521 - 5S_rRNA.171 3 127682451 127682567 + 5S_rRNA.119 3 133428920 133429011 - 5S_rRNA.128 3 134502278 134502397 - 5S_rRNA.129 3 137236976 137237097 - 5S_rRNA.124 3 142310519 142310633 - 5S_rRNA.468 3 143905747 143905862 - 5S_rRNA.301 3 150905886 150906010 + 5S_rRNA.53 3 156891707 156891829 - 5S_rRNA.296 3 176481021 176481129 + 5S_rRNA.418 3 178146985 178147078 - 5S_rRNA.422 3 179879674 179879765 - 5S_rRNA.160 3 181540660 181540778 - 5S_rRNA.44 3 182976088 182976204 - 5S_rRNA.203 4 8394742 8394851 + 5S_rRNA.175 4 9412904 9412979 - 5S_rRNA.419 4 9794375 9794485 - 5S_rRNA.385 4 10117380 10117508 - 5S_rRNA.320 4 11415675 11415784 - 5S_rRNA.259 4 19183437 19183479 - 5S_rRNA.362 4 38760412 38760513 - 5S_rRNA.264 4 39938373 39938475 + 5S_rRNA.497 4 40992171 40992290 + 5S_rRNA.155 4 56963556 56963670 + 5S_rRNA.96 4 57197343 57197383 - 5S_rRNA.363 4 72625235 72625333 - 5S_rRNA.266 4 94741322 94741431 + 5S_rRNA.423 4 148666454 148666568 - 5S_rRNA.492 4 149479099 149479200 + 5S_rRNA.161 4 150828712 150828827 + 5S_rRNA.193 4 151891823 151891934 - 5S_rRNA.435 4 152892649 152892765 - 5S_rRNA.507 4 166974339 166974439 - 5S_rRNA.495 4 167839515 167839629 - 5S_rRNA.230 4 178378265 178378382 - 5S_rRNA.167 4 179327849 179327984 + 5S_rRNA.153 4 190936293 190936410 + 5S_rRNA.30 4 190938851 190938969 + 5S_rRNA.115 5 7047340 7047449 + 5S_rRNA.218 5 9761019 9761125 + 5S_rRNA.250 5 16033338 16033465 - 5S_rRNA.274 5 16854103 16854223 + 5S_rRNA.510 5 17157080 17157180 - 5S_rRNA.381 5 36485445 36485542 + 5S_rRNA.258 5 51182522 51182603 + 5S_rRNA.367 5 55073297 55073415 + 5S_rRNA.7 5 55442566 55442673 - 5S_rRNA.315 5 55556160 55556273 - 5S_rRNA.219 5 75387533 75387626 - 5S_rRNA.477 5 87570075 87570193 - 5S_rRNA.205 5 101466711 101466839 - 5S_rRNA.39 5 105258695 105258809 - 5S_rRNA.158 5 118781411 118781528 + 5S_rRNA.179 5 129450102 129450203 + 5S_rRNA.20 5 132184269 132184388 + 5S_rRNA.120 5 136304889 136304995 - 5S_rRNA.271 5 138348018 138348130 + 5S_rRNA.140 5 138635541 138635657 + 5S_rRNA.27 5 145965662 145965763 - 5S_rRNA.473 5 150857020 150857138 - 5S_rRNA.64 5 151255127 151255253 - 5S_rRNA.273 5 155272447 155272538 + 5S_rRNA.68 5 172719520 172719622 - 5S_rRNA.400 6 3088920 3089043 - 5S_rRNA.268 6 4428197 4428315 + 5S_rRNA.118 6 10753024 10753159 - 5S_rRNA.278 6 17722017 17722134 - 5S_rRNA.130 6 19438514 19438649 + 5S_rRNA.138 6 32046285 32046405 - 5S_rRNA.441 6 41207258 41207364 - 5S_rRNA.212 6 68177124 68177201 - 5S_rRNA.379 6 76574956 76575052 - 5S_rRNA.326 6 82331917 82332036 + 5S_rRNA.228 6 106897256 106897372 + 5S_rRNA.197 6 108573605 108573711 + 5S_rRNA.238 6 114541845 114541950 - 5S_rRNA.227 6 117381845 117381962 + 5S_rRNA.67 6 121007767 121007888 + 5S_rRNA.72 6 126300958 126301080 - 5S_rRNA.373 6 127639028 127639118 - 5S_rRNA.358 6 134578173 134578281 + 5S_rRNA.58 6 136951381 136951493 + 5S_rRNA.216 6 140479728 140479831 - 5S_rRNA.395 6 143770946 143771048 + 5S_rRNA.328 6 146661556 146661660 + 5S_rRNA.313 6 152731410 152731529 + 5S_rRNA.173 6 153480355 153480480 - 5S_rRNA.168 6 153741573 153741707 + 5S_rRNA.302 6 165823051 165823165 + 5S_rRNA.24 7 22338213 22338325 + 5S_rRNA.382 7 24172324 24172456 + 5S_rRNA.388 7 33217352 33217450 + 5S_rRNA.299 7 44165351 44165472 - 5S_rRNA.479 7 68188898 68189035 - 5S_rRNA.285 7 71378723 71378841 - 5S_rRNA.108 7 73901758 73901870 - 5S_rRNA.337 7 79283425 79283543 + 5S_rRNA.98 7 82647214 82647327 + 5S_rRNA.189 7 106422046 106422161 - 5S_rRNA.380 7 111593708 111593820 - 5S_rRNA.276 7 114253842 114253961 - 5S_rRNA.194 7 116584340 116584468 - 5S_rRNA.257 7 120621480 120621607 - 5S_rRNA.185 7 120723754 120723867 - 5S_rRNA.249 7 128337493 128337610 + 5S_rRNA.110 7 128337764 128337798 + 5S_rRNA.482 7 129396106 129396217 + 5S_rRNA.215 7 129667117 129667237 + 5S_rRNA.114 7 130287672 130287790 - 5S_rRNA.346 7 140070241 140070345 + 5S_rRNA.322 7 140086581 140086707 - 5S_rRNA.201 7 147546976 147547074 + 5S_rRNA.375 7 152290058 152290176 - 5S_rRNA.14 8 3558014 3558150 + 5S_rRNA.291 8 10544079 10544197 + 5S_rRNA.233 8 11944905 11944954 - 5S_rRNA.387 8 12191153 12191202 - 5S_rRNA.443 8 13393538 13393644 + 5S_rRNA.401 8 17605627 17605739 + 5S_rRNA.221 8 20147685 20147800 + 5S_rRNA.48 8 25994396 25994514 + 5S_rRNA.34 8 28387928 28388036 + 5S_rRNA.198 8 28906011 28906121 - 5S_rRNA.94 8 31455615 31455720 - 5S_rRNA.288 8 32049544 32049669 - 5S_rRNA.410 8 32114012 32114139 + 5S_rRNA.66 8 36621963 36622069 + 5S_rRNA.52 8 56657791 56657901 + 5S_rRNA.312 8 58039214 58039321 - 5S_rRNA.260 8 60368444 60368569 + 5S_rRNA.156 8 68011537 68011660 + 5S_rRNA.455 8 69607878 69608014 + 5S_rRNA.349 8 70842140 70842257 + 5S_rRNA.334 8 73269963 73270050 + 5S_rRNA.439 8 89408666 89408776 - 5S_rRNA.371 8 91523356 91523468 + 5S_rRNA.9 8 94453505 94453612 + 5S_rRNA.281 8 108896722 108896839 + 5S_rRNA.99 8 117034911 117035004 - 5S_rRNA.18 8 120903372 120903469 - 5S_rRNA.195 8 141950661 141950762 + 5S_rRNA.481 9 1333837 1333953 + 5S_rRNA.196 9 27198285 27198399 + 5S_rRNA.325 9 32293556 32293690 + 5S_rRNA.420 9 34206132 34206239 - 5S_rRNA.398 9 66458144 66458262 - 5S_rRNA.202 9 68408924 68409041 + 5S_rRNA.207 9 74914636 74914754 - 5S_rRNA.433 9 76365888 76365993 - 5S_rRNA.234 9 84500411 84500513 + 5S_rRNA.399 9 97904731 97904823 - 5S_rRNA.368 9 98667287 98667423 + 5S_rRNA.1 9 101908122 101908248 - 5S_rRNA.490 9 106451755 106451859 - 5S_rRNA.502 9 109606030 109606147 - 5S_rRNA.220 9 110681147 110681259 - 5S_rRNA.287 9 111754689 111754849 - 5_8S_rRNA.3 9 114173924 114174022 - 5S_rRNA.329 9 115014538 115014658 + 5S_rRNA.25 9 130603391 130603501 - 5S_rRNA.286 10 200771 200883 - 5S_rRNA.226 10 327955 328065 - 5S_rRNA.163 10 8698679 8698794 - 5S_rRNA.237 10 12966679 12966785 - 5S_rRNA.236 10 13769796 13769916 + 5S_rRNA.15 10 14705394 14705493 - 5S_rRNA.157 10 19922612 19922741 - 5S_rRNA.384 10 23286184 23286287 + 5S_rRNA.323 10 24929095 24929213 + 5S_rRNA.26 10 26102740 26102846 - 5S_rRNA.317 10 26798518 26798633 + 5S_rRNA.37 10 29163322 29163436 - 5S_rRNA.13 10 31548528 31548633 + 5S_rRNA.437 10 46351755 46351870 + 5S_rRNA.347 10 46742028 46742143 + 5S_rRNA.338 10 47222341 47222456 + 5S_rRNA.483 10 48206895 48207010 - 5S_rRNA.470 10 48931789 48931904 + 5S_rRNA.279 10 49248476 49248591 + 5S_rRNA.396 10 51255380 51255495 + 5S_rRNA.251 10 51739425 51739540 - 5S_rRNA.252 10 55217957 55218048 + 5S_rRNA.397 10 70221164 70221282 + 5S_rRNA.35 10 75466248 75466371 + 5S_rRNA.392 10 79346807 79346917 + 5S_rRNA.32 10 87812205 87812323 + 5S_rRNA.80 10 95270305 95270437 - 5S_rRNA.231 10 98379133 98379240 + 5S_rRNA.174 10 108729779 108729908 + 5S_rRNA.88 10 109221318 109221427 + 5S_rRNA.59 10 112520900 112521000 + 5S_rRNA.10 10 127834043 127834151 - 5S_rRNA.298 11 6038226 6038344 + 5S_rRNA.180 11 8866810 8866905 + 5S_rRNA.485 11 13929031 13929137 - 5S_rRNA.162 11 14156532 14156646 + 5S_rRNA.229 11 18269936 18270053 - 5S_rRNA.75 11 18287940 18288057 + 5S_rRNA.149 11 19423419 19423528 - 5S_rRNA.91 11 21022427 21022543 - 5S_rRNA.111 11 21405347 21405455 - 5S_rRNA.475 11 22709897 22709981 + 5S_rRNA.284 11 27543152 27543273 - 5S_rRNA.222 11 47825848 47825954 - 5S_rRNA.489 11 57217656 57217772 + 5S_rRNA.391 11 71556242 71556291 - 5S_rRNA.484 11 73909793 73909899 - 5S_rRNA.89 11 75645980 75646083 + 5S_rRNA.316 11 95573181 95573288 - 5S_rRNA.293 11 96207736 96207856 - 5S_rRNA.77 11 97528464 97528582 + 5S_rRNA.5 11 104123307 104123343 + 5S_rRNA.407 11 108991605 108991713 - 5S_rRNA.86 11 110910822 110910932 - 5S_rRNA.62 11 111799124 111799232 + 5S_rRNA.345 11 124506355 124506488 + 5S_rRNA.69 12 13593818 13593935 - 5S_rRNA.135 12 26526531 26526666 - 5S_rRNA.242 12 28658327 28658461 + 5S_rRNA.183 12 30459364 30459457 - 5S_rRNA.360 12 34358634 34358752 + 5S_rRNA.36 12 38555268 38555384 - 5S_rRNA.45 12 38557205 38557317 - 5S_rRNA.151 12 42052478 42052595 + 5S_rRNA.425 12 45511554 45511671 + 5S_rRNA.28 12 66460001 66460118 + 5S_rRNA.366 12 80272085 80272199 - 5S_rRNA.169 12 88445478 88445555 + 5S_rRNA.469 12 90120713 90120819 + 5S_rRNA.403 12 99477561 99477683 - 5S_rRNA.192 12 101866117 101866234 + 5S_rRNA.426 12 102173592 102173707 + 5S_rRNA.117 12 102230598 102230705 + 5S_rRNA.318 12 104519014 104519134 + 5S_rRNA.16 12 107880288 107880412 - 5S_rRNA.272 12 109497791 109497905 + 5S_rRNA.209 12 111777331 111777425 + 5S_rRNA.508 12 119267486 119267610 + 5S_rRNA.42 12 123767463 123767581 - 5S_rRNA.46 12 131784185 131784296 - 5S_rRNA.217 12 132145608 132145719 + 5S_rRNA.210 12 132285860 132285979 - 5S_rRNA.81 12 133300531 133300643 - 5S_rRNA.134 13 19662627 19662721 + 5S_rRNA.415 13 21943929 21944038 - 5S_rRNA.300 13 38428989 38429107 - 5S_rRNA.3 13 46125620 46125738 - 5S_rRNA.92 13 51387124 51387242 - 5S_rRNA.93 13 51601883 51601988 - 5S_rRNA.308 13 58565484 58565598 + 5S_rRNA.199 13 61187742 61187868 + 5S_rRNA.319 13 72552226 72552344 + 5S_rRNA.57 13 79963669 79963766 - 5S_rRNA.438 13 90787966 90788095 - 5S_rRNA.465 13 94677724 94677852 - 5S_rRNA.243 13 95303651 95303783 - 5S_rRNA.256 13 98015226 98015355 - 5S_rRNA.190 13 106807721 106807839 - 5S_rRNA.297 13 108953560 108953679 - 5S_rRNA.327 14 20558289 20558386 + 5S_rRNA.501 14 20614276 20614314 - 5S_rRNA.369 14 20883146 20883257 + 5S_rRNA.63 14 24627812 24627931 + 5S_rRNA.40 14 50019351 50019466 + 5S_rRNA.432 14 52159470 52159579 + 5S_rRNA.354 14 68123822 68123928 - 5S_rRNA.17 14 76070516 76070621 + 5S_rRNA.113 14 78644146 78644264 - 5S_rRNA.19 14 107092677 107092791 + 5S_rRNA.21 15 25983655 25983767 + 5S_rRNA.255 15 27525465 27525583 - 5S_rRNA.73 15 40623738 40623849 - 5S_rRNA.466 15 42144111 42144227 - 5S_rRNA.70 15 50274640 50274758 + 5S_rRNA.112 15 50755578 50755695 + 5S_rRNA.429 15 59866302 59866421 - 5S_rRNA.150 15 61028968 61029085 + 5S_rRNA.248 15 69659924 69660020 + 5S_rRNA.498 15 72150911 72151027 + 5S_rRNA.103 15 86681500 86681610 - 5S_rRNA.496 15 98015356 98015482 - 5S_rRNA.352 15 100614286 100614395 - 5S_rRNA.494 16 9658247 9658354 - 5S_rRNA.78 16 9693954 9694085 + 5S_rRNA.487 16 26040233 26040346 + 5S_rRNA.50 16 33965426 33965577 + 5_8S_rRNA.2 16 34968764 34968875 + 5S_rRNA.372 16 34969010 34969119 + 5S_rRNA.261 16 34969743 34969842 + 5S_rRNA.374 16 34969996 34970104 + 5S_rRNA.454 16 34980360 34980465 + 5S_rRNA.394 16 34980894 34981007 + 5S_rRNA.377 16 34981143 34981265 + 5S_rRNA.389 16 34981403 34981511 + 5S_rRNA.402 16 34982891 34983026 + 5S_rRNA.430 16 34984432 34984546 + 5S_rRNA.505 16 34985945 34986055 + 5S_rRNA.457 16 34986199 34986306 + 5S_rRNA.413 16 34987197 34987313 + 5S_rRNA.355 16 34988167 34988301 + 5S_rRNA.351 16 34988417 34988553 + 5S_rRNA.474 16 34988926 34989059 + 5S_rRNA.462 16 34989294 34989404 + 5S_rRNA.449 16 34989548 34989656 + 5S_rRNA.488 16 47484258 47484372 - 5S_rRNA.223 16 47539343 47539454 + 5S_rRNA.270 16 50903366 50903486 - 5S_rRNA.38 16 53371365 53371483 + 5S_rRNA.184 16 66335712 66335843 + 5S_rRNA.172 16 68776383 68776491 + 5S_rRNA.83 16 75894946 75895055 - 5S_rRNA.4 16 79298351 79298470 + 5S_rRNA.267 16 84073523 84073638 - 5S_rRNA.8 16 84296421 84296534 - 5S_rRNA.127 17 3960050 3960170 - 5S_rRNA.436 17 6504264 6504371 - 5S_rRNA.105 17 15685655 15685769 + 5S_rRNA.364 17 30290824 30290952 - 5S_rRNA.506 17 32447075 32447186 - 5S_rRNA.428 17 34847043 34847166 - 5S_rRNA.486 17 36888055 36888173 - 5S_rRNA.61 17 38530341 38530428 + 5S_rRNA.452 17 39874406 39874512 + 5S_rRNA.165 17 43404732 43404863 - 5S_rRNA.51 17 64263861 64263966 + 5S_rRNA.460 17 64635387 64635501 + 5S_rRNA.350 17 64891608 64891689 + 5S_rRNA.356 17 65264226 65264344 + 5S_rRNA.87 17 72413733 72413843 - 5S_rRNA.54 18 9844714 9844843 - 5S_rRNA.464 18 9923917 9924018 + 5S_rRNA.324 18 19508946 19509055 - 5S_rRNA.290 18 21750469 21750587 - 5S_rRNA.47 18 29756013 29756131 - 5S_rRNA.11 18 39850554 39850680 + 5S_rRNA.504 18 41651548 41651666 + 5S_rRNA.177 18 46001858 46001971 - 5S_rRNA.102 18 47472636 47472759 - 5S_rRNA.82 18 47845372 47845490 + 5S_rRNA.383 18 52813777 52813876 - 5S_rRNA.434 18 70419187 70419286 + 5S_rRNA.451 18 75575420 75575523 - 5S_rRNA.409 19 453134 453245 + 5S_rRNA.444 19 7951862 7951981 - 5S_rRNA.49 19 12107624 12107731 + 5S_rRNA.390 19 12138728 12138836 - 5S_rRNA.235 19 12180979 12181086 + 5S_rRNA.445 19 12217312 12217429 + 5S_rRNA.245 19 18082834 18082926 + 5S_rRNA.450 19 21295934 21296042 - 5S_rRNA.109 19 24187160 24187309 - 5_8S_rRNA.4 19 29492577 29492682 + 5S_rRNA.414 19 32146262 32146367 - 5S_rRNA.472 19 32734871 32734989 - 5S_rRNA.95 19 58874805 58874929 + 5S_rRNA.404 20 5079033 5079122 - 5S_rRNA.240 20 5326652 5326806 - 5_8S_rRNA.1 20 15261410 15261519 - 5S_rRNA.289 20 18414486 18414588 + 5S_rRNA.431 20 21119597 21119715 - 5S_rRNA.23 20 23141494 23141608 - 5S_rRNA.139 20 23361533 23361649 - 5S_rRNA.154 20 29944673 29944781 + 5S_rRNA.458 20 30596154 30596275 + 5S_rRNA.453 20 30638391 30638508 + 5S_rRNA.244 20 33783109 33783226 + 5S_rRNA.71 20 39482759 39482869 - 5S_rRNA.344 20 44188397 44188515 - 5S_rRNA.166 20 46501935 46502017 + 5S_rRNA.359 20 54780108 54780229 - 5S_rRNA.253 21 15443192 15443307 + 5S_rRNA.159 21 26574521 26574629 + 5S_rRNA.340 21 33935385 33935520 + 5S_rRNA.491 21 38224211 38224328 + 5S_rRNA.41 21 43641283 43641396 + 5S_rRNA.331 22 22146723 22146813 + 5S_rRNA.106 22 26111008 26111136 - 5S_rRNA.294 22 26786633 26786742 - 5S_rRNA.412 22 31701390 31701498 + 5S_rRNA.241 22 33031088 33031184 - 5S_rRNA.370 X 1419149 1419268 - 5S_rRNA.332 X 9101869 9101987 + 5S_rRNA.101 X 29000969 29001087 - 5S_rRNA.126 X 32120182 32120293 + 5S_rRNA.310 X 41152051 41152169 + 5S_rRNA.29 X 48065693 48065809 + 5S_rRNA.480 X 52694281 52694397 + 5S_rRNA.295 X 53935488 53935595 + 5S_rRNA.145 X 68892323 68892441 + 5S_rRNA.148 X 69472892 69473012 + 5S_rRNA.79 X 75875410 75875528 - 5S_rRNA.330 X 76287130 76287248 + 5S_rRNA.133 X 94837063 94837161 + 5S_rRNA.141 X 103105726 103105840 - 5S_rRNA.283 X 110913057 110913176 - 5S_rRNA.131 X 128570507 128570624 - 5S_rRNA.137 X 131072874 131072984 - 5S_rRNA.84 X 134562093 134562210 - 5S_rRNA.132 X 141498839 141498957 + 5S_rRNA.107 X 145508974 145509090 - 5S_rRNA.142 X 147089620 147089735 - 5S_rRNA.121 X 150194767 150194878 - 5S_rRNA.246 Y 9928019 9928137 - 5S_rRNA.357 Y 9930484 9930602 - 5S_rRNA.416 Y 10037764 10037915 + 5_8S_rRNA.6 Y 19669744 19669856 + 5S_rRNA.421 Y 19671654 19671769 + 5S_rRNA.386 Y 20508009 20508124 - 5S_rRNA.378 Y 20509921 20510033 - 5S_rRNA.461 MT 648 1601 + J01415.23 MT 1671 3229 + MTRNR2L2 |
From: mei ge <rge...@ya...> - 2011-05-26 18:10:04
|
Hi Alec: I used your ribosomal interval file and I got an error as follow: [Thu May 26 13:50:45 EDT 2011] net.sf.picard.analysis.CollectRnaSeqMetrics REF_FLAT=/home/gtf/hg19/refFlat_table_for_picard.txt RIBOSOMAL_INTERVALS=/home/gtf/ribosomal_RNA/Homo_sapiens_assembly19.rRNA.interval_list STRAND_SPECIFICITY=NONE INPUT=/s_1_2_pefca_2/s_1_2_pefca_2_sorted.bam OUTPUT=/s_1_2_pefca_2/s_1_2_pefca_2_RNAseq_metrics.txt REFERENCE_SEQUENCE=/databases/hs_chromosomes/hg19.fa ASSUME_SORTED=true VALIDATION_STRINGENCY=SILENT STOP_AFTER=0 VERBOSITY=INFO QUIET=false COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false INFO 2011-05-26 13:50:46 CollectRnaSeqMetrics Loaded 21987 genes. [Thu May 26 13:50:46 EDT 2011] net.sf.picard.analysis.CollectRnaSeqMetrics done. Elapsed time: 0.03 minutes. Runtime.totalMemory()=165347328 Exception in thread "main" net.sf.samtools.util.SequenceUtil$SequenceListsDifferException: Sequence dictionaries are not the same size (25, 85) at net.sf.samtools.util.SequenceUtil.assertSequenceListsEqual(SequenceUtil.java:84) at net.sf.samtools.util.SequenceUtil.assertSequenceDictionariesEqual(SequenceUtil.java:126) at net.sf.picard.analysis.CollectRnaSeqMetrics.setup(CollectRnaSeqMetrics.java:86) at net.sf.picard.analysis.SinglePassSamProgram.makeItSo(SinglePassSamProgram.java:101) at net.sf.picard.analysis.SinglePassSamProgram.doWork(SinglePassSamProgram.java:54) at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:158) at net.sf.picard.cmdline.CommandLineProgram.instanceMainWithExit(CommandLineProgram.java:118) at net.sf.picard.analysis.CollectRnaSeqMetrics.main(CollectRnaSeqMetrics.java:77) --- On Thu, 5/26/11, Alec Wysoker <al...@br...> wrote: From: Alec Wysoker <al...@br...> Subject: Re: [Samtools-help] Picard release 1.46 To: "mei ge" <rge...@ya...> Cc: "Sam...@li..." <Sam...@li...> Date: Thursday, May 26, 2011, 10:58 AM Hi Robin, Human HG19 is attached. I don't have one for mouse. I just extracted these from gencode GTF file. Presumably you could do the same for mouse. Interval list format described here: http://picard.sourceforge.net/javadoc/net/sf/picard/util/IntervalList.html Note that this file is not required by CollectRnaSeqMetrics, but if you do not provide it the program will not be able to determine when reads map to rRNA. -Alec On 5/26/11 9:54 AM, mei ge wrote: Hi Alec: I am looking for human (hg19) and mouse(mm9). Thanks Robin --- On Wed, 5/25/11, Alec Wysoker <al...@br...> wrote: From: Alec Wysoker <al...@br...> Subject: Re: [Samtools-help] Picard release 1.46 To: "mei ge" <rge...@ya...> Cc: "samtools help" <sam...@li...> Date: Wednesday, May 25, 2011, 4:52 PM Hi Mei, Are you looking for human? If so, what genome build? -Alec On 5/25/11 3:15 PM, mei ge wrote: Hi Alec, I use the CollectRnaSeqMetrics in the new release Picard. I don't know where to get the Ribosomal_intervals file. Can you point a place for the file? Thanks Mei |
From: Rusch, M. <Michael.Rusch@STJUDE.ORG> - 2011-05-26 18:16:01
|
If I had to guess, I'd say that either the bam file or the fasta file includes the unplaced contigs (GL*) whereas the other one does not. 25 is just enough for 1-22 + M,X,Y. 85 is about right for the total with the hg19 unplaced contigs (if I remember correctly). To see the sequence list for the bam do: samtools view -H /s_1_2_pefca_2/s_1_2_pefca_2_sorted.bam | grep '^@SQ' To see the sequence list for the fasta, do: cat /databases/hs_chromosomes/hg19.fa.dict If that fails, try this (takes longer): grep '^>' /databases/hs_chromosomes/hg19.fa.dict Michael From: mei ge [mailto:rge...@ya...] Sent: Thursday, May 26, 2011 1:10 PM To: Alec Wysoker Cc: Sam...@li... Subject: Re: [Samtools-help] Picard release 1.46. . Hi Alec: I used your ribosomal interval file and I got an error as follow: [Thu May 26 13:50:45 EDT 2011] net.sf.picard.analysis.CollectRnaSeqMetrics REF_FLAT=/home/gtf/hg19/refFlat_table_for_picard.txt RIBOSOMAL_INTERVALS=/home/gtf/ribosomal_RNA/Homo_sapiens_assembly19.rRNA.interval_list STRAND_SPECIFICITY=NONE INPUT=/s_1_2_pefca_2/s_1_2_pefca_2_sorted.bam OUTPUT=/s_1_2_pefca_2/s_1_2_pefca_2_RNAseq_metrics.txt REFERENCE_SEQUENCE=/databases/hs_chromosomes/hg19.fa ASSUME_SORTED=true VALIDATION_STRINGENCY=SILENT STOP_AFTER=0 VERBOSITY=INFO QUIET=false COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false INFO 2011-05-26 13:50:46 CollectRnaSeqMetrics Loaded 21987 genes. [Thu May 26 13:50:46 EDT 2011] net.sf.picard.analysis.CollectRnaSeqMetrics done. Elapsed time: 0.03 minutes. Runtime.totalMemory()=165347328 Exception in thread "main" net.sf.samtools.util.SequenceUtil$SequenceListsDifferException: Sequence dictionaries are not the same size (25, 85) at net.sf.samtools.util.SequenceUtil.assertSequenceListsEqual(SequenceUtil.java:84) at net.sf.samtools.util.SequenceUtil.assertSequenceDictionariesEqual(SequenceUtil.java:126) at net.sf.picard.analysis.CollectRnaSeqMetrics.setup(CollectRnaSeqMetrics.java:86) at net.sf.picard.analysis.SinglePassSamProgram.makeItSo(SinglePassSamProgram.java:101) at net.sf.picard.analysis.SinglePassSamProgram.doWork(SinglePassSamProgram.java:54) at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:158) at net.sf.picard.cmdline.CommandLineProgram.instanceMainWithExit(CommandLineProgram.java:118) at net.sf.picard.analysis.CollectRnaSeqMetrics.main(CollectRnaSeqMetrics.java:77) --- On Thu, 5/26/11, Alec Wysoker <al...@br...> wrote: From: Alec Wysoker <al...@br...> Subject: Re: [Samtools-help] Picard release 1.46 To: "mei ge" <rge...@ya...> Cc: "Sam...@li..." <Sam...@li...> Date: Thursday, May 26, 2011, 10:58 AM Hi Robin, Human HG19 is attached. I don't have one for mouse. I just extracted these from gencode GTF file. Presumably you could do the same for mouse. Interval list format described here: http://picard.sourceforge.net/javadoc/net/sf/picard/util/IntervalList.html Note that this file is not required by CollectRnaSeqMetrics, but if you do not provide it the program will not be able to determine when reads map to rRNA. -Alec On 5/26/11 9:54 AM, mei ge wrote: Hi Alec: I am looking for human (hg19) and mouse(mm9). Thanks Robin --- On Wed, 5/25/11, Alec Wysoker <al...@br...><http://us.mc388.mail.yahoo.com/mc/compose?to=al...@br...> wrote: From: Alec Wysoker <al...@br...><http://us.mc388.mail.yahoo.com/mc/compose?to=al...@br...> Subject: Re: [Samtools-help] Picard release 1.46 To: "mei ge" <rge...@ya...><http://us.mc388.mail.yahoo.com/mc/compose?to=rge...@ya...> Cc: "samtools help" <sam...@li...><http://us.mc388.mail.yahoo.com/mc/compose?to=sam...@li...> Date: Wednesday, May 25, 2011, 4:52 PM Hi Mei, Are you looking for human? If so, what genome build? -Alec On 5/25/11 3:15 PM, mei ge wrote: Hi Alec, I use the CollectRnaSeqMetrics in the new release Picard. I don't know where to get the Ribosomal_intervals file. Can you point a place for the file? Thanks Mei ________________________________ Email Disclaimer: www.stjude.org/emaildisclaimer |
From: Alec W. <al...@br...> - 2011-05-26 18:18:47
|
Hi Robin, I will improve this error message so it is clearer. Homo_sapiens_assembly19.rRNA.interval_list has 25 sequences, and s_1_2_pefca_2_sorted.bam has 85. You may need to edit the interval list so that it has all the sequences in your bam, and uses the same naming convention. -Alec On 5/26/11 2:09 PM, mei ge wrote: > Hi Alec: > I used your ribosomal interval file and I got an error as follow: > [Thu May 26 13:50:45 EDT 2011] > net.sf.picard.analysis.CollectRnaSeqMetrics > REF_FLAT=/home/gtf/hg19/refFlat_table_for_picard.txt > RIBOSOMAL_INTERVALS=/home/gtf/ribosomal_RNA/Homo_sapiens_assembly19.rRNA.interval_list > STRAND_SPECIFICITY=NONE > INPUT=/s_1_2_pefca_2/s_1_2_pefca_2_sorted.bam > OUTPUT=/s_1_2_pefca_2/s_1_2_pefca_2_RNAseq_metrics.txt > REFERENCE_SEQUENCE=/databases/hs_chromosomes/hg19.fa ASSUME_SORTED=true > VALIDATION_STRINGENCY=SILENT > STOP_AFTER=0 > VERBOSITY=INFO > QUIET=false > COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false > CREATE_MD5_FILE=false > INFO 2011-05-26 13:50:46 CollectRnaSeqMetrics Loaded 21987 genes. > [Thu May 26 13:50:46 EDT 2011] > net.sf.picard.analysis.CollectRnaSeqMetrics done. Elapsed time: 0.03 > minutes. > Runtime.totalMemory()=165347328 > Exception in thread "main" > net.sf.samtools.util.SequenceUtil$SequenceListsDifferException: > Sequence dictionaries are not the same size (25, 85) > at > net.sf.samtools.util.SequenceUtil.assertSequenceListsEqual(SequenceUtil.java:84) > at > net.sf.samtools.util.SequenceUtil.assertSequenceDictionariesEqual(SequenceUtil.java:126) > at > net.sf.picard.analysis.CollectRnaSeqMetrics.setup(CollectRnaSeqMetrics.java:86) > at > net.sf.picard.analysis.SinglePassSamProgram.makeItSo(SinglePassSamProgram.java:101) > at > net.sf.picard.analysis.SinglePassSamProgram.doWork(SinglePassSamProgram.java:54) > at > net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:158) > at > net.sf.picard.cmdline.CommandLineProgram.instanceMainWithExit(CommandLineProgram.java:118) > at > net.sf.picard.analysis.CollectRnaSeqMetrics.main(CollectRnaSeqMetrics.java:77) > > > --- On *Thu, 5/26/11, Alec Wysoker /<al...@br...>/* wrote: > > > From: Alec Wysoker <al...@br...> > Subject: Re: [Samtools-help] Picard release 1.46 > To: "mei ge" <rge...@ya...> > Cc: "Sam...@li..." > <Sam...@li...> > Date: Thursday, May 26, 2011, 10:58 AM > > Hi Robin, > > Human HG19 is attached. I don't have one for mouse. I just > extracted these from gencode GTF file. Presumably you could do > the same for mouse. Interval list format described here: > http://picard.sourceforge.net/javadoc/net/sf/picard/util/IntervalList.html > > Note that this file is not required by CollectRnaSeqMetrics, but > if you do not provide it the program will not be able to determine > when reads map to rRNA. > > -Alec > > On 5/26/11 9:54 AM, mei ge wrote: >> Hi Alec: >> I am looking for human (hg19) and mouse(mm9). >> Thanks >> Robin >> >> --- On *Wed, 5/25/11, Alec Wysoker /<al...@br...> >> <http://us.mc388.mail.yahoo.com/mc/compose?to=al...@br...>/* >> wrote: >> >> >> From: Alec Wysoker <al...@br...> >> <http://us.mc388.mail.yahoo.com/mc/compose?to=al...@br...> >> Subject: Re: [Samtools-help] Picard release 1.46 >> To: "mei ge" <rge...@ya...> >> <http://us.mc388.mail.yahoo.com/mc/compose?to=rge...@ya...> >> Cc: "samtools help" <sam...@li...> >> <http://us.mc388.mail.yahoo.com/mc/compose?to=sam...@li...> >> Date: Wednesday, May 25, 2011, 4:52 PM >> >> Hi Mei, >> >> Are you looking for human? If so, what genome build? >> >> -Alec >> >> On 5/25/11 3:15 PM, mei ge wrote: >>> Hi Alec, >>> I use the CollectRnaSeqMetrics in the new release Picard. I >>> don't know where to get the Ribosomal_intervals file. Can >>> you point a place for the file? >>> Thanks >>> Mei >>> |
From: mei ge <rge...@ya...> - 2011-05-26 18:26:00
|
Hi Alec, The Homo_sapiens_assembly19.rRNA.interval_list has 85 sequences, my bam file has 25 sequences. I need to edit your file to match the bam sequences. Thanks Robin --- On Thu, 5/26/11, Alec Wysoker <al...@br...> wrote: From: Alec Wysoker <al...@br...> Subject: Re: [Samtools-help] Picard release 1.46 To: "mei ge" <rge...@ya...> Cc: "Sam...@li..." <Sam...@li...> Date: Thursday, May 26, 2011, 2:18 PM Hi Robin, I will improve this error message so it is clearer. Homo_sapiens_assembly19.rRNA.interval_list has 25 sequences, and s_1_2_pefca_2_sorted.bam has 85. You may need to edit the interval list so that it has all the sequences in your bam, and uses the same naming convention. -Alec On 5/26/11 2:09 PM, mei ge wrote: Hi Alec: I used your ribosomal interval file and I got an error as follow: [Thu May 26 13:50:45 EDT 2011] net.sf.picard.analysis.CollectRnaSeqMetrics REF_FLAT=/home/gtf/hg19/refFlat_table_for_picard.txt RIBOSOMAL_INTERVALS=/home/gtf/ribosomal_RNA/Homo_sapiens_assembly19.rRNA.interval_list STRAND_SPECIFICITY=NONE INPUT=/s_1_2_pefca_2/s_1_2_pefca_2_sorted.bam OUTPUT=/s_1_2_pefca_2/s_1_2_pefca_2_RNAseq_metrics.txt REFERENCE_SEQUENCE=/databases/hs_chromosomes/hg19.fa ASSUME_SORTED=true VALIDATION_STRINGENCY=SILENT STOP_AFTER=0 VERBOSITY=INFO QUIET=false COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false INFO 2011-05-26 13:50:46 CollectRnaSeqMetrics Loaded 21987 genes. [Thu May 26 13:50:46 EDT 2011] net.sf.picard.analysis.CollectRnaSeqMetrics done. Elapsed time: 0.03 minutes. Runtime.totalMemory()=165347328 Exception in thread "main" net.sf.samtools.util.SequenceUtil$SequenceListsDifferException: Sequence dictionaries are not the same size (25, 85) at net.sf.samtools.util.SequenceUtil.assertSequenceListsEqual(SequenceUtil.java:84) at net.sf.samtools.util.SequenceUtil.assertSequenceDictionariesEqual(SequenceUtil.java:126) at net.sf.picard.analysis.CollectRnaSeqMetrics.setup(CollectRnaSeqMetrics.java:86) at net.sf.picard.analysis.SinglePassSamProgram.makeItSo(SinglePassSamProgram.java:101) at net.sf.picard.analysis.SinglePassSamProgram.doWork(SinglePassSamProgram.java:54) at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:158) at net.sf.picard.cmdline.CommandLineProgram.instanceMainWithExit(CommandLineProgram.java:118) at net.sf.picard.analysis.CollectRnaSeqMetrics.main(CollectRnaSeqMetrics.java:77) --- On Thu, 5/26/11, Alec Wysoker <al...@br...> wrote: From: Alec Wysoker <al...@br...> Subject: Re: [Samtools-help] Picard release 1.46 To: "mei ge" <rge...@ya...> Cc: "Sam...@li..." <Sam...@li...> Date: Thursday, May 26, 2011, 10:58 AM Hi Robin, Human HG19 is attached. I don't have one for mouse. I just extracted these from gencode GTF file. Presumably you could do the same for mouse. Interval list format described here: http://picard.sourceforge.net/javadoc/net/sf/picard/util/IntervalList.html Note that this file is not required by CollectRnaSeqMetrics, but if you do not provide it the program will not be able to determine when reads map to rRNA. -Alec On 5/26/11 9:54 AM, mei ge wrote: Hi Alec: I am looking for human (hg19) and mouse(mm9). Thanks Robin --- On Wed, 5/25/11, Alec Wysoker <al...@br...> wrote: From: Alec Wysoker <al...@br...> Subject: Re: [Samtools-help] Picard release 1.46 To: "mei ge" <rge...@ya...> Cc: "samtools help" <sam...@li...> Date: Wednesday, May 25, 2011, 4:52 PM Hi Mei, Are you looking for human? If so, what genome build? -Alec On 5/25/11 3:15 PM, mei ge wrote: Hi Alec, I use the CollectRnaSeqMetrics in the new release Picard. I don't know where to get the Ribosomal_intervals file. Can you point a place for the file? Thanks Mei |
From: Alec W. <al...@br...> - 2011-05-26 18:31:47
|
Right, sorry. -Alec On 5/26/11 2:25 PM, mei ge wrote: > Hi Alec, > The Homo_sapiens_assembly19.rRNA.interval_list has 85 sequences, my > bam file has 25 sequences. I need to edit your file to match the bam > sequences. > Thanks > Robin > > > --- On *Thu, 5/26/11, Alec Wysoker /<al...@br...>/* wrote: > > > From: Alec Wysoker <al...@br...> > Subject: Re: [Samtools-help] Picard release 1.46 > To: "mei ge" <rge...@ya...> > Cc: "Sam...@li..." > <Sam...@li...> > Date: Thursday, May 26, 2011, 2:18 PM > > Hi Robin, > > I will improve this error message so it is clearer. > Homo_sapiens_assembly19.rRNA.interval_list has 25 sequences, and > s_1_2_pefca_2_sorted.bam has 85. You may need to edit the > interval list so that it has all the sequences in your bam, and > uses the same naming convention. > > -Alec > > On 5/26/11 2:09 PM, mei ge wrote: >> Hi Alec: >> I used your ribosomal interval file and I got an error as follow: >> [Thu May 26 13:50:45 EDT 2011] >> net.sf.picard.analysis.CollectRnaSeqMetrics >> REF_FLAT=/home/gtf/hg19/refFlat_table_for_picard.txt >> RIBOSOMAL_INTERVALS=/home/gtf/ribosomal_RNA/Homo_sapiens_assembly19.rRNA.interval_list >> STRAND_SPECIFICITY=NONE >> INPUT=/s_1_2_pefca_2/s_1_2_pefca_2_sorted.bam >> OUTPUT=/s_1_2_pefca_2/s_1_2_pefca_2_RNAseq_metrics.txt >> REFERENCE_SEQUENCE=/databases/hs_chromosomes/hg19.fa >> ASSUME_SORTED=true >> VALIDATION_STRINGENCY=SILENT >> STOP_AFTER=0 >> VERBOSITY=INFO >> QUIET=false >> COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false >> CREATE_MD5_FILE=false >> INFO 2011-05-26 13:50:46 CollectRnaSeqMetrics Loaded 21987 genes. >> [Thu May 26 13:50:46 EDT 2011] >> net.sf.picard.analysis.CollectRnaSeqMetrics done. Elapsed time: >> 0.03 minutes. >> Runtime.totalMemory()=165347328 >> Exception in thread "main" >> net.sf.samtools.util.SequenceUtil$SequenceListsDifferException: >> Sequence dictionaries are not the same size (25, 85) >> at >> net.sf.samtools.util.SequenceUtil.assertSequenceListsEqual(SequenceUtil.java:84) >> at >> net.sf.samtools.util.SequenceUtil.assertSequenceDictionariesEqual(SequenceUtil.java:126) >> at >> net.sf.picard.analysis.CollectRnaSeqMetrics.setup(CollectRnaSeqMetrics.java:86) >> at >> net.sf.picard.analysis.SinglePassSamProgram.makeItSo(SinglePassSamProgram.java:101) >> at >> net.sf.picard.analysis.SinglePassSamProgram.doWork(SinglePassSamProgram.java:54) >> at >> net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:158) >> at >> net.sf.picard.cmdline.CommandLineProgram.instanceMainWithExit(CommandLineProgram.java:118) >> at >> net.sf.picard.analysis.CollectRnaSeqMetrics.main(CollectRnaSeqMetrics.java:77) >> >> >> --- On *Thu, 5/26/11, Alec Wysoker /<al...@br...> >> <http://us.mc388.mail.yahoo.com/mc/compose?to=al...@br...>/* >> wrote: >> >> >> From: Alec Wysoker <al...@br...> >> <http://us.mc388.mail.yahoo.com/mc/compose?to=al...@br...> >> Subject: Re: [Samtools-help] Picard release 1.46 >> To: "mei ge" <rge...@ya...> >> <http://us.mc388.mail.yahoo.com/mc/compose?to=rge...@ya...> >> Cc: "Sam...@li..." >> <http://us.mc388.mail.yahoo.com/mc/compose?to=Sam...@li...> >> <Sam...@li...> >> <http://us.mc388.mail.yahoo.com/mc/compose?to=Sam...@li...> >> Date: Thursday, May 26, 2011, 10:58 AM >> >> Hi Robin, >> >> Human HG19 is attached. I don't have one for mouse. I just >> extracted these from gencode GTF file. Presumably you could >> do the same for mouse. Interval list format described here: >> http://picard.sourceforge.net/javadoc/net/sf/picard/util/IntervalList.html >> >> Note that this file is not required by CollectRnaSeqMetrics, >> but if you do not provide it the program will not be able to >> determine when reads map to rRNA. >> >> -Alec >> >> On 5/26/11 9:54 AM, mei ge wrote: >>> Hi Alec: >>> I am looking for human (hg19) and mouse(mm9). >>> Thanks >>> Robin >>> >>> --- On *Wed, 5/25/11, Alec Wysoker >>> /<al...@br...> >>> <http://us.mc388.mail.yahoo.com/mc/compose?to=al...@br...>/* >>> wrote: >>> >>> >>> From: Alec Wysoker <al...@br...> >>> <http://us.mc388.mail.yahoo.com/mc/compose?to=al...@br...> >>> Subject: Re: [Samtools-help] Picard release 1.46 >>> To: "mei ge" <rge...@ya...> >>> <http://us.mc388.mail.yahoo.com/mc/compose?to=rge...@ya...> >>> Cc: "samtools help" >>> <sam...@li...> >>> <http://us.mc388.mail.yahoo.com/mc/compose?to=sam...@li...> >>> Date: Wednesday, May 25, 2011, 4:52 PM >>> >>> Hi Mei, >>> >>> Are you looking for human? If so, what genome build? >>> >>> -Alec >>> >>> On 5/25/11 3:15 PM, mei ge wrote: >>>> Hi Alec, >>>> I use the CollectRnaSeqMetrics in the new release >>>> Picard. I don't know where to get the >>>> Ribosomal_intervals file. Can you point a place for >>>> the file? >>>> Thanks >>>> Mei >>>> |
From: mei ge <rge...@ya...> - 2011-05-27 18:48:51
|
Hi Alec, I need the ribosomal interval file for mouse. Where did you get the gencode gtf file? Thanks Robin --- On Thu, 5/26/11, Alec Wysoker <al...@br...> wrote: From: Alec Wysoker <al...@br...> Subject: Re: [Samtools-help] Picard release 1.46 To: "mei ge" <rge...@ya...> Cc: "Sam...@li..." <Sam...@li...> Date: Thursday, May 26, 2011, 10:58 AM Hi Robin, Human HG19 is attached. I don't have one for mouse. I just extracted these from gencode GTF file. Presumably you could do the same for mouse. Interval list format described here: http://picard.sourceforge.net/javadoc/net/sf/picard/util/IntervalList.html Note that this file is not required by CollectRnaSeqMetrics, but if you do not provide it the program will not be able to determine when reads map to rRNA. -Alec On 5/26/11 9:54 AM, mei ge wrote: Hi Alec: I am looking for human (hg19) and mouse(mm9). Thanks Robin --- On Wed, 5/25/11, Alec Wysoker <al...@br...> wrote: From: Alec Wysoker <al...@br...> Subject: Re: [Samtools-help] Picard release 1.46 To: "mei ge" <rge...@ya...> Cc: "samtools help" <sam...@li...> Date: Wednesday, May 25, 2011, 4:52 PM Hi Mei, Are you looking for human? If so, what genome build? -Alec On 5/25/11 3:15 PM, mei ge wrote: Hi Alec, I use the CollectRnaSeqMetrics in the new release Picard. I don't know where to get the Ribosomal_intervals file. Can you point a place for the file? Thanks Mei |