GLEAN Code
Brought to you by:
amackey
File | Date | Author | Commit |
---|---|---|---|
bin | 2011-10-26 | amackey | [r16] check for vulgar tag first |
data | 2006-10-06 | amackey | [r8] oops, didn't want to add that |
evigan | 2011-10-05 | drd6y | [r15] Evigan added |
lib | 2012-02-03 | amackey | [r18] continue to improve sequence retrieval |
README | 2006-10-06 | amackey | [r7] changes |
You can download a tar ball of GLEAN at: http://sourceforge.net/projects/glean-gene There is no installation procedure (apart from unpacking). You need to have R installed, and a handful of Perl libraries: BioPerl (preferably, a CVS checkout of bioperl-live) List::Util FindBin YAML Storable Getopt::Long Algorithm::Diff Data::Dumper Module::Pluggable GraphViz (only for some advanced uses) You invoke glean/bin/glean-lca with a "param.yaml" parameter file in YAML format (you can find one in data/param-example.yaml) against a MySQL Bio::DB::GFF database: % glean-lca --param myparams.yaml --database fly --user me --password youbetchya > fly.dat This collates all the data from the database and calculates performance estimates for each evidence source. Then you generate gene models with: % cat fly.dat | glean-dp --gff --minlen 45 > fly.gff --minlen specifies the smallest allowed gene length, in nucleotides; --gff triggers GFF2 output format (default is the funny format accepted by GBrowse for user-supplied annotation features). I'm sure you'll have questions; let me know what breaks first. Aaron Mackey ajmackey@gmail.com