From: Alberto D. <da...@io...> - 2005-02-11 15:16:56
|
Basically we will use sequences (loaded into GUS with the GBParser) for NCBI Blast (Blastx, Blastp and TBlastX), the same sequences will be also used for Interpro analyses. Results of both (Blast and Interpro) will be loaded into GUS. We will parse specific things from the Blast results, I would say: `Gi` `Accession` `Description` `E_value` `Score` `Length` `Frame_Query` `Frame_Hit` `Identical` `Hsp_Frac_Identical` `Conserved` `Hsp_Frac_Conserved` `Query_Start` `Query_End` `Hit_Start` `Hit_End` `Hsp_Align` `database_letters` `database_entries` We already have a Bioperl parser for that (specific for another system: GARSA) that could be adapted to GUS, problem being we are not sure what tables should be used to store those data in GUS. Cheers, Alberto On Fri, 2005-02-11 at 10:06 -0500, Steve Fischer wrote: > what are you planning on blasting? > > steve > > Alberto Davila wrote: > > >Hi Steve, > > > >On Fri, 2005-02-11 at 08:56 -0500, Steve Fischer wrote: > > > > > >>poliana- > >> > >>oops, the usage statement for LoadBlastSimFast is out of date. it > >>should instruct you to use the blastSimilarity command. > >> > >>LoadBlastSimFast makes a big assumption, that the subject and query > >>sequences are in GUS, and their def. lines have GUS primary keys. > >> > >>Are your sequences already loaded into GUS? > >> > >> > > > >They are not, there would be any howto/tips for that plugin ? We will > >certainly need a plugin to load "Interpro" and "ORF finding" results > >into GUS... If they are not available, then maybe we will have to write > >them ... > > > >Cheers, Alberto > > > > > > > >>steve > >> > >> > >> > >>Poliana Mateus wrote: > >> > >> > >> > >>>Hello all, > >>> > >>>Where can find the script parseBlastFilesForSimilarity.pl?? > >>>I'm trying to run LoadBlastSimFast... > >>> > >>>Poliana |
From: davila <da...@io...> - 2005-02-13 21:06:30
|
U3RldmUsDQoqIHNlZSBiZWxvdw0KDQpBbGJlcnRvIERhdmlsYSB3cm90ZToNCg0KPldlIGFyZSBk b2luZyB0aGlzIGZvciBHYXJzYSAoYW5vdGhlciBzeXN0ZW0pIC4uIGJhc2ljYWxseSB3ZSBoYXZl IGENCj5iaW9wZXJsIHBhcnNlciAoQmlvOjpTZWFyY2g6OklPKSB0aGF0IHJlYWRzIHRoZSBCbGFz dCByZXN1bHRzIGZpbGUgYW5kDQo+ZXh0cmFjdCBhbGwgdGhlIG5lZWRlZCBpbmZvICh0byB0aGUg IkJsYXN0X0hpdCIgdGFibGUpLi4uIGFuZCBhbHNvIGxvYWQNCj5pbnRvIGEgZ2l2ZW4gdGFibGUg KGVnOiBFeHRlcm5hbF9EQikgYWxsIHRoZSBzZXF1ZW5jZXMgKGluIGZhc3RhIGZvcm1hdCkNCj5w cmVzZW50aW5nIHNpbWlsYXJpdHkgd2l0aCB0aGUgcXVlcmllcy4uLiBhdCB0aGUgZW5kIHdlIGhh dmUgIkJsYXN0X0hpdCINCj5hbmQgIkV4dGVybmFsX0RCIiBwb3B1bGF0ZWQgd2l0aCB0aGUgc2Ft ZSBzY3JpcHQuDQo+DQo+IA0KPg0Kd293LCBncmVhdC4gIGNvdWxkIHlvdSBtYWtlIGEgZ3VzIHBs dWdpbiBmcm9tIHRoYXQ/DQoNCg0KU2hvdWxkIG5vdCBiZSBhIGJpZyBwcm9ibGVtLCBJIHdpbGwg YXNrIFBvbGlhbmEgdG8gZG8gdGhhdC4uLiBzaGUgY2FuIG9jYXNzaW9uYWxseSBjb250YWN0IHlv dSBhc2tpbmcgZm9yIHNvbWUgZGV0YWlscy4uLiBhdCB0aGUgZW5kIHdlIHdpbGwgcHV0IHRoaW5n cyBiZWluZyBkZWJ1Z2dlZC9kZXZlbG9wZWQgYnkgdXMgYXQgOiB3d3cuYmlvd2ViZGIub3JnIGFu ZCBhbHNvIHByb3ZpZGUgdGhlbSB0byBhbnkgaW50ZXJlc3RlZCBwZW9wbGUuIEluIGFuIGlkZWFs IHdvcmxkLCBub2JvZHkgc2hvdWxkIHN1ZmZlciB0d2ljZSB3aXRoIHRoZSBzYW1lICJidWciIDst KQ0KDQo+UmVnYXJkaW5nIEludGVycHJvIGFuZCBHbGltbWVyLCB0aGUgbWFpbiBwcm9ibGVtIGlz IHRvIGtub3cgaW4gd2hpY2gNCj50YWJsZXMgd2Ugc2hvdWxkIGxvYWQgdGhlIHBhcnNlZCByZXN1 bHRzID8NCj4NCj4gDQo+DQoqIGRlc2NyaWJlIHRoZSBpbmZvIHlvdSB3YW50IHRvIHN0b3JlLg0K DQpCYXNpY2FsbHkgdGhpczoNCg0KRnJhbWVfSGl0LCBNZXRob2QgLCBNZXRob2RfQWNjZXNzaW9u LCBBY2Nlc3Npb24sIEhpdF9TdGF0dXMsIFF1ZXJ5X1N0YXJ0LCBRdWVyeV9FbmQsIERlc2NyaXB0 aW9uLCBFX3ZhbHVlDQoNCkFnYWluLCBJIGFtIGFza2luZyBQb2xpYW5hIHRvIHRha2UgY2FyZSBv ZiB0aGF0Lg0KDQoqIHN0ZXZlDQoNCkNoZWVycywgQWxiZXJ0bw0KDQoNCkFsYmVydG8gTS4gUi4g RMOhdmlsYSwgUGhEDQpLaW5ldG9wbGFzdGlkIEJpb2xvZ3kgYW5kIERpc2Vhc2UgKEJpb21lZCBD ZW50cmFsKQ0KaHR0cDovL3d3dy5raW5ldG9wbGFzdGlkcy5jb20NCmh0dHA6Ly93d3cuZGFyd2lu LmZpb2NydXouYnINCkRCQk0gLyBJbnN0aXR1dG8gT3N3YWxkbyBDcnV6IC8gRklPQ1JVWg0KQXYu IEJyYXNpbCA0MzY1DQpSaW8gZGUgSmFuZWlybywgUkosIEJyYXNpbA0KQ0VQIDIxMDQ1LTkwMA0K RW1haWw6IGRhdmlsYUBmaW9jcnV6LmJyDQogICAgICAgICAgYW1yZGF2aWxhQHlhaG9vLmNvbQ0K UGhvbmU6IDU1LTIxLTM4NjUtODIyOSAvIDM4NjUtODIwNg0KRmF4OiA1NS0yMS0yNTkwLTM0OTUN Ci0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0NClRoZSBC aW93ZWJEQiBjb25zb3J0aXVtOiBodHRwOi8vd3d3LmJpb3dlYmRiLm9yZw0KDQogDQoNCg0KDQo+ QWxiZXJ0bw0KPg0KPk9uIEZyaSwgMjAwNS0wMi0xMSBhdCAxMzoyMSAtMDUwMCwgWS4gVGhvbWFz IEdhbiB3cm90ZToNCj4gDQo+DQo+Pkkgd2FzIGdvaW5nIHRvIGdpdmUgdGhlIHNhbWUgYW5zd2Vy IHN0ZXZlIGdhdmUgZm9yIGludGVycHJvIGFuZCBnZW5lDQo+PmZpbmRpbmcgcmVzdWx0cy4NCj4+ DQo+PkZvciBsb2FkaW5nIHNlcXVlbmNlcyBpbnRvIEdVUywgdGhlIGRpbGxlbWEgd2l0aCBvcHRp b24gMiBpczogaG93IGRvIHlvdQ0KPj5rbm93IHdoaWNoIHNlcXVlbmNlIHRvIGxvYWQgd2hlbiB5 b3UgbG9hZCAod2hpY2ggaXMgYmVmb3JlIHlvdSBhY3R1YWxseQ0KPj5oYXZlIHRoZSBzaW1pbGFy aXR5IHJlc3VsdCk/IE9uZSBzb2x1dGlvbiB3b3VsZCBiZSB0byBpbml0aWFsbHkgbG9hZA0KPj5j b21wbGV0ZSBkYXRhc2V0KHMpIGJ1dCBkZWxldGUgdGhvc2Ugd2l0aG91dCBzaW1pbGFyaXR5IGFm dGVyIGxvYWRpbmcNCj4+c2ltaWxhcml0eSByZXN1bHRzLg0KPj4NCj4+LVRob21hcw0KPj4NCj4+ T24gRnJpLCAxMSBGZWIgMjAwNSwgU3RldmUgRmlzY2hlciB3cm90ZToNCj4+DQo+PiAgIA0KPj4N Cj4+PmFsYmVydG8tDQo+Pj4NCj4+PndlJ3ZlIG5ldmVyIGxvYWRlZCBpbnRlcnBybywgc28gdGhl cmUgaXNuJ3QgYSBwbHVnaW4uDQo+Pj5pIGJlbGlldmUgcGxhc21vZGIgaGFzIGxvYWRlZCBnbGlt bWVyIHJlc3VsdHMsIHRob3VnaCBpJ20gbm90IHN1cmUuICAgaSBoYXZlDQo+Pj5hc2tlZCBhIHBs YXNtb2RiIGRldmVsb3BlciB0byBhbnN3ZXIgdGhhdCBxdWVzdGlvbi4NCj4+Pg0KPj4+c3RldmUN Cj4+Pg0KPj4+QWxiZXJ0byBEYXZpbGEgd3JvdGU6DQo+Pj4NCj4+PiAgICAgDQo+Pj4NCj4+Pj5I ZXkgU3RldmUsIFRob21hcywNCj4+Pj4NCj4+Pj5UaGFua3MgYSBsb3QgZm9yIHRoZSB0aXBzLCBy ZWFsbHkgaGVscGZ1bC4uIG5vdywgZmV3IG1vcmUgcXVlc3Rpb25zOg0KPj4+Pg0KPj4+Pg0KPj4+ PiAgICAgICANCj4+Pj4NCj4+Pj4+b2suICBOUiA9IE5SREINCj4+Pj4+DQo+Pj4+PnRoZSB3YXkg d2UgaGF2ZSB1c2VkIGd1cyB3aXRoIHNpbWlsYXJpdGllcyBpcyB0aGF0IGJvdGggdGhlIHF1ZXJ5 IGFuZA0KPj4+Pj5zdWJqZWN0IGFyZSBsb2FkZWQgaW50byBndXMuICBBcyB0aG9tYXMgZXhwbGFp bmVkLCB0aGUgc2ltaWxhcml0eSB0YWJsZQ0KPj4+Pj5jYXB0dXJlcyBzaW1pbGFyaXR5IGJldHdl ZW4gc2VxdWVuY2VzIHRoYXQgYXJlIGluIGd1cy4NCj4+Pj4+b3VyIGFwcHJvYWNoIGhhcyBhbHdh eXMgYmVlbiB0byBqdXN0IGxvYWQgKHdhcmVob3VzZSkgdGhlIGVudGlyZSBzdWJqZWN0DQo+Pj4+ PmRhdGFiYXNlIChOUiwgRVNUKSB0aGF0IHdlIGFyZSBibGFzdGluZyBhZ2FpbnN0Lg0KPj4+Pj4N Cj4+Pj4+dGhlIGN1cnJlbnQgcGx1Z2lucyBhbmQgYmxhc3RTaW1pbGFyaXR5IGFyZSBzZXQgdXAg Zm9yIHRoaXMuDQo+Pj4+Pg0KPj4+Pj5vYnZpb3VzbHksIHRoaXMgdGFrZXMgYSBsb3Qgb2YgZGlz ayBzcGFjZS4gIHR3byBtYWpvciBlZmZpY2llbmNpZXMgdGhhdCB3ZQ0KPj4+Pj5kb24ndCBjdXJy ZW50bHkgaGF2ZSBwbHVnaW5zIGZvciB3b3VsZCBiZToNCj4+Pj4+IDEuIHRvIG9ubHkgc3RvcmUg aW4gZ3VzIGEgKnJlZmVyZW5jZSogdG8gdGhlIGV4dGVybmFsIHNlcXVlbmNlIChpZSwgZG9uJ3QN Cj4+Pj4+c3RvcmUgdGhlIGFjdGdzKS4NCj4+Pj4+IDIuIG9ubHkgc3RvcmUgaW4gZ3VzIHRoZSBz ZXF1ZW5jZXMgdGhhdCBhY3R1YWxseSBoYXZlIHNpbWlsYXJpdGllcw0KPj4+Pj4NCj4+Pj4+ICAg ICAgICAgDQo+Pj4+Pg0KPj4+Pk9wdGlvbiAyIHNvdW5kIGJldHRlciBmb3IgdXMsIHNpbmNlIHdl IHdpbGwgYmUgYmxhc3RpbmcgYWdhaW5zdCBzZXZlcmFsDQo+Pj4+ZGF0YWJhc2VzICg+IDEwR0Ig ZGF0YWJhc2VzKQ0KPj4+Pg0KPj4+PldoYXQgYWJvdXQgdGhlIHBsdWdpbnMgdG8gbG9hZCBJbnRl cnBybyBhbmQgImdlbmUgZmluZGVyIiAoZ2xpbW1lciwgZXRjKQ0KPj4+PnJlc3VsdHMgPyBJcyB0 aGVyZSBhbnkgYXQgYWxsID8NCj4+Pj4NCj4+Pj5DaGVlcnMsIEFsYmVydG8NCj4+Pj4NCj4+Pj4N Cj4+Pj4gICAgICAgDQo+Pj4+DQo+Pj4+PnN0ZXZlDQo+Pj4+Pg0KPj4+Pj5BbGJlcnRvIERhdmls YSB3cm90ZToNCj4+Pj4+DQo+Pj4+Pg0KPj4+Pj4gICAgICAgICANCj4+Pj4+DQo+Pj4+Pj5BbGwg dGhlIGJsYXN0YWJsZSBkYXRhYmFzZXMgSSBtZW50aW9uZWQgYXJlIHN0YW5kYXJkIGRhdGFiYXNl cyBmcm9tIE5DQkkNCj4+Pj4+PihmdHA6Ly9mdHAubmNiaS5ubG0ubmloLmdvdi9ibGFzdC9kYi9i bGFzdGRiLnR4dCk6DQo+Pj4+Pj4NCj4+Pj4+Pk5UID0gbnVjbGVvdGlkZXMNCj4+Pj4+Pg0KPj4+ Pj4+fjMwMDAwIGVudHJpZXMgZnJvbSBnZW5iYW5rIChnZW5iYW5rIGZvcm1hdCkgYXJlIGxvYWRl ZCBpbnRvIEdVUyBub3cuDQo+Pj4+Pj4NCj4+Pj4+Pk5vdCBzdXJlIGFib3V0IHlvdXIgIk5SREIi LCBJIGtub3cgTlIgZnJvbSBOQ0JJIHRoYXQgaXMgYSBjb2xsZWN0aW9uIG9mDQo+Pj4+Pj5hbWlu b2FjaWQgZW50cmllcywgY291bGQgaXQgYmUgdGhlIHNhbWUgPw0KPj4+Pj4+DQo+Pj4+Pj5BbGJl cnRvDQo+Pj4+Pj4NCj4+Pj4+Pk9uIEZyaSwgMjAwNS0wMi0xMSBhdCAxMDo0MyAtMDUwMCwgU3Rl dmUgRmlzY2hlciB3cm90ZToNCj4+Pj4+Pg0KPj4+Pj4+DQo+Pj4+Pj4NCj4+Pj4+PiAgICAgICAg ICAgDQo+Pj4+Pj4NCj4+Pj4+Pj4od2hhdCBpcyBOVD8pDQo+Pj4+Pj4+DQo+Pj4+Pj4+d2hpY2gg b2YgdGhlc2UgKGdlbmJhbmssIHlvdXIgZmFzdGEsIE5SREIsIE5ULCBFU1QpIGhhdmUgeW91IGxv YWRlZCBpbnRvDQo+Pj4+Pj4+Z3VzPw0KPj4+Pj4+Pg0KPj4+Pj4+PnN0ZXZlDQo+Pj4+Pj4+DQo+ Pj4+Pj4+QWxiZXJ0byBEYXZpbGEgd3JvdGU6DQo+Pj4+Pj4+DQo+Pj4+Pj4+DQo+Pj4+Pj4+DQo+ Pj4+Pj4+ICAgICAgICAgICAgIA0KPj4+Pj4+Pg0KPj4+Pj4+Pj5RdWVyeToNCj4+Pj4+Pj4+DQo+ Pj4+Pj4+PkVpdGhlciBzZXF1ZW5jZXMgZnJvbSBnZW5iYW5rIChnZW5iYW5rIGZvcm1hdCkgb3Ig c2VxdWVuY2VzIGdlbmVyYXRlZA0KPj4+Pj4+Pj5pbg0KPj4+Pj4+Pj50aGUgbGFiIChmYXN0YSBm b3JtYXQpDQo+Pj4+Pj4+Pg0KPj4+Pj4+Pj5CbGFzdGFibGUgZGF0YWJhc2VzIChhbGwgYXJlIGZv cm1hdHRlZCBkYXRhYmFzZXMgZnJvbSBOQ0JJKToNCj4+Pj4+Pj4+DQo+Pj4+Pj4+Pk5SDQo+Pj4+ Pj4+Pk5UDQo+Pj4+Pj4+PkVTVA0KPj4+Pj4+Pj4NCj4+Pj4+Pj4+QWxiZXJ0bw0KPj4+Pj4+Pj4N Cj4+Pj4+Pj4+T24gRnJpLCAyMDA1LTAyLTExIGF0IDEwOjM0IC0wNTAwLCBTdGV2ZSBGaXNjaGVy IHdyb3RlOg0KPj4+Pj4+Pj4NCj4+Pj4+Pj4+DQo+Pj4+Pj4+Pg0KPj4+Pj4+Pj4NCj4+Pj4+Pj4+ ICAgICAgICAgICAgICAgDQo+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Zm9yIHRoZSBibGFzdCwgd2hhdCBh cmUgdGhlIHF1ZXJ5IHNlcXVlbmNlcyBhbmQgd2hhdCBhcmUgdGhlIGJsYXN0YWJsZQ0KPj4+Pj4+ Pj4+ZGF0YWJhc2VzPw0KPj4+Pj4+Pj4+DQo+Pj4+Pj4+Pj5zdGV2ZQ0KPj4+Pj4+Pj4+DQo+Pj4+ Pj4+Pj5BbGJlcnRvIERhdmlsYSB3cm90ZToNCj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+DQo+Pj4+Pj4+ Pj4NCj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+ICAgICAgICAgICAgICAgICANCj4+Pj4+Pj4+Pg0KPj4+ Pj4+Pj4+PkJhc2ljYWxseSB3ZSB3aWxsIHVzZSBzZXF1ZW5jZXMgKGxvYWRlZCBpbnRvIEdVUyB3 aXRoIHRoZSBHQlBhcnNlcikNCj4+Pj4+Pj4+Pj5mb3INCj4+Pj4+Pj4+Pj5OQ0JJIEJsYXN0IChC bGFzdHgsIEJsYXN0cCBhbmQgVEJsYXN0WCksIHRoZSBzYW1lIHNlcXVlbmNlcyB3aWxsIGJlDQo+ Pj4+Pj4+Pj4+YWxzbw0KPj4+Pj4+Pj4+PnVzZWQgZm9yIEludGVycHJvIGFuYWx5c2VzLiBSZXN1 bHRzIG9mIGJvdGggKEJsYXN0IGFuZCBJbnRlcnBybykgd2lsbA0KPj4+Pj4+Pj4+PmJlDQo+Pj4+ Pj4+Pj4+bG9hZGVkIGludG8gR1VTLiBXZSB3aWxsIHBhcnNlIHNwZWNpZmljIHRoaW5ncyBmcm9t IHRoZSBCbGFzdA0KPj4+Pj4+Pj4+PnJlc3VsdHMsIEkNCj4+Pj4+Pj4+Pj53b3VsZCBzYXk6DQo+ Pj4+Pj4+Pj4+DQo+Pj4+Pj4+Pj4+YEdpYCBgQWNjZXNzaW9uYCBgRGVzY3JpcHRpb25gIGBFX3Zh bHVlYCBgU2NvcmVgIGBMZW5ndGhgDQo+Pj4+Pj4+Pj4+YEZyYW1lX1F1ZXJ5YCBgRnJhbWVfSGl0 YCBgSWRlbnRpY2FsYCBgSHNwX0ZyYWNfSWRlbnRpY2FsYA0KPj4+Pj4+Pj4+PmBDb25zZXJ2ZWRg IGBIc3BfRnJhY19Db25zZXJ2ZWRgDQo+Pj4+Pj4+Pj4+YFF1ZXJ5X1N0YXJ0YA0KPj4+Pj4+Pj4+ PmBRdWVyeV9FbmRgIGBIaXRfU3RhcnRgIGBIaXRfRW5kYCBgSHNwX0FsaWduYCBgZGF0YWJhc2Vf bGV0dGVyc2ANCj4+Pj4+Pj4+Pj5gZGF0YWJhc2VfZW50cmllc2ANCj4+Pj4+Pj4+Pj5XZSBhbHJl YWR5IGhhdmUgYSBCaW9wZXJsIHBhcnNlciBmb3IgdGhhdCAoc3BlY2lmaWMgZm9yIGFub3RoZXIN Cj4+Pj4+Pj4+Pj5zeXN0ZW06DQo+Pj4+Pj4+Pj4+R0FSU0EpIHRoYXQgY291bGQgYmUgYWRhcHRl ZCB0byBHVVMsIHByb2JsZW0gYmVpbmcgd2UgYXJlIG5vdCBzdXJlDQo+Pj4+Pj4+Pj4+d2hhdA0K Pj4+Pj4+Pj4+PnRhYmxlcyBzaG91bGQgYmUgdXNlZCB0byBzdG9yZSB0aG9zZSBkYXRhIGluIEdV Uy4NCj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj5DaGVlcnMsIEFsYmVydG8NCj4+Pj4+Pj4+Pj4NCj4+ Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj5PbiBGcmksIDIwMDUtMDItMTEgYXQgMTA6MDYgLTA1MDAsIFN0 ZXZlIEZpc2NoZXIgd3JvdGU6DQo+Pj4+Pj4+Pj4+DQo+Pj4+Pj4+Pj4+DQo+Pj4+Pj4+Pj4+DQo+ Pj4+Pj4+Pj4+DQo+Pj4+Pj4+Pj4+DQo+Pj4+Pj4+Pj4+ICAgICAgICAgICAgICAgICAgIA0KPj4+ Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj53aGF0IGFyZSB5b3UgcGxhbm5pbmcgb24gYmxhc3Rpbmc/DQo+ Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj5zdGV2ZQ0KPj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+QWxi ZXJ0byBEYXZpbGEgd3JvdGU6DQo+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+ DQo+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+DQo+Pj4+Pj4+Pj4+PiAgICAg ICAgICAgICAgICAgICAgIA0KPj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+PkhpIFN0ZXZlLA0KPj4+ Pj4+Pj4+Pj4+DQo+Pj4+Pj4+Pj4+Pj5PbiBGcmksIDIwMDUtMDItMTEgYXQgMDg6NTYgLTA1MDAs IFN0ZXZlIEZpc2NoZXIgd3JvdGU6DQo+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+ Pj4+Pj4+DQo+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj4+DQo+Pj4+Pj4+ Pj4+Pj4gICAgICAgICAgICAgICAgICAgICAgIA0KPj4+Pj4+Pj4+Pj4+DQo+Pj4+Pj4+Pj4+Pj4+ cG9saWFuYS0NCj4+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pj5vb3BzLCB0aGUgdXNhZ2Ugc3Rh dGVtZW50IGZvciBMb2FkQmxhc3RTaW1GYXN0IGlzIG91dCBvZiBkYXRlLg0KPj4+Pj4+Pj4+Pj4+ Pml0IHNob3VsZCBpbnN0cnVjdCB5b3UgdG8gdXNlIHRoZSBibGFzdFNpbWlsYXJpdHkgY29tbWFu ZC4NCj4+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pj5Mb2FkQmxhc3RTaW1GYXN0IG1ha2VzIGEg YmlnIGFzc3VtcHRpb24sIHRoYXQgdGhlIHN1YmplY3QgYW5kDQo+Pj4+Pj4+Pj4+Pj4+cXVlcnkg c2VxdWVuY2VzIGFyZSBpbiBHVVMsIGFuZCB0aGVpciBkZWYuIGxpbmVzIGhhdmUgR1VTIHByaW1h cnkNCj4+Pj4+Pj4+Pj4+Pj5rZXlzLg0KPj4+Pj4+Pj4+Pj4+PkFyZSB5b3VyIHNlcXVlbmNlcyBh bHJlYWR5IGxvYWRlZCBpbnRvIEdVUz8NCj4+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pj4NCj4+ Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pj4N Cj4+Pj4+Pj4+Pj4+Pj4gICAgICAgICAgICAgICAgICAgICAgICAgDQo+Pj4+Pj4+Pj4+Pj4+DQo+ Pj4+Pj4+Pj4+Pj5UaGV5IGFyZSBub3QsIHRoZXJlIHdvdWxkIGJlIGFueSBob3d0by90aXBzIGZv ciB0aGF0IHBsdWdpbiA/IFdlDQo+Pj4+Pj4+Pj4+Pj53aWxsDQo+Pj4+Pj4+Pj4+Pj5jZXJ0YWlu bHkgbmVlZCBhIHBsdWdpbiB0byBsb2FkICJJbnRlcnBybyIgYW5kICJPUkYgZmluZGluZyINCj4+ Pj4+Pj4+Pj4+PnJlc3VsdHMNCj4+Pj4+Pj4+Pj4+PmludG8gR1VTLi4uIElmIHRoZXkgYXJlIG5v dCBhdmFpbGFibGUsIHRoZW4gbWF5YmUgd2Ugd2lsbCBoYXZlIHRvDQo+Pj4+Pj4+Pj4+Pj53cml0 ZQ0KPj4+Pj4+Pj4+Pj4+dGhlbSAuLi4NCj4+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj4+Q2hlZXJz LCBBbGJlcnRvDQo+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj4+DQo+Pj4+ Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj4+DQo+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+ Pj4+Pj4+PiAgICAgICAgICAgICAgICAgICAgICAgDQo+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+ Pj5zdGV2ZQ0KPj4+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj4+Pg0KPj4+ Pj4+Pj4+Pj4+PlBvbGlhbmEgTWF0ZXVzIHdyb3RlOg0KPj4+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+ Pj4+Pg0KPj4+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+ Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj4+Pg0KPj4+Pj4+Pj4+Pj4+PiAgICAgICAgICAgICAgICAgICAg ICAgICANCj4+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pj4+SGVsbG8gYWxsLA0KPj4+Pj4+Pj4+ Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pj4+V2hlcmUgY2FuIGZpbmQgdGhlIHNjcmlwdCBwYXJzZUJsYXN0 RmlsZXNGb3JTaW1pbGFyaXR5LnBsPz8NCj4+Pj4+Pj4+Pj4+Pj4+SSdtIHRyeWluZyB0byBydW4g TG9hZEJsYXN0U2ltRmFzdC4uLg0KPj4+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pj4+UG9saWFu YQ0KPj4+Pj4+Pj4+Pj4+Pj4NCj4+Pj4+Pj4+Pj4+Pj4+ICAgICAgICAgICAgICAgICAgICAgICAg ICAgDQo+Pj4+Pj4+Pj4+Pj4+Pg0KDQoNCg== |
From: davila <da...@io...> - 2005-02-15 01:32:50
|
SGkgU3RldmUsDQogDQpJIHdvbmRlciB0byBrbm93IGlmIHlvdSB0aGluayBpdCB3b3VsZCBiZSBp bnRlcmVzdGluZyB0byBleHBhbmQgdGhlICJTaW1pbGFyaXR5IGFuZCBTaW1pbGFyaXR5U3BhbiIg dGFibGVzID8gU29tZSBibGFzdCByZXN1bHRzLCANCmVnOiBxdWVyeV9zdHJpbmcsIGhpdF9zdHJp bmcsIGhvbW9sb2d5X3N0cmluZyBhbmQgYWxpZ25tZW50IGRvbsK0dCBhcHBlYXIgdG8gYmUgcmVw cmVzZW50ZWQgaW4gdGhvc2UgdGFibGVzIChvZiBjb3Vyc2UsIEkgbWlnaHQgYmUgd3JvbmcpLi4u DQogDQpJZGVhbGx5LCB0aG9zZSB0YWJsZXMgc2hvdWxkIGJlIGFibGUgdG8gc3RvcmUgbW9zdCBk YXRhIHBhcnNlZCBmcm9tIEJsYXN0IHJlc3VsdHMsIGFuIGV4YW1wbGUgb2YgbW9zdCBpbXBvcnRh bnQgZGF0YSBpcyBsaXN0ZWQgaW4gdGhlIEJpbzo6U2VhcmNoSU8gc3lzdGVtIG9mIEJpb3Blcmw6 IGh0dHA6Ly9iaW9wZXJsLm9yZy9IT1dUT3MvU2VhcmNoSU8vdXNlLmh0bWwNCiANCkNoZWVycywg QWxiZXJ0bw0KIA0KDQoJLS0tLS1NZW5zYWdlbSBvcmlnaW5hbC0tLS0tIA0KCURlOiBTdGV2ZSBG aXNjaGVyIFttYWlsdG86c2Zpc2NoZXJAcGNiaS51cGVubi5lZHVdIA0KCUVudmlhZGE6IHNlZyAx NC8yLzIwMDUgMTc6NTMgDQoJUGFyYTogUG9saWFuYSBNYXRldXMgDQoJQ2M6IGRhdmlsYTsgZ3Vz ZGV2LWd1c2RldkBsaXN0cy5zb3VyY2Vmb3JnZS5uZXQgDQoJQXNzdW50bzogUmU6IFtHdXNkZXYt Z3VzZGV2XSBwYXJzZUJsYXN0RmlsZXNGb3JTaW1pbGFyaXR5LnBsDQoJDQoJDQoJUG9saWFuYS0N CgkNCgl0aGUgb25seSBibGFzdCBwbHVnaW5zIHdlIGhhdmUgYXJlIExvYWRCbGFzdFNpbUZhc3Qg YW5kDQoJTG9hZEJsYXN0U2ltaWxhcml0eVBLLg0KCQ0KCXRoZSBvbmx5IHRhYmxlcyBhcmUgU2lt aWxhcml0eSBhbmQgU2ltaWxhcml0eVNwYW4NCgkNCglzdGV2ZQ0KCQ0KCVBvbGlhbmEgTWF0ZXVz IHdyb3RlOg0KCQ0KCT5IaSBTdGV2ZQ0KCT4NCgk+SSBuZWVkIHRvIGluc2VydCBnaXZlbiBpbiB0 aGUgR1VTIChyZXN1bHRlZCBibGFzdCkgYXM6DQoJPg0KCT4tLS0tLS0tLS0tLS0tLS0tLS0tLS0t LS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tDQoJPmV4dHJhY3RlZCBkYXRhIG9mIG91cnMg c2NyaXB0DQoJPi0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0t LS0tLS0NCgk+cXVlcnlfbmFtZQ0KCT5uYW1lDQoJPmFjY2Vzc2lvbg0KCT5kZXNjcmlwdGlvbg0K CT5zaWduaWZpY2FuY2UNCgk+cmF3X3Njb3JlDQoJPmxlbmd0aA0KCT5udW1faWRlbnRpY2FsDQoJ PmZyYWNfaWRlbnRpY2FsDQoJPm51bV9jb25zZXJ2ZWQNCgk+ZnJhY19jb25zZXJ2ZWQNCgk+c3Rh cnQoJ3F1ZXJ5JykNCgk+ZW5kKCdxdWVyeScpDQoJPnN0YXJ0KCdoaXQnKQ0KCT5lbmQoJ2hpdCcp DQoJPi0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0N Cgk+DQoJPkFuYWx5emluZyB0aGUgTG9hZEJsYXN0U2ltRmFzdCBQbHVnaW4gSSB2ZXJpZmllZCB0 aGF0IGl0IGluc2VydHMgaW4NCgk+dGFibGVzIERvVHMuU2ltaWxhcml0eSBhbmQgRG9Ucy5TeW1p bGFyaXR5U3BhbiwgYm90aCBvbmx5IGFjY2VwdCBnaXZlbg0KCT5udW1lcmljcy4NCgk+RXhpc3Rz IGludG8gR1VTIG90aGVyIHRhYmxlcyB0aGF0IHN0b3JlIHJlc3VsdGVkIG9mIEJsYXN0Pw0KCT4N Cgk+UG9saWFuYQ0KCT4NCgk+DQoJPg0KCT4NCgk+DQoJPg0KCT5PbiBGcmksIDExIEZlYiAyMDA1 IDEzOjUwOjMyIC0wNTAwLCBTdGV2ZSBGaXNjaGVyDQoJPjxzZmlzY2hlckBwY2JpLnVwZW5uLmVk dT4gd3JvdGU6DQoJPiANCgk+DQoJPj5zZWUgYmVsb3cNCgk+Pg0KCT4+QWxiZXJ0byBEYXZpbGEg d3JvdGU6DQoJPj4NCgk+PiAgIA0KCT4+DQoJPj4+V2UgYXJlIGRvaW5nIHRoaXMgZm9yIEdhcnNh IChhbm90aGVyIHN5c3RlbSkgLi4gYmFzaWNhbGx5IHdlIGhhdmUgYQ0KCT4+PmJpb3BlcmwgcGFy c2VyIChCaW86OlNlYXJjaDo6SU8pIHRoYXQgcmVhZHMgdGhlIEJsYXN0IHJlc3VsdHMgZmlsZSBh bmQNCgk+Pj5leHRyYWN0IGFsbCB0aGUgbmVlZGVkIGluZm8gKHRvIHRoZSAiQmxhc3RfSGl0IiB0 YWJsZSkuLi4gYW5kIGFsc28gbG9hZA0KCT4+PmludG8gYSBnaXZlbiB0YWJsZSAoZWc6IEV4dGVy bmFsX0RCKSBhbGwgdGhlIHNlcXVlbmNlcyAoaW4gZmFzdGEgZm9ybWF0KQ0KCT4+PnByZXNlbnRp bmcgc2ltaWxhcml0eSB3aXRoIHRoZSBxdWVyaWVzLi4uIGF0IHRoZSBlbmQgd2UgaGF2ZSAiQmxh c3RfSGl0Ig0KCT4+PmFuZCAiRXh0ZXJuYWxfREIiIHBvcHVsYXRlZCB3aXRoIHRoZSBzYW1lIHNj cmlwdC4NCgk+Pj4NCgk+Pj4NCgk+Pj4NCgk+Pj4gICAgIA0KCT4+Pg0KCT4+d293LCBncmVhdC4g IGNvdWxkIHlvdSBtYWtlIGEgZ3VzIHBsdWdpbiBmcm9tIHRoYXQ/DQoJPj4NCgk+PiAgIA0KCT4+ DQoJPj4+UmVnYXJkaW5nIEludGVycHJvIGFuZCBHbGltbWVyLCB0aGUgbWFpbiBwcm9ibGVtIGlz IHRvIGtub3cgaW4gd2hpY2gNCgk+Pj50YWJsZXMgd2Ugc2hvdWxkIGxvYWQgdGhlIHBhcnNlZCBy ZXN1bHRzID8NCgk+Pj4NCgk+Pj4NCgk+Pj4NCgk+Pj4gICAgIA0KCT4+Pg0KCT4+ZGVzY3JpYmUg dGhlIGluZm8geW91IHdhbnQgdG8gc3RvcmUuDQoJPj4NCgk+PnN0ZXZlDQoJPj4NCgk+PiAgIA0K CT4+DQoJPj4+QWxiZXJ0bw0KCT4+Pg0KCT4+Pk9uIEZyaSwgMjAwNS0wMi0xMSBhdCAxMzoyMSAt MDUwMCwgWS4gVGhvbWFzIEdhbiB3cm90ZToNCgk+Pj4NCgk+Pj4NCgk+Pj4gICAgIA0KCT4+Pg0K CT4+Pj5JIHdhcyBnb2luZyB0byBnaXZlIHRoZSBzYW1lIGFuc3dlciBzdGV2ZSBnYXZlIGZvciBp bnRlcnBybyBhbmQgZ2VuZQ0KCT4+Pj5maW5kaW5nIHJlc3VsdHMuDQoJPj4+Pg0KCT4+Pj5Gb3Ig bG9hZGluZyBzZXF1ZW5jZXMgaW50byBHVVMsIHRoZSBkaWxsZW1hIHdpdGggb3B0aW9uIDIgaXM6 IGhvdyBkbyB5b3UNCgk+Pj4+a25vdyB3aGljaCBzZXF1ZW5jZSB0byBsb2FkIHdoZW4geW91IGxv YWQgKHdoaWNoIGlzIGJlZm9yZSB5b3UgYWN0dWFsbHkNCgk+Pj4+aGF2ZSB0aGUgc2ltaWxhcml0 eSByZXN1bHQpPyBPbmUgc29sdXRpb24gd291bGQgYmUgdG8gaW5pdGlhbGx5IGxvYWQNCgk+Pj4+ Y29tcGxldGUgZGF0YXNldChzKSBidXQgZGVsZXRlIHRob3NlIHdpdGhvdXQgc2ltaWxhcml0eSBh ZnRlciBsb2FkaW5nDQoJPj4+PnNpbWlsYXJpdHkgcmVzdWx0cy4NCgk+Pj4+DQoJPj4+Pi1UaG9t YXMNCgk+Pj4+DQoJPj4+Pk9uIEZyaSwgMTEgRmViIDIwMDUsIFN0ZXZlIEZpc2NoZXIgd3JvdGU6 DQoJPj4+Pg0KCT4+Pj4NCgk+Pj4+DQoJPj4+PiAgICAgICANCgk+Pj4+DQoJPj4+Pj5hbGJlcnRv LQ0KCT4+Pj4+DQoJPj4+Pj53ZSd2ZSBuZXZlciBsb2FkZWQgaW50ZXJwcm8sIHNvIHRoZXJlIGlz bid0IGEgcGx1Z2luLg0KCT4+Pj4+aSBiZWxpZXZlIHBsYXNtb2RiIGhhcyBsb2FkZWQgZ2xpbW1l ciByZXN1bHRzLCB0aG91Z2ggaSdtIG5vdCBzdXJlLiAgIGkgaGF2ZQ0KCT4+Pj4+YXNrZWQgYSBw bGFzbW9kYiBkZXZlbG9wZXIgdG8gYW5zd2VyIHRoYXQgcXVlc3Rpb24uDQoJPj4+Pj4NCgk+Pj4+ PnN0ZXZlDQoJPj4+Pj4NCgk+Pj4+PkFsYmVydG8gRGF2aWxhIHdyb3RlOg0KCT4+Pj4+DQoJPj4+ Pj4NCgk+Pj4+Pg0KCT4+Pj4+ICAgICAgICAgDQoJPj4+Pj4NCgk+Pj4+Pj5IZXkgU3RldmUsIFRo b21hcywNCgk+Pj4+Pj4NCgk+Pj4+Pj5UaGFua3MgYSBsb3QgZm9yIHRoZSB0aXBzLCByZWFsbHkg aGVscGZ1bC4uIG5vdywgZmV3IG1vcmUgcXVlc3Rpb25zOg0KCT4+Pj4+Pg0KCT4+Pj4+Pg0KCT4+ Pj4+Pg0KCT4+Pj4+Pg0KCT4+Pj4+PiAgICAgICAgICAgDQoJPj4+Pj4+DQoJPj4+Pj4+Pm9rLiAg TlIgPSBOUkRCDQoJPj4+Pj4+Pg0KCT4+Pj4+Pj50aGUgd2F5IHdlIGhhdmUgdXNlZCBndXMgd2l0 aCBzaW1pbGFyaXRpZXMgaXMgdGhhdCBib3RoIHRoZSBxdWVyeSBhbmQNCgk+Pj4+Pj4+c3ViamVj dCBhcmUgbG9hZGVkIGludG8gZ3VzLiAgQXMgdGhvbWFzIGV4cGxhaW5lZCwgdGhlIHNpbWlsYXJp dHkgdGFibGUNCgk+Pj4+Pj4+Y2FwdHVyZXMgc2ltaWxhcml0eSBiZXR3ZWVuIHNlcXVlbmNlcyB0 aGF0IGFyZSBpbiBndXMuDQoJPj4+Pj4+Pm91ciBhcHByb2FjaCBoYXMgYWx3YXlzIGJlZW4gdG8g anVzdCBsb2FkICh3YXJlaG91c2UpIHRoZSBlbnRpcmUgc3ViamVjdA0KCT4+Pj4+Pj5kYXRhYmFz ZSAoTlIsIEVTVCkgdGhhdCB3ZSBhcmUgYmxhc3RpbmcgYWdhaW5zdC4NCgk+Pj4+Pj4+DQoJPj4+ Pj4+PnRoZSBjdXJyZW50IHBsdWdpbnMgYW5kIGJsYXN0U2ltaWxhcml0eSBhcmUgc2V0IHVwIGZv ciB0aGlzLg0KCT4+Pj4+Pj4NCgk+Pj4+Pj4+b2J2aW91c2x5LCB0aGlzIHRha2VzIGEgbG90IG9m IGRpc2sgc3BhY2UuICB0d28gbWFqb3IgZWZmaWNpZW5jaWVzIHRoYXQgd2UNCgk+Pj4+Pj4+ZG9u J3QgY3VycmVudGx5IGhhdmUgcGx1Z2lucyBmb3Igd291bGQgYmU6DQoJPj4+Pj4+PjEuIHRvIG9u bHkgc3RvcmUgaW4gZ3VzIGEgKnJlZmVyZW5jZSogdG8gdGhlIGV4dGVybmFsIHNlcXVlbmNlIChp ZSwgZG9uJ3QNCgk+Pj4+Pj4+c3RvcmUgdGhlIGFjdGdzKS4NCgk+Pj4+Pj4+Mi4gb25seSBzdG9y ZSBpbiBndXMgdGhlIHNlcXVlbmNlcyB0aGF0IGFjdHVhbGx5IGhhdmUgc2ltaWxhcml0aWVzDQoJ Pj4+Pj4+Pg0KCT4+Pj4+Pj4NCgk+Pj4+Pj4+DQoJPj4+Pj4+PiAgICAgICAgICAgICANCgk+Pj4+ Pj4+DQoJPj4+Pj4+T3B0aW9uIDIgc291bmQgYmV0dGVyIGZvciB1cywgc2luY2Ugd2Ugd2lsbCBi ZSBibGFzdGluZyBhZ2FpbnN0IHNldmVyYWwNCgk+Pj4+Pj5kYXRhYmFzZXMgKD4gMTBHQiBkYXRh YmFzZXMpDQoJPj4+Pj4+DQoJPj4+Pj4+V2hhdCBhYm91dCB0aGUgcGx1Z2lucyB0byBsb2FkIElu dGVycHJvIGFuZCAiZ2VuZSBmaW5kZXIiIChnbGltbWVyLCBldGMpDQoJPj4+Pj4+cmVzdWx0cyA/ IElzIHRoZXJlIGFueSBhdCBhbGwgPw0KCT4+Pj4+Pg0KCT4+Pj4+PkNoZWVycywgQWxiZXJ0bw0K CT4+Pj4+Pg0KCT4+Pj4+Pg0KCT4+Pj4+Pg0KCT4+Pj4+Pg0KCT4+Pj4+PiAgICAgICAgICAgDQoJ Pj4+Pj4+DQoJPj4+Pj4+PnN0ZXZlDQoJPj4+Pj4+Pg0KCT4+Pj4+Pj5BbGJlcnRvIERhdmlsYSB3 cm90ZToNCgk+Pj4+Pj4+DQoJPj4+Pj4+Pg0KCT4+Pj4+Pj4NCgk+Pj4+Pj4+DQoJPj4+Pj4+PiAg ICAgICAgICAgICANCgk+Pj4+Pj4+DQoJPj4+Pj4+Pj5BbGwgdGhlIGJsYXN0YWJsZSBkYXRhYmFz ZXMgSSBtZW50aW9uZWQgYXJlIHN0YW5kYXJkIGRhdGFiYXNlcyBmcm9tIE5DQkkNCgk+Pj4+Pj4+ PihmdHA6Ly9mdHAubmNiaS5ubG0ubmloLmdvdi9ibGFzdC9kYi9ibGFzdGRiLnR4dCk6DQoJPj4+ Pj4+Pj4NCgk+Pj4+Pj4+Pk5UID0gbnVjbGVvdGlkZXMNCgk+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+fjMw MDAwIGVudHJpZXMgZnJvbSBnZW5iYW5rIChnZW5iYW5rIGZvcm1hdCkgYXJlIGxvYWRlZCBpbnRv IEdVUyBub3cuDQoJPj4+Pj4+Pj4NCgk+Pj4+Pj4+Pk5vdCBzdXJlIGFib3V0IHlvdXIgIk5SREIi LCBJIGtub3cgTlIgZnJvbSBOQ0JJIHRoYXQgaXMgYSBjb2xsZWN0aW9uIG9mDQoJPj4+Pj4+Pj5h bWlub2FjaWQgZW50cmllcywgY291bGQgaXQgYmUgdGhlIHNhbWUgPw0KCT4+Pj4+Pj4+DQoJPj4+ Pj4+Pj5BbGJlcnRvDQoJPj4+Pj4+Pj4NCgk+Pj4+Pj4+Pk9uIEZyaSwgMjAwNS0wMi0xMSBhdCAx MDo0MyAtMDUwMCwgU3RldmUgRmlzY2hlciB3cm90ZToNCgk+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+DQoJ Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+DQoJPj4+Pj4+Pj4gICAgICAgICAgICAgICAN Cgk+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pih3aGF0IGlzIE5UPykNCgk+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+ Pj53aGljaCBvZiB0aGVzZSAoZ2VuYmFuaywgeW91ciBmYXN0YSwgTlJEQiwgTlQsIEVTVCkgaGF2 ZSB5b3UgbG9hZGVkIGludG8NCgk+Pj4+Pj4+Pj5ndXM/DQoJPj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+ c3RldmUNCgk+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj5BbGJlcnRvIERhdmlsYSB3cm90ZToNCgk+Pj4+ Pj4+Pj4NCgk+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4NCgk+ Pj4+Pj4+Pj4gICAgICAgICAgICAgICAgIA0KCT4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj5RdWVyeToN Cgk+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+PkVpdGhlciBzZXF1ZW5jZXMgZnJvbSBnZW5iYW5rIChn ZW5iYW5rIGZvcm1hdCkgb3Igc2VxdWVuY2VzIGdlbmVyYXRlZA0KCT4+Pj4+Pj4+Pj5pbg0KCT4+ Pj4+Pj4+Pj50aGUgbGFiIChmYXN0YSBmb3JtYXQpDQoJPj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj5C bGFzdGFibGUgZGF0YWJhc2VzIChhbGwgYXJlIGZvcm1hdHRlZCBkYXRhYmFzZXMgZnJvbSBOQ0JJ KToNCgk+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pk5SDQoJPj4+Pj4+Pj4+Pk5UDQoJPj4+Pj4+Pj4+ PkVTVA0KCT4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+QWxiZXJ0bw0KCT4+Pj4+Pj4+Pj4NCgk+Pj4+ Pj4+Pj4+T24gRnJpLCAyMDA1LTAyLTExIGF0IDEwOjM0IC0wNTAwLCBTdGV2ZSBGaXNjaGVyIHdy b3RlOg0KCT4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4N Cgk+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4gICAgICAgICAgICAgICAgICAg DQoJPj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Zm9yIHRoZSBibGFzdCwgd2hhdCBhcmUgdGhlIHF1 ZXJ5IHNlcXVlbmNlcyBhbmQgd2hhdCBhcmUgdGhlIGJsYXN0YWJsZQ0KCT4+Pj4+Pj4+Pj4+ZGF0 YWJhc2VzPw0KCT4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj5zdGV2ZQ0KCT4+Pj4+Pj4+Pj4+DQoJ Pj4+Pj4+Pj4+Pj5BbGJlcnRvIERhdmlsYSB3cm90ZToNCgk+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+ Pj4+DQoJPj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+ Pj4NCgk+Pj4+Pj4+Pj4+PiAgICAgICAgICAgICAgICAgICAgIA0KCT4+Pj4+Pj4+Pj4+DQoJPj4+ Pj4+Pj4+Pj4+QmFzaWNhbGx5IHdlIHdpbGwgdXNlIHNlcXVlbmNlcyAobG9hZGVkIGludG8gR1VT IHdpdGggdGhlIEdCUGFyc2VyKQ0KCT4+Pj4+Pj4+Pj4+PmZvcg0KCT4+Pj4+Pj4+Pj4+Pk5DQkkg Qmxhc3QgKEJsYXN0eCwgQmxhc3RwIGFuZCBUQmxhc3RYKSwgdGhlIHNhbWUgc2VxdWVuY2VzIHdp bGwgYmUNCgk+Pj4+Pj4+Pj4+Pj5hbHNvDQoJPj4+Pj4+Pj4+Pj4+dXNlZCBmb3IgSW50ZXJwcm8g YW5hbHlzZXMuIFJlc3VsdHMgb2YgYm90aCAoQmxhc3QgYW5kIEludGVycHJvKSB3aWxsDQoJPj4+ Pj4+Pj4+Pj4+YmUNCgk+Pj4+Pj4+Pj4+Pj5sb2FkZWQgaW50byBHVVMuIFdlIHdpbGwgcGFyc2Ug c3BlY2lmaWMgdGhpbmdzIGZyb20gdGhlIEJsYXN0DQoJPj4+Pj4+Pj4+Pj4+cmVzdWx0cywgSQ0K CT4+Pj4+Pj4+Pj4+PndvdWxkIHNheToNCgk+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj5gR2lg IGBBY2Nlc3Npb25gIGBEZXNjcmlwdGlvbmAgYEVfdmFsdWVgIGBTY29yZWAgYExlbmd0aGANCgk+ Pj4+Pj4+Pj4+Pj5gRnJhbWVfUXVlcnlgIGBGcmFtZV9IaXRgIGBJZGVudGljYWxgIGBIc3BfRnJh Y19JZGVudGljYWxgDQoJPj4+Pj4+Pj4+Pj4+YENvbnNlcnZlZGAgYEhzcF9GcmFjX0NvbnNlcnZl ZGANCgk+Pj4+Pj4+Pj4+Pj5gUXVlcnlfU3RhcnRgDQoJPj4+Pj4+Pj4+Pj4+YFF1ZXJ5X0VuZGAg YEhpdF9TdGFydGAgYEhpdF9FbmRgIGBIc3BfQWxpZ25gIGBkYXRhYmFzZV9sZXR0ZXJzYA0KCT4+ Pj4+Pj4+Pj4+PmBkYXRhYmFzZV9lbnRyaWVzYA0KCT4+Pj4+Pj4+Pj4+PldlIGFscmVhZHkgaGF2 ZSBhIEJpb3BlcmwgcGFyc2VyIGZvciB0aGF0IChzcGVjaWZpYyBmb3IgYW5vdGhlcg0KCT4+Pj4+ Pj4+Pj4+PnN5c3RlbToNCgk+Pj4+Pj4+Pj4+Pj5HQVJTQSkgdGhhdCBjb3VsZCBiZSBhZGFwdGVk IHRvIEdVUywgcHJvYmxlbSBiZWluZyB3ZSBhcmUgbm90IHN1cmUNCgk+Pj4+Pj4+Pj4+Pj53aGF0 DQoJPj4+Pj4+Pj4+Pj4+dGFibGVzIHNob3VsZCBiZSB1c2VkIHRvIHN0b3JlIHRob3NlIGRhdGEg aW4gR1VTLg0KCT4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+PkNoZWVycywgQWxiZXJ0bw0KCT4+ Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pk9uIEZyaSwgMjAwNS0wMi0x MSBhdCAxMDowNiAtMDUwMCwgU3RldmUgRmlzY2hlciB3cm90ZToNCgk+Pj4+Pj4+Pj4+Pj4NCgk+ Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4N Cgk+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4gICAgICAgICAgICAg ICAgICAgICAgIA0KCT4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj53aGF0IGFyZSB5b3UgcGxh bm5pbmcgb24gYmxhc3Rpbmc/DQoJPj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj5zdGV2ZQ0K CT4+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+QWxiZXJ0byBEYXZpbGEgd3JvdGU6DQoJPj4+ Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj4+ Pg0KCT4+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+ Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+ICAgICAgICAgICAgICAgICAgICAgICAgIA0KCT4+Pj4+ Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+PkhpIFN0ZXZlLA0KCT4+Pj4+Pj4+Pj4+Pj4+DQoJPj4+ Pj4+Pj4+Pj4+Pj5PbiBGcmksIDIwMDUtMDItMTEgYXQgMDg6NTYgLTA1MDAsIFN0ZXZlIEZpc2No ZXIgd3JvdGU6DQoJPj4+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+ Pj4+DQoJPj4+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+DQoJ Pj4+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+ICAgICAgICAg ICAgICAgICAgICAgICAgICAgDQoJPj4+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+Pj5wb2xp YW5hLQ0KCT4+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+Pm9vcHMsIHRoZSB1c2FnZSBz dGF0ZW1lbnQgZm9yIExvYWRCbGFzdFNpbUZhc3QgaXMgb3V0IG9mIGRhdGUuDQoJPj4+Pj4+Pj4+ Pj4+Pj4+aXQgc2hvdWxkIGluc3RydWN0IHlvdSB0byB1c2UgdGhlIGJsYXN0U2ltaWxhcml0eSBj b21tYW5kLg0KCT4+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+PkxvYWRCbGFzdFNpbUZh c3QgbWFrZXMgYSBiaWcgYXNzdW1wdGlvbiwgdGhhdCB0aGUgc3ViamVjdCBhbmQNCgk+Pj4+Pj4+ Pj4+Pj4+Pj5xdWVyeSBzZXF1ZW5jZXMgYXJlIGluIEdVUywgYW5kIHRoZWlyIGRlZi4gbGluZXMg aGF2ZSBHVVMgcHJpbWFyeQ0KCT4+Pj4+Pj4+Pj4+Pj4+PmtleXMuDQoJPj4+Pj4+Pj4+Pj4+Pj4+ QXJlIHlvdXIgc2VxdWVuY2VzIGFscmVhZHkgbG9hZGVkIGludG8gR1VTPw0KCT4+Pj4+Pj4+Pj4+ Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+ Pg0KCT4+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+Pg0K CT4+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+PiAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgDQoJPj4+Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj4+Pj5UaGV5IGFyZSBub3QsIHRo ZXJlIHdvdWxkIGJlIGFueSBob3d0by90aXBzIGZvciB0aGF0IHBsdWdpbiA/IFdlDQoJPj4+Pj4+ Pj4+Pj4+Pj53aWxsDQoJPj4+Pj4+Pj4+Pj4+Pj5jZXJ0YWlubHkgbmVlZCBhIHBsdWdpbiB0byBs b2FkICJJbnRlcnBybyIgYW5kICJPUkYgZmluZGluZyINCgk+Pj4+Pj4+Pj4+Pj4+PnJlc3VsdHMN Cgk+Pj4+Pj4+Pj4+Pj4+PmludG8gR1VTLi4uIElmIHRoZXkgYXJlIG5vdCBhdmFpbGFibGUsIHRo ZW4gbWF5YmUgd2Ugd2lsbCBoYXZlIHRvDQoJPj4+Pj4+Pj4+Pj4+Pj53cml0ZQ0KCT4+Pj4+Pj4+ Pj4+Pj4+dGhlbSAuLi4NCgk+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+Q2hlZXJzLCBB bGJlcnRvDQoJPj4+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+ DQoJPj4+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+DQoJPj4+ Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+ Pj4+Pj4gICAgICAgICAgICAgICAgICAgICAgICAgICANCgk+Pj4+Pj4+Pj4+Pj4+Pg0KCT4+Pj4+ Pj4+Pj4+Pj4+PnN0ZXZlDQoJPj4+Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj4+Pj4+DQoJPj4+ Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj4+Pj4+UG9saWFuYSBNYXRldXMgd3JvdGU6DQoJPj4+ Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+ Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+ Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj4+Pj4+DQoJPj4+Pj4+Pj4+Pj4+ Pj4+ICAgICAgICAgICAgICAgICAgICAgICAgICAgICANCgk+Pj4+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+ Pj4+Pj4+Pj4+Pj4+SGVsbG8gYWxsLA0KCT4+Pj4+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+ Pj4+V2hlcmUgY2FuIGZpbmQgdGhlIHNjcmlwdCBwYXJzZUJsYXN0RmlsZXNGb3JTaW1pbGFyaXR5 LnBsPz8NCgk+Pj4+Pj4+Pj4+Pj4+Pj4+SSdtIHRyeWluZyB0byBydW4gTG9hZEJsYXN0U2ltRmFz dC4uLg0KCT4+Pj4+Pj4+Pj4+Pj4+Pj4NCgk+Pj4+Pj4+Pj4+Pj4+Pj4+UG9saWFuYQ0KCT4+Pj4+ Pj4+Pj4+Pj4+Pj4NCgkNCg0K |
From: Aaron J. M. <am...@pc...> - 2005-02-15 02:33:30
|
FYI, if you wish to store details of a pairwise sequence alignment, =20 it's arguably preferable to not store three separate alignment strings =20= (query-with-gaps, target-with-gaps and similarity string), since these =20= are not quite independent attributes (you can not truly change one =20 without changing the others, in some sense), but rather to save only =20 the location of gaps (requiring one to reconstruct the various =20 alignment strings if/when necessary). EnsEMBL chose to use the "CIGAR" =20= string representation of an alignment; this format has made its way =20 into GFF3 as well: http://www.ensembl.org/Docs/wiki/html/EnsemblDocs/CigarFormat.html The FASTA programs have a similar (but more expressive) alignment =20 "encoding" that includes possibilities for forward and backwards =20 frameshifts (i.e. for protein-to-DNA alignments). Either of these =20 encodings are also (somewhat) more "computable" than the raw string =20 alignment representation. -Aaron On Feb 14, 2005, at 8:27 PM, davila wrote: > Hi Steve, > > I wonder to know if you think it would be interesting to expand the =20= > "Similarity and SimilaritySpan" tables ? Some blast results, > eg: query_string, hit_string, homology_string and alignment don=C2=B4t = =20 > appear to be represented in those tables (of course, I might be =20 > wrong)... > > Ideally, those tables should be able to store most data parsed from =20= > Blast results, an example of most important data is listed in the =20 > Bio::SearchIO system of Bioperl: =20 > http://bioperl.org/HOWTOs/SearchIO/use.html > > Cheers, Alberto > > > -----Mensagem original----- > De: Steve Fischer [mailto:sfi...@pc...] > Enviada: seg 14/2/2005 17:53 > Para: Poliana Mateus > Cc: davila; gus...@li... > Assunto: Re: [Gusdev-gusdev] parseBlastFilesForSimilarity.pl > =09 > =09 > Poliana- > =09 > the only blast plugins we have are LoadBlastSimFast and > LoadBlastSimilarityPK. > =09 > the only tables are Similarity and SimilaritySpan > =09 > steve > =09 > Poliana Mateus wrote: > =09 > >Hi Steve > > > >I need to insert given in the GUS (resulted blast) as: > > > >---------------------------------------------------- > >extracted data of ours script > >---------------------------------------------------- > >query_name > >name > >accession > >description > >significance > >raw_score > >length > >num_identical > >frac_identical > >num_conserved > >frac_conserved > >start('query') > >end('query') > >start('hit') > >end('hit') > >---------------------------------------------------- > > > >Analyzing the LoadBlastSimFast Plugin I verified that it = inserts in > >tables DoTs.Similarity and DoTs.SymilaritySpan, both only = accept =20 > given > >numerics. > >Exists into GUS other tables that store resulted of Blast? > > > >Poliana > > > > > > > > > > > > > >On Fri, 11 Feb 2005 13:50:32 -0500, Steve Fischer > ><sfi...@pc...> wrote: > > > > > >>see below > >> > >>Alberto Davila wrote: > >> > >> > >> > >>>We are doing this for Garsa (another system) .. basically we = have a > >>>bioperl parser (Bio::Search::IO) that reads the Blast results = file =20 > and > >>>extract all the needed info (to the "Blast_Hit" table)... and = also =20 > load > >>>into a given table (eg: External_DB) all the sequences (in = fasta =20 > format) > >>>presenting similarity with the queries... at the end we have =20= > "Blast_Hit" > >>>and "External_DB" populated with the same script. > >>> > >>> > >>> > >>> > >>> > >>wow, great. could you make a gus plugin from that? > >> > >> > >> > >>>Regarding Interpro and Glimmer, the main problem is to know = in =20 > which > >>>tables we should load the parsed results ? > >>> > >>> > >>> > >>> > >>> > >>describe the info you want to store. > >> > >>steve > >> > >> > >> > >>>Alberto > >>> > >>>On Fri, 2005-02-11 at 13:21 -0500, Y. Thomas Gan wrote: > >>> > >>> > >>> > >>> > >>>>I was going to give the same answer steve gave for interpro = and =20 > gene > >>>>finding results. > >>>> > >>>>For loading sequences into GUS, the dillema with option 2 = is: how =20 > do you > >>>>know which sequence to load when you load (which is before = you =20 > actually > >>>>have the similarity result)? One solution would be to = initially =20 > load > >>>>complete dataset(s) but delete those without similarity = after =20 > loading > >>>>similarity results. > >>>> > >>>>-Thomas > >>>> > >>>>On Fri, 11 Feb 2005, Steve Fischer wrote: > >>>> > >>>> > >>>> > >>>> > >>>> > >>>>>alberto- > >>>>> > >>>>>we've never loaded interpro, so there isn't a plugin. > >>>>>i believe plasmodb has loaded glimmer results, though i'm = not =20 > sure. i have > >>>>>asked a plasmodb developer to answer that question. > >>>>> > >>>>>steve > >>>>> > >>>>>Alberto Davila wrote: > >>>>> > >>>>> > >>>>> > >>>>> > >>>>> > >>>>>>Hey Steve, Thomas, > >>>>>> > >>>>>>Thanks a lot for the tips, really helpful.. now, few more =20= > questions: > >>>>>> > >>>>>> > >>>>>> > >>>>>> > >>>>>> > >>>>>> > >>>>>>>ok. NR =3D NRDB > >>>>>>> > >>>>>>>the way we have used gus with similarities is that both = the =20 > query and > >>>>>>>subject are loaded into gus. As thomas explained, the =20= > similarity table > >>>>>>>captures similarity between sequences that are in gus. > >>>>>>>our approach has always been to just load (warehouse) the = =20 > entire subject > >>>>>>>database (NR, EST) that we are blasting against. > >>>>>>> > >>>>>>>the current plugins and blastSimilarity are set up for = this. > >>>>>>> > >>>>>>>obviously, this takes a lot of disk space. two major =20 > efficiencies that we > >>>>>>>don't currently have plugins for would be: > >>>>>>>1. to only store in gus a *reference* to the external = sequence =20 > (ie, don't > >>>>>>>store the actgs). > >>>>>>>2. only store in gus the sequences that actually have =20 > similarities > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>>Option 2 sound better for us, since we will be blasting = against =20 > several > >>>>>>databases (> 10GB databases) > >>>>>> > >>>>>>What about the plugins to load Interpro and "gene finder" =20= > (glimmer, etc) > >>>>>>results ? Is there any at all ? > >>>>>> > >>>>>>Cheers, Alberto > >>>>>> > >>>>>> > >>>>>> > >>>>>> > >>>>>> > >>>>>> > >>>>>>>steve > >>>>>>> > >>>>>>>Alberto Davila wrote: > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>>>>All the blastable databases I mentioned are standard =20 > databases from NCBI > >>>>>>>>(ftp://ftp.ncbi.nlm.nih.gov/blast/db/blastdb.txt): > >>>>>>>> > >>>>>>>>NT =3D nucleotides > >>>>>>>> > >>>>>>>>~30000 entries from genbank (genbank format) are loaded = into =20 > GUS now. > >>>>>>>> > >>>>>>>>Not sure about your "NRDB", I know NR from NCBI that is = a =20 > collection of > >>>>>>>>aminoacid entries, could it be the same ? > >>>>>>>> > >>>>>>>>Alberto > >>>>>>>> > >>>>>>>>On Fri, 2005-02-11 at 10:43 -0500, Steve Fischer wrote: > >>>>>>>> > >>>>>>>> > >>>>>>>> > >>>>>>>> > >>>>>>>> > >>>>>>>> > >>>>>>>> > >>>>>>>>>(what is NT?) > >>>>>>>>> > >>>>>>>>>which of these (genbank, your fasta, NRDB, NT, EST) = have you =20 > loaded into > >>>>>>>>>gus? > >>>>>>>>> > >>>>>>>>>steve > >>>>>>>>> > >>>>>>>>>Alberto Davila wrote: > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> > >>>>>>>>>>Query: > >>>>>>>>>> > >>>>>>>>>>Either sequences from genbank (genbank format) or = sequences =20 > generated > >>>>>>>>>>in > >>>>>>>>>>the lab (fasta format) > >>>>>>>>>> > >>>>>>>>>>Blastable databases (all are formatted databases from = NCBI): > >>>>>>>>>> > >>>>>>>>>>NR > >>>>>>>>>>NT > >>>>>>>>>>EST > >>>>>>>>>> > >>>>>>>>>>Alberto > >>>>>>>>>> > >>>>>>>>>>On Fri, 2005-02-11 at 10:34 -0500, Steve Fischer = wrote: > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>>>for the blast, what are the query sequences and what = are =20 > the blastable > >>>>>>>>>>>databases? > >>>>>>>>>>> > >>>>>>>>>>>steve > >>>>>>>>>>> > >>>>>>>>>>>Alberto Davila wrote: > >>>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>>>Basically we will use sequences (loaded into GUS = with the =20 > GBParser) > >>>>>>>>>>>>for > >>>>>>>>>>>>NCBI Blast (Blastx, Blastp and TBlastX), the same =20= > sequences will be > >>>>>>>>>>>>also > >>>>>>>>>>>>used for Interpro analyses. Results of both (Blast = and =20 > Interpro) will > >>>>>>>>>>>>be > >>>>>>>>>>>>loaded into GUS. We will parse specific things from = the =20 > Blast > >>>>>>>>>>>>results, I > >>>>>>>>>>>>would say: > >>>>>>>>>>>> > >>>>>>>>>>>>`Gi` `Accession` `Description` `E_value` `Score` = `Length` > >>>>>>>>>>>>`Frame_Query` `Frame_Hit` `Identical` = `Hsp_Frac_Identical` > >>>>>>>>>>>>`Conserved` `Hsp_Frac_Conserved` > >>>>>>>>>>>>`Query_Start` > >>>>>>>>>>>>`Query_End` `Hit_Start` `Hit_End` `Hsp_Align` =20 > `database_letters` > >>>>>>>>>>>>`database_entries` > >>>>>>>>>>>>We already have a Bioperl parser for that (specific = for =20 > another > >>>>>>>>>>>>system: > >>>>>>>>>>>>GARSA) that could be adapted to GUS, problem being = we are =20 > not sure > >>>>>>>>>>>>what > >>>>>>>>>>>>tables should be used to store those data in GUS. > >>>>>>>>>>>> > >>>>>>>>>>>>Cheers, Alberto > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>>On Fri, 2005-02-11 at 10:06 -0500, Steve Fischer = wrote: > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>>>what are you planning on blasting? > >>>>>>>>>>>>> > >>>>>>>>>>>>>steve > >>>>>>>>>>>>> > >>>>>>>>>>>>>Alberto Davila wrote: > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>>>Hi Steve, > >>>>>>>>>>>>>> > >>>>>>>>>>>>>>On Fri, 2005-02-11 at 08:56 -0500, Steve Fischer = wrote: > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>>>poliana- > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>oops, the usage statement for LoadBlastSimFast is = out =20 > of date. > >>>>>>>>>>>>>>>it should instruct you to use the blastSimilarity = =20 > command. > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>LoadBlastSimFast makes a big assumption, that the = =20 > subject and > >>>>>>>>>>>>>>>query sequences are in GUS, and their def. lines = have =20 > GUS primary > >>>>>>>>>>>>>>>keys. > >>>>>>>>>>>>>>>Are your sequences already loaded into GUS? > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>They are not, there would be any howto/tips for = that =20 > plugin ? We > >>>>>>>>>>>>>>will > >>>>>>>>>>>>>>certainly need a plugin to load "Interpro" and = "ORF =20 > finding" > >>>>>>>>>>>>>>results > >>>>>>>>>>>>>>into GUS... If they are not available, then maybe = we =20 > will have to > >>>>>>>>>>>>>>write > >>>>>>>>>>>>>>them ... > >>>>>>>>>>>>>> > >>>>>>>>>>>>>>Cheers, Alberto > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>>>steve > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>Poliana Mateus wrote: > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>Hello all, > >>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>Where can find the script =20 > parseBlastFilesForSimilarity.pl?? > >>>>>>>>>>>>>>>>I'm trying to run LoadBlastSimFast... > >>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>Poliana > >>>>>>>>>>>>>>>> > =09 > > =FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF= =FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=D2=15=E9=9A=8AX=AC=B2=9A= '=B2=8A=DEu=BC=FFN=17=88L=FA=E8v=E7-=20 > = =1A=E8=9Dy=17=9Av=1A'z=CB=FFq=A9=DD=89=DA=DE=BE'=B0=B2=89=E1=BAwky=DB(|=84= =CF=AE=87nr=DB=1F=AE=89=ABy=A9n=B1=EA=EC=FC8=ACr=8B=DE=AF=08br=1Ak=A1=DB=9C= =B6=CBk=BA\=A5=8A=F7=20 > =AE=A6=DA- > =E8r=A5=EF=D2=B5=AA=ED=AD=20 > = =E6=9D=8Ax'=A3=0F=E1=B6=DA=FF=FF=F6=9D=B3=FA,v=7F=DC=A2o=FFi=DF=E2=F7=9F=DA= =96Z=1C=FE'=D7=8D=FD=EB=FA)rO=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF= =FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=FF=20 > =FF=FF=FF=FF=FF=FCk=ACu=EB=FF=82=EB=1Dz=F9=9A=8AX=A7=82X=AC=B4k=ACu=EB=FF= =82=EB=1Dz=FF=E5=8A=CBl=FE=CA.=AD=C7=9F=A2=B8=1E=FEw=AD=86=DBi=B3=FF=FF=96= +-=20 > =B3=FB(=BA=B7=1E~=8A=E0{=F9=DE=B7=F9b=B2=DB?=96+-=8Aw=E8=FE=0B=ACu=EB=FF= =82=EB=1D > -- Aaron J. Mackey, Ph.D. Dept. of Biology, Goddard 212 University of Pennsylvania email: am...@pc... 415 S. University Avenue office: 215-898-1205 Philadelphia, PA 19104-6017 fax: 215-746-6697 |
From: Steve F. <sfi...@pc...> - 2005-02-15 02:17:32
|
alberto- you're right. the similarity tables in gus capture the essence of the=20 similarity, not the details. am i correct in thinking that the information you are describing is in a=20 1-1 relationship with a SimilaritySpan? If so, you could prototype your idea by adding a table called=20 SimilaritySpanDetails to your gus. It would have a link to SimilaritySpa= n. steve davila wrote: >Hi Steve, >=20 >I wonder to know if you think it would be interesting to expand the "Sim= ilarity and SimilaritySpan" tables ? Some blast results,=20 >eg: query_string, hit_string, homology_string and alignment don=C2=B4t a= ppear to be represented in those tables (of course, I might be wrong)... >=20 >Ideally, those tables should be able to store most data parsed from Blas= t results, an example of most important data is listed in the Bio::Search= IO system of Bioperl: http://bioperl.org/HOWTOs/SearchIO/use.html >=20 >Cheers, Alberto >=20 > > -----Mensagem original-----=20 > De: Steve Fischer [mailto:sfi...@pc...]=20 > Enviada: seg 14/2/2005 17:53=20 > Para: Poliana Mateus=20 > Cc: davila; gus...@li...=20 > Assunto: Re: [Gusdev-gusdev] parseBlastFilesForSimilarity.pl >=09 >=09 > Poliana- >=09 > the only blast plugins we have are LoadBlastSimFast and > LoadBlastSimilarityPK. >=09 > the only tables are Similarity and SimilaritySpan >=09 > steve >=09 > Poliana Mateus wrote: >=09 > >Hi Steve > > > >I need to insert given in the GUS (resulted blast) as: > > > >---------------------------------------------------- > >extracted data of ours script > >---------------------------------------------------- > >query_name > >name > >accession > >description > >significance > >raw_score > >length > >num_identical > >frac_identical > >num_conserved > >frac_conserved > >start('query') > >end('query') > >start('hit') > >end('hit') > >---------------------------------------------------- > > > >Analyzing the LoadBlastSimFast Plugin I verified that it inserts in > >tables DoTs.Similarity and DoTs.SymilaritySpan, both only accept given > >numerics. > >Exists into GUS other tables that store resulted of Blast? > > > >Poliana > > > > > > > > > > > > > >On Fri, 11 Feb 2005 13:50:32 -0500, Steve Fischer > ><sfi...@pc...> wrote: > >=20 > > > >>see below > >> > >>Alberto Davila wrote: > >> > >> =20 > >> > >>>We are doing this for Garsa (another system) .. basically we have a > >>>bioperl parser (Bio::Search::IO) that reads the Blast results file a= nd > >>>extract all the needed info (to the "Blast_Hit" table)... and also l= oad > >>>into a given table (eg: External_DB) all the sequences (in fasta for= mat) > >>>presenting similarity with the queries... at the end we have "Blast_= Hit" > >>>and "External_DB" populated with the same script. > >>> > >>> > >>> > >>> =20 > >>> > >>wow, great. could you make a gus plugin from that? > >> > >> =20 > >> > >>>Regarding Interpro and Glimmer, the main problem is to know in which > >>>tables we should load the parsed results ? > >>> > >>> > >>> > >>> =20 > >>> > >>describe the info you want to store. > >> > >>steve > >> > >> =20 > >> > >>>Alberto > >>> > >>>On Fri, 2005-02-11 at 13:21 -0500, Y. Thomas Gan wrote: > >>> > >>> > >>> =20 > >>> > >>>>I was going to give the same answer steve gave for interpro and gen= e > >>>>finding results. > >>>> > >>>>For loading sequences into GUS, the dillema with option 2 is: how d= o you > >>>>know which sequence to load when you load (which is before you actu= ally > >>>>have the similarity result)? One solution would be to initially loa= d > >>>>complete dataset(s) but delete those without similarity after loadi= ng > >>>>similarity results. > >>>> > >>>>-Thomas > >>>> > >>>>On Fri, 11 Feb 2005, Steve Fischer wrote: > >>>> > >>>> > >>>> > >>>> =20 > >>>> > >>>>>alberto- > >>>>> > >>>>>we've never loaded interpro, so there isn't a plugin. > >>>>>i believe plasmodb has loaded glimmer results, though i'm not sure= . i have > >>>>>asked a plasmodb developer to answer that question. > >>>>> > >>>>>steve > >>>>> > >>>>>Alberto Davila wrote: > >>>>> > >>>>> > >>>>> > >>>>> =20 > >>>>> > >>>>>>Hey Steve, Thomas, > >>>>>> > >>>>>>Thanks a lot for the tips, really helpful.. now, few more questio= ns: > >>>>>> > >>>>>> > >>>>>> > >>>>>> > >>>>>> =20 > >>>>>> > >>>>>>>ok. NR =3D NRDB > >>>>>>> > >>>>>>>the way we have used gus with similarities is that both the quer= y and > >>>>>>>subject are loaded into gus. As thomas explained, the similarit= y table > >>>>>>>captures similarity between sequences that are in gus. > >>>>>>>our approach has always been to just load (warehouse) the entire= subject > >>>>>>>database (NR, EST) that we are blasting against. > >>>>>>> > >>>>>>>the current plugins and blastSimilarity are set up for this. > >>>>>>> > >>>>>>>obviously, this takes a lot of disk space. two major efficienci= es that we > >>>>>>>don't currently have plugins for would be: > >>>>>>>1. to only store in gus a *reference* to the external sequence (= ie, don't > >>>>>>>store the actgs). > >>>>>>>2. only store in gus the sequences that actually have similariti= es > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>>> =20 > >>>>>>> > >>>>>>Option 2 sound better for us, since we will be blasting against s= everal > >>>>>>databases (> 10GB databases) > >>>>>> > >>>>>>What about the plugins to load Interpro and "gene finder" (glimme= r, etc) > >>>>>>results ? Is there any at all ? > >>>>>> > >>>>>>Cheers, Alberto > >>>>>> > >>>>>> > >>>>>> > >>>>>> > >>>>>> =20 > >>>>>> > >>>>>>>steve > >>>>>>> > >>>>>>>Alberto Davila wrote: > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>>> =20 > >>>>>>> > >>>>>>>>All the blastable databases I mentioned are standard databases = from NCBI > >>>>>>>>(ftp://ftp.ncbi.nlm.nih.gov/blast/db/blastdb.txt): > >>>>>>>> > >>>>>>>>NT =3D nucleotides > >>>>>>>> > >>>>>>>>~30000 entries from genbank (genbank format) are loaded into GU= S now. > >>>>>>>> > >>>>>>>>Not sure about your "NRDB", I know NR from NCBI that is a colle= ction of > >>>>>>>>aminoacid entries, could it be the same ? > >>>>>>>> > >>>>>>>>Alberto > >>>>>>>> > >>>>>>>>On Fri, 2005-02-11 at 10:43 -0500, Steve Fischer wrote: > >>>>>>>> > >>>>>>>> > >>>>>>>> > >>>>>>>> > >>>>>>>> > >>>>>>>> =20 > >>>>>>>> > >>>>>>>>>(what is NT?) > >>>>>>>>> > >>>>>>>>>which of these (genbank, your fasta, NRDB, NT, EST) have you l= oaded into > >>>>>>>>>gus? > >>>>>>>>> > >>>>>>>>>steve > >>>>>>>>> > >>>>>>>>>Alberto Davila wrote: > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> > >>>>>>>>> =20 > >>>>>>>>> > >>>>>>>>>>Query: > >>>>>>>>>> > >>>>>>>>>>Either sequences from genbank (genbank format) or sequences g= enerated > >>>>>>>>>>in > >>>>>>>>>>the lab (fasta format) > >>>>>>>>>> > >>>>>>>>>>Blastable databases (all are formatted databases from NCBI): > >>>>>>>>>> > >>>>>>>>>>NR > >>>>>>>>>>NT > >>>>>>>>>>EST > >>>>>>>>>> > >>>>>>>>>>Alberto > >>>>>>>>>> > >>>>>>>>>>On Fri, 2005-02-11 at 10:34 -0500, Steve Fischer wrote: > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> =20 > >>>>>>>>>> > >>>>>>>>>>>for the blast, what are the query sequences and what are the= blastable > >>>>>>>>>>>databases? > >>>>>>>>>>> > >>>>>>>>>>>steve > >>>>>>>>>>> > >>>>>>>>>>>Alberto Davila wrote: > >>>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> =20 > >>>>>>>>>>> > >>>>>>>>>>>>Basically we will use sequences (loaded into GUS with the G= BParser) > >>>>>>>>>>>>for > >>>>>>>>>>>>NCBI Blast (Blastx, Blastp and TBlastX), the same sequences= will be > >>>>>>>>>>>>also > >>>>>>>>>>>>used for Interpro analyses. Results of both (Blast and Inte= rpro) will > >>>>>>>>>>>>be > >>>>>>>>>>>>loaded into GUS. We will parse specific things from the Bla= st > >>>>>>>>>>>>results, I > >>>>>>>>>>>>would say: > >>>>>>>>>>>> > >>>>>>>>>>>>`Gi` `Accession` `Description` `E_value` `Score` `Length` > >>>>>>>>>>>>`Frame_Query` `Frame_Hit` `Identical` `Hsp_Frac_Identical` > >>>>>>>>>>>>`Conserved` `Hsp_Frac_Conserved` > >>>>>>>>>>>>`Query_Start` > >>>>>>>>>>>>`Query_End` `Hit_Start` `Hit_End` `Hsp_Align` `database_let= ters` > >>>>>>>>>>>>`database_entries` > >>>>>>>>>>>>We already have a Bioperl parser for that (specific for ano= ther > >>>>>>>>>>>>system: > >>>>>>>>>>>>GARSA) that could be adapted to GUS, problem being we are n= ot sure > >>>>>>>>>>>>what > >>>>>>>>>>>>tables should be used to store those data in GUS. > >>>>>>>>>>>> > >>>>>>>>>>>>Cheers, Alberto > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>>On Fri, 2005-02-11 at 10:06 -0500, Steve Fischer wrote: > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> > >>>>>>>>>>>> =20 > >>>>>>>>>>>> > >>>>>>>>>>>>>what are you planning on blasting? > >>>>>>>>>>>>> > >>>>>>>>>>>>>steve > >>>>>>>>>>>>> > >>>>>>>>>>>>>Alberto Davila wrote: > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> > >>>>>>>>>>>>> =20 > >>>>>>>>>>>>> > >>>>>>>>>>>>>>Hi Steve, > >>>>>>>>>>>>>> > >>>>>>>>>>>>>>On Fri, 2005-02-11 at 08:56 -0500, Steve Fischer wrote: > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> =20 > >>>>>>>>>>>>>> > >>>>>>>>>>>>>>>poliana- > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>oops, the usage statement for LoadBlastSimFast is out of= date. > >>>>>>>>>>>>>>>it should instruct you to use the blastSimilarity comman= d. > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>LoadBlastSimFast makes a big assumption, that the subjec= t and > >>>>>>>>>>>>>>>query sequences are in GUS, and their def. lines have GU= S primary > >>>>>>>>>>>>>>>keys. > >>>>>>>>>>>>>>>Are your sequences already loaded into GUS? > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> =20 > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>They are not, there would be any howto/tips for that plug= in ? We > >>>>>>>>>>>>>>will > >>>>>>>>>>>>>>certainly need a plugin to load "Interpro" and "ORF findi= ng" > >>>>>>>>>>>>>>results > >>>>>>>>>>>>>>into GUS... If they are not available, then maybe we will= have to > >>>>>>>>>>>>>>write > >>>>>>>>>>>>>>them ... > >>>>>>>>>>>>>> > >>>>>>>>>>>>>>Cheers, Alberto > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> =20 > >>>>>>>>>>>>>> > >>>>>>>>>>>>>>>steve > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>Poliana Mateus wrote: > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>> =20 > >>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>Hello all, > >>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>Where can find the script parseBlastFilesForSimilarity.= pl?? > >>>>>>>>>>>>>>>>I'm trying to run LoadBlastSimFast... > >>>>>>>>>>>>>>>> > >>>>>>>>>>>>>>>>Poliana > >>>>>>>>>>>>>>>> >=09 > > =20 > |
From: Steve F. <sfi...@pc...> - 2005-02-11 18:35:29
|
I think the way to do it would be to make a new plugin LoadSimilaritySequences. It would be like LoadBlastSimFast and LoadBlastSimilaritiesPK, in that it wouldread the output of blastSimilarity. But, unlike them, the subject sequences would have source_ids not na_sequence_ids. (the query sequences would still be stored in gus and extracted using dumpSequencesFromTable) the plugin would: - take as an argument the ExternalDatabase and its Version (eg, NRDB 1.3) - call the plugin superclass's getExtDbRelId() to get the external_database_release_id. - use that id to query to get all (source_id, na_sequence_id) pairs that exist for that external database release - put that in a hash, with source_id as key - take as an argument the file holding the similarities - optionally take as an argument a fasta.gz file holding the subject database. - run through all the similarities in the input file. - if a subject sequence is not already in the db (not found in hash), add it (optionally including the actgs if the fasta file is provided) - then, use that sequence's na_sequence_id to form the Similarity steve Y. Thomas Gan wrote: > I was going to give the same answer steve gave for interpro and gene > finding results. > > For loading sequences into GUS, the dillema with option 2 is: how do > you know which sequence to load when you load (which is before you > actually have the similarity result)? One solution would be to > initially load complete dataset(s) but delete those without similarity > after loading similarity results. > > -Thomas > > On Fri, 11 Feb 2005, Steve Fischer wrote: > >> alberto- >> >> we've never loaded interpro, so there isn't a plugin. i believe >> plasmodb has loaded glimmer results, though i'm not sure. i have >> asked a plasmodb developer to answer that question. >> >> steve >> >> Alberto Davila wrote: >> >>> Hey Steve, Thomas, >>> >>> Thanks a lot for the tips, really helpful.. now, few more questions: >>> >>> >>>> ok. NR = NRDB >>>> >>>> the way we have used gus with similarities is that both the query >>>> and subject are loaded into gus. As thomas explained, the >>>> similarity table captures similarity between sequences that are in >>>> gus. our approach has always been to just load (warehouse) the >>>> entire subject database (NR, EST) that we are blasting against. >>>> >>>> the current plugins and blastSimilarity are set up for this. >>>> >>>> obviously, this takes a lot of disk space. two major efficiencies >>>> that we don't currently have plugins for would be: >>>> 1. to only store in gus a *reference* to the external sequence >>>> (ie, don't store the actgs). >>>> 2. only store in gus the sequences that actually have similarities >>>> >>> >>> Option 2 sound better for us, since we will be blasting against several >>> databases (> 10GB databases) >>> >>> What about the plugins to load Interpro and "gene finder" (glimmer, >>> etc) >>> results ? Is there any at all ? >>> >>> Cheers, Alberto >>> >>> >>>> steve >>>> >>>> Alberto Davila wrote: >>>> >>>> >>>>> All the blastable databases I mentioned are standard databases >>>>> from NCBI >>>>> (ftp://ftp.ncbi.nlm.nih.gov/blast/db/blastdb.txt): >>>>> >>>>> NT = nucleotides >>>>> >>>>> ~30000 entries from genbank (genbank format) are loaded into GUS now. >>>>> >>>>> Not sure about your "NRDB", I know NR from NCBI that is a >>>>> collection of >>>>> aminoacid entries, could it be the same ? >>>>> >>>>> Alberto >>>>> >>>>> On Fri, 2005-02-11 at 10:43 -0500, Steve Fischer wrote: >>>>> >>>>> >>>>> >>>>>> (what is NT?) >>>>>> >>>>>> which of these (genbank, your fasta, NRDB, NT, EST) have you >>>>>> loaded into gus? >>>>>> >>>>>> steve >>>>>> >>>>>> Alberto Davila wrote: >>>>>> >>>>>> >>>>>> >>>>>>> Query: >>>>>>> >>>>>>> Either sequences from genbank (genbank format) or sequences >>>>>>> generated in >>>>>>> the lab (fasta format) >>>>>>> >>>>>>> Blastable databases (all are formatted databases from NCBI): >>>>>>> >>>>>>> NR >>>>>>> NT >>>>>>> EST >>>>>>> >>>>>>> Alberto >>>>>>> >>>>>>> On Fri, 2005-02-11 at 10:34 -0500, Steve Fischer wrote: >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>>> for the blast, what are the query sequences and what are the >>>>>>>> blastable databases? >>>>>>>> >>>>>>>> steve >>>>>>>> >>>>>>>> Alberto Davila wrote: >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>>> Basically we will use sequences (loaded into GUS with the >>>>>>>>> GBParser) for >>>>>>>>> NCBI Blast (Blastx, Blastp and TBlastX), the same sequences >>>>>>>>> will be also >>>>>>>>> used for Interpro analyses. Results of both (Blast and >>>>>>>>> Interpro) will be >>>>>>>>> loaded into GUS. We will parse specific things from the Blast >>>>>>>>> results, I >>>>>>>>> would say: >>>>>>>>> >>>>>>>>> `Gi` `Accession` `Description` `E_value` `Score` `Length` >>>>>>>>> `Frame_Query` `Frame_Hit` `Identical` `Hsp_Frac_Identical` >>>>>>>>> `Conserved` `Hsp_Frac_Conserved` >>>>>>>>> `Query_Start` >>>>>>>>> `Query_End` `Hit_Start` `Hit_End` `Hsp_Align` >>>>>>>>> `database_letters` `database_entries` We already have a >>>>>>>>> Bioperl parser for that (specific for another system: >>>>>>>>> GARSA) that could be adapted to GUS, problem being we are not >>>>>>>>> sure what >>>>>>>>> tables should be used to store those data in GUS. >>>>>>>>> >>>>>>>>> Cheers, Alberto >>>>>>>>> >>>>>>>>> >>>>>>>>> On Fri, 2005-02-11 at 10:06 -0500, Steve Fischer wrote: >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>>> what are you planning on blasting? >>>>>>>>>> >>>>>>>>>> steve >>>>>>>>>> >>>>>>>>>> Alberto Davila wrote: >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> Hi Steve, >>>>>>>>>>> >>>>>>>>>>> On Fri, 2005-02-11 at 08:56 -0500, Steve Fischer wrote: >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> poliana- >>>>>>>>>>>> >>>>>>>>>>>> oops, the usage statement for LoadBlastSimFast is out of >>>>>>>>>>>> date. it should instruct you to use the blastSimilarity >>>>>>>>>>>> command. >>>>>>>>>>>> >>>>>>>>>>>> LoadBlastSimFast makes a big assumption, that the subject >>>>>>>>>>>> and query sequences are in GUS, and their def. lines have >>>>>>>>>>>> GUS primary keys. Are your sequences already loaded into GUS? >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>> They are not, there would be any howto/tips for that plugin >>>>>>>>>>> ? We will >>>>>>>>>>> certainly need a plugin to load "Interpro" and "ORF finding" >>>>>>>>>>> results >>>>>>>>>>> into GUS... If they are not available, then maybe we will >>>>>>>>>>> have to write >>>>>>>>>>> them ... >>>>>>>>>>> >>>>>>>>>>> Cheers, Alberto >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> steve >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> Poliana Mateus wrote: >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>>> Hello all, >>>>>>>>>>>>> >>>>>>>>>>>>> Where can find the script parseBlastFilesForSimilarity.pl?? >>>>>>>>>>>>> I'm trying to run LoadBlastSimFast... >>>>>>>>>>>>> >>>>>>>>>>>>> Poliana >>>>>>>>>>>>> >>>>>>>>>>>>> >>>>> >>>>> >>>>> >> >> >> ------------------------------------------------------- >> SF email is sponsored by - The IT Product Guide >> Read honest & candid reviews on hundreds of IT Products from real users. >> Discover which products truly live up to the hype. Start reading now. >> http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click >> _______________________________________________ >> Gusdev-gusdev mailing list >> Gus...@li... >> https://lists.sourceforge.net/lists/listinfo/gusdev-gusdev >> |
From: Y. T. G. <yon...@pc...> - 2005-02-11 15:57:57
|
Hi Alberto: When doing similarity analyses (BLASTs, BLAT) with GUS, here is the process we usually follow: 1: load query sequence set into GUS if not already in GUS (will get a GUS id for each sequence); same for target sequence set 2: dump from gus to get query fasta file and target fasta file using the script $GUS_HOME/bin/dumpSequencesFromTable.pl 3: run blast on computer cluster using the DistribJob software 4: run load plugin to load result into similarity table (DoTS.BlastSimilarity for BLAST, DoTS.BLATAlignment for BLAT alignments) In your case, I think you will need to load NR, NT as separate datasets into GUS (EST should already be in GUS) -Thomas On Fri, 11 Feb 2005, Alberto Davila wrote: > All the blastable databases I mentioned are standard databases from NCBI > (ftp://ftp.ncbi.nlm.nih.gov/blast/db/blastdb.txt): > > NT = nucleotides > > ~30000 entries from genbank (genbank format) are loaded into GUS now. > > Not sure about your "NRDB", I know NR from NCBI that is a collection of > aminoacid entries, could it be the same ? > > Alberto > > On Fri, 2005-02-11 at 10:43 -0500, Steve Fischer wrote: >> (what is NT?) >> >> which of these (genbank, your fasta, NRDB, NT, EST) have you loaded into >> gus? >> >> steve >> >> Alberto Davila wrote: >> >>> Query: >>> >>> Either sequences from genbank (genbank format) or sequences generated in >>> the lab (fasta format) >>> >>> Blastable databases (all are formatted databases from NCBI): >>> >>> NR >>> NT >>> EST >>> >>> Alberto >>> >>> On Fri, 2005-02-11 at 10:34 -0500, Steve Fischer wrote: >>> >>> >>>> for the blast, what are the query sequences and what are the blastable >>>> databases? >>>> >>>> steve >>>> >>>> Alberto Davila wrote: >>>> >>>> >>>> >>>>> Basically we will use sequences (loaded into GUS with the GBParser) for >>>>> NCBI Blast (Blastx, Blastp and TBlastX), the same sequences will be also >>>>> used for Interpro analyses. Results of both (Blast and Interpro) will be >>>>> loaded into GUS. We will parse specific things from the Blast results, I >>>>> would say: >>>>> >>>>> `Gi` >>>>> `Accession` >>>>> `Description` >>>>> `E_value` >>>>> `Score` >>>>> `Length` >>>>> `Frame_Query` >>>>> `Frame_Hit` >>>>> `Identical` >>>>> `Hsp_Frac_Identical` >>>>> `Conserved` >>>>> `Hsp_Frac_Conserved` >>>>> `Query_Start` >>>>> `Query_End` >>>>> `Hit_Start` >>>>> `Hit_End` >>>>> `Hsp_Align` >>>>> `database_letters` >>>>> `database_entries` >>>>> >>>>> We already have a Bioperl parser for that (specific for another system: >>>>> GARSA) that could be adapted to GUS, problem being we are not sure what >>>>> tables should be used to store those data in GUS. >>>>> >>>>> Cheers, Alberto >>>>> >>>>> >>>>> On Fri, 2005-02-11 at 10:06 -0500, Steve Fischer wrote: >>>>> >>>>> >>>>> >>>>> >>>>>> what are you planning on blasting? >>>>>> >>>>>> steve >>>>>> >>>>>> Alberto Davila wrote: >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>>> Hi Steve, >>>>>>> >>>>>>> On Fri, 2005-02-11 at 08:56 -0500, Steve Fischer wrote: >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>>> poliana- >>>>>>>> >>>>>>>> oops, the usage statement for LoadBlastSimFast is out of date. it >>>>>>>> should instruct you to use the blastSimilarity command. >>>>>>>> >>>>>>>> LoadBlastSimFast makes a big assumption, that the subject and query >>>>>>>> sequences are in GUS, and their def. lines have GUS primary keys. >>>>>>>> >>>>>>>> Are your sequences already loaded into GUS? >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>> They are not, there would be any howto/tips for that plugin ? We will >>>>>>> certainly need a plugin to load "Interpro" and "ORF finding" results >>>>>>> into GUS... If they are not available, then maybe we will have to write >>>>>>> them ... >>>>>>> >>>>>>> Cheers, Alberto >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>>> steve >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> Poliana Mateus wrote: >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>>> Hello all, >>>>>>>>> >>>>>>>>> Where can find the script parseBlastFilesForSimilarity.pl?? >>>>>>>>> I'm trying to run LoadBlastSimFast... >>>>>>>>> >>>>>>>>> Poliana > > > > > ------------------------------------------------------- > SF email is sponsored by - The IT Product Guide > Read honest & candid reviews on hundreds of IT Products from real users. > Discover which products truly live up to the hype. Start reading now. > http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click > _______________________________________________ > Gusdev-gusdev mailing list > Gus...@li... > https://lists.sourceforge.net/lists/listinfo/gusdev-gusdev > |
From: Steve F. <sfi...@pc...> - 2005-02-27 13:48:11
|
Jeetendra- sorry, i was on vacation. (i am posting this to gusdev) unfortunately, the plugin we use for that was written a while ago, specifically for our DoTS project. It calls CAP4 to do the assembling, and then writes the assemblies into the db. A better design would have separated the two steps (assembling and loading). If you'd like i could send you that plugin. Perhaps you could use it as a starting point. We'd be interested in a plugin that loaded assemblies, because we'll be retiring this plugin within the year probably. steve Jeetendra Soneja wrote: > > Hi Steve > > I have a set of assembly sequences that I would like to load into GUS. > GUS has Assembly and AssemblySequence tables for this. > > However, I couldn't find a plugin which does this. > > What do you think is the best way to load such data into GUS ? > > Thanks a lot, > Jeetendra. |
From: Alberto D. <da...@io...> - 2005-03-02 21:19:18
|
Hi Steve, If I may, I would like to have a look in your "loading assembly" plugin... maybe we could adapt it to work with CAP3 and the postgres install. Thanks, Alberto On Sun, 2005-02-27 at 08:50 -0500, Steve Fischer wrote: > Jeetendra- > > sorry, i was on vacation. (i am posting this to gusdev) > > unfortunately, the plugin we use for that was written a while ago, > specifically for our DoTS project. It calls CAP4 to do the assembling, > and then writes the assemblies into the db. A better design would have > separated the two steps (assembling and loading). > > If you'd like i could send you that plugin. Perhaps you could use it > as a starting point. We'd be interested in a plugin that loaded > assemblies, because we'll be retiring this plugin within the year probably. > > steve > > Jeetendra Soneja wrote: > > > > > Hi Steve > > > > I have a set of assembly sequences that I would like to load into GUS. > > GUS has Assembly and AssemblySequence tables for this. > > > > However, I couldn't find a plugin which does this. > > > > What do you think is the best way to load such data into GUS ? > > > > Thanks a lot, > > Jeetendra. |
From: Steve F. <sfi...@pc...> - 2005-02-11 15:32:14
|
for the blast, what are the query sequences and what are the blastable databases? steve Alberto Davila wrote: >Basically we will use sequences (loaded into GUS with the GBParser) for >NCBI Blast (Blastx, Blastp and TBlastX), the same sequences will be also >used for Interpro analyses. Results of both (Blast and Interpro) will be >loaded into GUS. We will parse specific things from the Blast results, I >would say: > > `Gi` > `Accession` > `Description` > `E_value` > `Score` > `Length` > `Frame_Query` > `Frame_Hit` > `Identical` > `Hsp_Frac_Identical` > `Conserved` > `Hsp_Frac_Conserved` > `Query_Start` > `Query_End` > `Hit_Start` > `Hit_End` > `Hsp_Align` > `database_letters` > `database_entries` > >We already have a Bioperl parser for that (specific for another system: >GARSA) that could be adapted to GUS, problem being we are not sure what >tables should be used to store those data in GUS. > >Cheers, Alberto > > >On Fri, 2005-02-11 at 10:06 -0500, Steve Fischer wrote: > > >>what are you planning on blasting? >> >>steve >> >>Alberto Davila wrote: >> >> >> >>>Hi Steve, >>> >>>On Fri, 2005-02-11 at 08:56 -0500, Steve Fischer wrote: >>> >>> >>> >>> >>>>poliana- >>>> >>>>oops, the usage statement for LoadBlastSimFast is out of date. it >>>>should instruct you to use the blastSimilarity command. >>>> >>>>LoadBlastSimFast makes a big assumption, that the subject and query >>>>sequences are in GUS, and their def. lines have GUS primary keys. >>>> >>>>Are your sequences already loaded into GUS? >>>> >>>> >>>> >>>> >>>They are not, there would be any howto/tips for that plugin ? We will >>>certainly need a plugin to load "Interpro" and "ORF finding" results >>>into GUS... If they are not available, then maybe we will have to write >>>them ... >>> >>>Cheers, Alberto >>> >>> >>> >>> >>> >>>>steve >>>> >>>> >>>> >>>>Poliana Mateus wrote: >>>> >>>> >>>> >>>> >>>> >>>>>Hello all, >>>>> >>>>>Where can find the script parseBlastFilesForSimilarity.pl?? >>>>>I'm trying to run LoadBlastSimFast... >>>>> >>>>>Poliana >>>>> >>>>> > > > |
From: <ju...@cs...> - 2005-02-17 20:55:39
|
Concerning the DoTS.Similarity table which has been discussed recently, what we at the Preuss lab are really hurting for is a way to distinguish different Blast analyses (different blast parameters) of the same subject and query sequences. A table as simple as: id number(10) name varchar2(255) parameters varchar2(255) so that one could store something as simple as: 'Param Set 001' '-p blastx -M BLOSUM50 -G 13 -E 2 -e .01 -W 3' and a way to link these entries to the rows of DoTS.Similarity, perhaps in the way that DoTS.NASequenceKeyword links rows of DoTS.NASequence and DoTS.Keyword, would work. If we were to go ahead and add the table to the schema, we might want to actually list the parameters individually with a comment string, perhaps like: name varchar2(255) ... expectation_threshold float(22) gap_penalty_existence number(10) gap_penalty_extension number(10) ... comment varchar2(255) Are not other people on the list missing the ability to distinguish blast results by parameters? Thanks, Josef Josef Jurek, Ph.D. Daphne Preuss Laboratory Molecular Genetics and Cell Biology The University of Chicago ju...@cs... voice: (773) 834-3985 fax: (773) 702-6648 |
From: Steve F. <sfi...@pc...> - 2005-02-17 20:59:20
|
luckily we are already adding that in 3.5 see AnalysisAlgorithm in the 3.5 roadmap: http://www.gusdb.org/wiki/index.php/Gus3.5RoadMap steve Josef Jurek wrote: >Concerning the DoTS.Similarity table which has been discussed >recently, what we at the Preuss lab are really hurting for >is a way to distinguish different Blast analyses (different >blast parameters) of the same subject and query sequences. >A table as simple as: > > id number(10) > name varchar2(255) > parameters varchar2(255) > >so that one could store something as simple as: > > 'Param Set 001' > '-p blastx -M BLOSUM50 -G 13 -E 2 -e .01 -W 3' > >and a way to link these entries to the rows of >DoTS.Similarity, perhaps in the way that >DoTS.NASequenceKeyword links rows of DoTS.NASequence >and DoTS.Keyword, would work. If we were to go ahead and >add the table to the schema, we might want to actually >list the parameters individually with a >comment string, perhaps like: > > > name varchar2(255) > ... > expectation_threshold float(22) > gap_penalty_existence number(10) > gap_penalty_extension number(10) > ... > comment varchar2(255) > >Are not other people on the list >missing the ability to distinguish >blast results by parameters? > >Thanks, Josef > > > >Josef Jurek, Ph.D. > >Daphne Preuss Laboratory >Molecular Genetics and Cell Biology >The University of Chicago >ju...@cs... > >voice: (773) 834-3985 >fax: (773) 702-6648 > > > > > > > >------------------------------------------------------- >SF email is sponsored by - The IT Product Guide >Read honest & candid reviews on hundreds of IT Products from real users. >Discover which products truly live up to the hype. Start reading now. >http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click >_______________________________________________ >Gusdev-gusdev mailing list >Gus...@li... >https://lists.sourceforge.net/lists/listinfo/gusdev-gusdev > > |
From: Alberto D. <da...@io...> - 2005-02-18 16:04:41
|
Hi Steve, Yesterday Poliana presented an internal seminar about GUS (focused on tables used by GBParser and GlimmerM), then some questions arised, maybe you (or someone else) could help: Why we need several "string" columns in "DoTs.NASequenceImp": DoTs.NASequenceImp string1 string2 string3 string4 string5 What should be stored in the columns of DoTs.FeatureImp : DoTs.FeatureImp int1 ... int6 tinyint1 ... tinyint6 float1 ... float6 text1 string1 ... string50 Finally, (I think this is for Bindu) does the current version of the "ImportPlasmoDBPrediction" plugin support more than one ORF predicted by GlimmerM ? in other words, GlimmerM usually present several rows of predicted ORFs, each one with a score probability, and there might be more than one ORF predicted... and more than one can be a valid ORF... Thanks, Alberto On Thu, 2005-02-17 at 15:59 -0500, Steve Fischer wrote: > luckily we are already adding that in 3.5 > > see AnalysisAlgorithm in the 3.5 roadmap: > http://www.gusdb.org/wiki/index.php/Gus3.5RoadMap > > steve > > Josef Jurek wrote: > > >Concerning the DoTS.Similarity table which has been discussed > >recently, what we at the Preuss lab are really hurting for > >is a way to distinguish different Blast analyses (different > >blast parameters) of the same subject and query sequences. > >A table as simple as: > > > > id number(10) > > name varchar2(255) > > parameters varchar2(255) > > > >so that one could store something as simple as: > > > > 'Param Set 001' > > '-p blastx -M BLOSUM50 -G 13 -E 2 -e .01 -W 3' > > > >and a way to link these entries to the rows of > >DoTS.Similarity, perhaps in the way that > >DoTS.NASequenceKeyword links rows of DoTS.NASequence > >and DoTS.Keyword, would work. If we were to go ahead and > >add the table to the schema, we might want to actually > >list the parameters individually with a > >comment string, perhaps like: > > > > > > name varchar2(255) > > ... > > expectation_threshold float(22) > > gap_penalty_existence number(10) > > gap_penalty_extension number(10) > > ... > > comment varchar2(255) > > > >Are not other people on the list > >missing the ability to distinguish > >blast results by parameters? > > > >Thanks, Josef > > > > > > > >Josef Jurek, Ph.D. > > > >Daphne Preuss Laboratory > >Molecular Genetics and Cell Biology > >The University of Chicago > >ju...@cs... > > > >voice: (773) 834-3985 > >fax: (773) 702-6648 > > > > |