Re: [XMLPipeDB-developer] Plasmodium bug/task list
Brought to you by:
kdahlquist,
zugzugglug
From: Richard B. <rbr...@gm...> - 2011-03-25 21:55:48
|
will do. i'll send out an email when completed. On Fri, Mar 25, 2011 at 11:30 AM, Kam Dahlquist <kda...@lm...> wrote: > Hi, > > I've checked the gdb and it looks OK. Richard, will you release a build of > GenMAPP Builder with these changes so that I can test the export on one of > my machines? I am going to work on the ReadMe for this gdb so we can > release it. > > Once I've validated that the new version of GenMAPP Builder works, we'll be > ready to move on to the code merge. > > Best, > Dr. D > > > At 09:15 PM 3/24/2011, you wrote: > > Sorry for the late reply... been laid up with a cold and barely functioning > on cold medicine... > > I believe I have the Pfalciparum solved by using the following code: > > > /** > > * > > *@see* edu.lmu.xmlpipedb.gmbuilder.databasetoolkit.profiles.UniProtSpeciesProfile#getSystemTableManagerCustomizations(edu.lmu.xmlpipedb.gmbuilder.databasetoolkit.tables.TableManager, > edu.lmu.xmlpipedb.gmbuilder.databasetoolkit.tables.TableManager, > java.util.Date) > > */ > > @Override > > *public* TableManager getSystemTableManagerCustomizations(TableManager > tableManager, TableManager primarySystemTableManager, Date version) * > throws* SQLException, InvalidParameterException { > > // Start with the default OrderedLocusNames behavior. > > TableManager result = > *super* .getSystemTableManagerCustomizations(tableManager, > primarySystemTableManager, version); > > // Next, we add IDs from the other gene/name tags, but ONLY if they match > > // the pattern PF[A-Z][0-9]{4}[a-z]. > > //final String pfID = "PF[A-Z][0-9][0-9][0-9][0-9][a-z]"; > > //final String pfID2 = "PF[0-9][0-9]_[0-9][0-9][0-9][0-9]"; > > //final String pfID3 = "MAL[0-9]*P1.[0-9]*"; > > String sqlQuery = > "select d.entrytype_gene_hjid as hjid, c.value " + > > "from genenametype c inner join entrytype_genetype d " + > > "on (c.entrytype_genetype_name_hjid = d.hjid) " + > > //"where (c.value similar to ? " + > > "where c.type = 'ORF'" + > > //"or c.value similar to ? " + > > //"or c.value similar to ?) " + > > //"and type <> 'ordered locus names' " + > > //"and type <> 'ORF' " + > > "group by d.entrytype_gene_hjid, c.value"; > > String dateToday = GenMAPPBuilderUtilities.*getSystemsDateString* > (version); > > Connection c = ConnectionManager.*getRelationalDBConnection*(); > > PreparedStatement ps; > > ResultSet rs; > > *try* { > > // Query, iterate, add to table manager. > > ps = c.prepareStatement(sqlQuery); > > //ps.setString(1, pfID); > > //ps.setString(2, pfID2); > > //ps.setString(3, pfID3); > > rs = ps.executeQuery(); > > *while* (rs.next()) { > > String hjid = Long.*valueOf*(rs.getLong( > "hjid" )).toString(); > > // capture unmodified value in string variable id > > // We want to remove the '_' here but also add id's with the '_' > > String id = rs.getString( > "value" ); > > // condition if id matched pattern "PFA_" > > *if* (id.matches("PFA_.*" )) { > > String[] substrings = id.split("/" ); > > String new_id = *null*; > > String old_id = id; > > *for* (*int* i = 0; i < substrings.length ; i++) { > > new_id = substrings[i].replace("_" , ""); > > *_Log*.debug( "Remove '_' from " + id + " to create: " + new_id + " for > surrogate " + hjid); > > result.submit( "OrderedLocusNames", QueryType.*insert* , *new* String[][] > { { "ID", new_id }, { "Species" , "|" + getSpeciesName() + "|" }, { > "\"Date\"" , dateToday }, { "UID", hjid } }); > > *_Log*.debug( "Keep '_' from " + id + " to create: " + old_id + " for > surrogate " + hjid); > > result.submit( "OrderedLocusNames", QueryType.*insert* , *new* String[][] > { { "ID", old_id }, { "Species" , "|" + getSpeciesName() + "|" }, { > "\"Date\"" , dateToday }, { "UID", hjid } }); > > } > > } > > // otherwise process as normal > > *else* { > > *_Log* .debug("Processing raw ID: " + id + " for surrogate " + hjid); > > tableManager.submit( > "OrderedLocusNames" , QueryType.*insert* , *new* String[][] { { "ID", id > }, { "Species" , "|" + getSpeciesName() + "|" }, { "\"Date\"" , dateToday > }, { "UID", hjid } }); > > } > > } > > } > *catch* (SQLException sqlexc) { > > logSQLException(sqlexc, sqlQuery); > > } > > *return* result; > > } > > I have exported a gdb which contains 5472 gene id's as compared to the > previous 5338. The 134 gene id's with the prefix PFA_ are included plus > their underscore removed form. > > Zipped up and attached here and will later post to the wiki. > > Richard > > > > ------------------------------------------------------------------------------ > Enable your software for Intel(R) Active Management Technology to meet the > growing manageability and security demands of your customers. Businesses > are taking advantage of Intel(R) vPro (TM) technology - will your software > be a part of the solution? Download the Intel(R) Manageability Checker > today! http://p.sf.net/sfu/intel-dev2devmar > _______________________________________________ > xmlpipedb-developer mailing list > xml...@li... > https://lists.sourceforge.net/lists/listinfo/xmlpipedb-developer > > |