Re: [XMLPipeDB-developer] Trouble with Match
Brought to you by:
kdahlquist,
zugzugglug
From: Dionisio, J. D. N. <Joh...@lm...> - 2014-04-01 16:42:24
|
Hi Kevin, Thanks for the clarification on the Leishmania profiles. I'll move on with those. Meanwhile, on the match question, the likely reason here is that those ID matches are appearing in parts of the XML file that Tally Engine and PostgreSQL do not cover [by default]. The process to resolve this issue is to identify those "extra" IDs and characterize what makes them different. Where in the XML file do they appear? Are they actually synonym variations of IDs that we do capture (i.e., different only due to underscores or periods)? Based on what we have seen of these IDs so far, there is a chance that the latter is the case; let's make that our current hypothesis. Spot checks on a few IDs will do the trick. You can also do a full-scale check of the uncaptured IDs using Microsoft Excel. The procedure for doing that is in the wiki: http://xmlpipedb.sourceforge.net/wiki/index.php/Using_Microsoft_Excel_to_Compare_ID_Lists Hope that helps. Let me know if there are further questions. John David N. Dionisio, PhD Associate Professor, Computer Science Associate Director, University Honors Program Loyola Marymount University On Apr 1, 2014, at 9:14 AM, Kevin McGee <km...@li...> wrote: > Hey Dr. Dionisio, > > I am still having trouble getting match to find the correct number of id's. I have gotten tally engine and postgress both to work, but I cannot get match to find something remotely close. Heres a picture of my code and the number of matches I am getting > <xmlipedbmatch_4_1_2014_error.PNG>------------------------------------------------------------------------------ > _______________________________________________ > xmlpipedb-developer mailing list > xml...@li... > https://lists.sourceforge.net/lists/listinfo/xmlpipedb-developer |