From: Don G. <gil...@bi...> - 2005-12-09 19:21:05
Scott,

Thanks; I'll add something like a 'make clean'. In fact a significant amount of effort for complex genome databases goes into checking and editing those initial sql dump files to correct them and ensure all genome features are properly represented (in my experience). So "make clean" has been less interesting than "make with intelligent corrections", which ends up being a person's effort rather than a software design issue.

One of the hidden issues with Chado genome databases is that, unless you are working with a very simply populated database (e.g. GFF input), and unless/until there is a standardized way to put everything into such a database so that software can follow it, it is a chore to know if you have extracted all relevant feature/sequence information. One also wants the ability to make corrections at this stage in producing data for public consumption - correcting non-standard (non-SO) terms, rearranging/correcting names, etc.

- Don

-- d.gilbert--bioinformatics--indiana-u--bloomington-in-47405
-- gil...@in...--http://marmot.bio.indiana.edu/