Here' a list from RGD. It includes a lot of what we use now (plus some we are planning) but I think it wouold be appropriate for most organisms. (I'm also attaching the file.) DATABASE ID SYMBOL & NAME ALIAS_SYMBOL & NAME DESCRIPTION - brief (~250 char) text; basic info, including function, process, disease, etc.; why someone would be interested in the gene SEQUENCE ACC IDs with links GenBank/EMBL/DDBJ mRNA protein genomic contig unigene Refseq (mRNA and protein) Primers REFERENCES Curated - database ID and other IDs, e.g., PubMed Others EXTERNA.L DATABASE IDs and LINKS Uniprot Ensembl Entrez gene OMIM Kegg Reactome BIND etc. MAPPING DATA Chromosome Band Position on specific maps (genetic, RH) Genomic position - start and stop plus build Links to images such as browser images (we actually show the browser image in addition to links.) RELATED GENES Splice variants Pseudogenes Alleles etc. OTHER DB OBJECTS ASSOC. WITH THE GENE Strains QTL ESTs SSLP etc. HOMOLOGS GENE TYPE Splice variant Pseudogene Alleles GENE EVIDENCE Known Predicted by homology Predicted by EST evidence etc. BIOLOGICAL DATA (CONTROLLED VOCAB/ONTOLOGIES/TEXT) Biological Process Cellular Component Disease Domains Drugs effects EC number Evolution Expression Family Homology details Molecular Function Mutation/over- and underexpression Pathway Phenotype Physical interaction Protein modifications Regulation Size Transcript structure etc <<gene page.doc>> Susan K. Bromberg, Ph.D. Curation Team Leader Rat Genome Database (http://www.rgd.mcw.edu) Bioinformatics Research Center (http://www.brc.mcw.edu) Medical College of Wisconsin 8701 Watertown Plank Rd. Milwaukee, WI 53226 414-456-7515 sbromberg@...
Sign up for the SourceForge newsletter:No, thanks