Download Latest Version CMASA.rar (50.3 MB)
Email in envelope

Get an email when there's a new version of CMASA

Home
Name Modified Size InfoDownloads / Week
README.txt 2010-09-10 4.8 kB
CMASA.rar 2010-09-10 50.3 MB
Totals: 2 Items   50.3 MB 0
CMASA:contact matrix based local structure alignment version 1.0:

Please cite:
  Li GH, Huang JF CMASA: an accurate algorithm for detecting local protein 
  structural similarity and its application to enzyme catalytic site annotation. 
    BMC Bioinformatics. 2010 Aug 27;11(1):439.

Copyright Holder:
CMASA_pro_vs_site  Copyright (c) 2009-2010 Gong-Hua Li, Ph.D
                   State Key Laboratory of Genetic Resources and Evolution
                   Kunming Institute of Zoology
		   Chinese Academy of Sciences
CMASA_site_vs_pro  Copyright (c) 2009-2010 Gong-Hua Li
                   State Key Laboratory of Genetic Resources and Evolution
                   Kunming Institute of Zoology
		   Chinese Academy of Sciences


License: LGPL, GPL,GNU Lesser General Public License


Installation:

For windows:
       CMASA_pro_vs_site.c and CMASA_site_vs_pro.c have been compiled by GUN MingWin G++,
    the excuatable files are:
    
    CMASA_pro_vs_site.exe   and 
    CMASA_site_vs_pro.exe

    This two excuatable files have been tested in win32 for XP, window2003. 

For linux:
      CMASA_pro_vs_site.c and CMASA_site_vs_pro.c need GUN gsl(GNU Scientific Library,
    which is a free numerical library for C and C++ programmers) for compling.
    
    Gun gsl can be installed  by http://www.gnu.org/software/gsl/.

    If you are using ubuntu, it is luck!
    You can just type the following command to install GUN gsl:

      sudo apt-get install libgsl0-dev

    After the GNU gsl is installed, you can type the following command to compile:
    for CMASA_pro_vs_site.c  type:

      g++ CMASA_pro_vs_site.c -o CMASA_pro_vs_site /usr/lib/libgsl.a /usr/lib/libgslcblas.a 

    for CMASA_site_vs_pro.c type:

      g++ CMASA_site_vs_pro.c -o CMASA_site_vs_pro /usr/lib/libgsl.a /usr/lib/libgslcblas.a 

    There may be some warnings, it does not matter.

    If there are still some troubles for compling CMASA, you can try the pre compiled excuable files or
     you can e-mail to me: ligonghua_88@yahoo.com.cn



Aim of the CMASA:
    CMASA_site_vs_pro:   is to used when a active site search to a nrpdb database or nrscop database.
       This programm is to find the remote homologues or the functional convergence.
    
    CMASA_pro_vs_site:  is to predict the function of a protein by searching to a active site database,
       for example: csa database for predicting enzyme catalytic sites.
    

Database:
    CMASA use a specified database format. The database should contain the main file and the infor file.
    for example, the nrscop database should contain the two files "nrscop" and "nrscop.infor".
    In this downloadable version, two databases are specified.

    nrscop:  this database contains all of the specials level structures of scop1.75.
             Total 14519 structures.

    csa: this database is directly from CSA, you can visit:
        http://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/CSA/CSA_Download.pl for more information.
	This csa database contains all of the sites of the version 2.2.11 and  just remove the multiple same chains
	in the same pdb file.
	Some csa sites are removed:
	1. number of residues of catalytic sites < 3
	2. 4 residues but 2 is "GLY".
        
	some catalytic sites >5, the most conserved 5 residues is selected.
	the conserve information are from Consurf database(http://consurf.tau.ac.il/)

	Total 15341 structures.


How to use:
     CMASA accept the following inputs:
     -i  query pdbfile, needed.
     -d  database name,needed.
     -o  output file  defult is stdout
     -t  output format
         F  text format
	 T  html format, defult
     -c  cmad cutoff value defult is 1.2
     -s  substitute file, defult is exact match.

     Please type the following comand to test CMASA:

        CMASA_pro_vs_site -i 1mctA.pdb -d csa -o 1mctA_csa.html

        CMASA_pro_vs_site -i 1mctA.pdb -d csa -o 1mctA_csa_blosum62.html -s blosum62.txt

        CMASA_site_vs_pro -i 1mct_site.pdb -d nrscop -o 1mct_site_nrscop.html

        CMASA_site_vs_pro -i 1mct_site.pdb -d nrscop -o 1mct_site_nrscop_blosum62.html -s blosum62.txt




How to define a substitute file:
   1. Create a file for example "mysub.txt"
   2. type the substitute amino types in a line,
      for example, "D" and "E", "R" and "K" can be substituted each other
      you just add the following two lines in the mysub.txt

      DE
      RK

   3. save file.


Can we use my own database?
    In this downloadable version, it is no.
    But in the next version(will be finished shortly), the "makedb" will be generated, 
    then you may create the own database to run CMASA.
    Let's expect the next version.
    
       
    

     



	
    
    

Source: README.txt, updated 2010-09-10