Download Latest Version PAPA1.0_x86_64.tar.gz (3.3 MB)
Email in envelope

Get an email when there's a new version of papa

Home
Name Modified Size InfoDownloads / Week
Readme.txt 2015-11-03 4.9 kB
User guide.pdf 2015-11-03 117.1 kB
PAPA1.0_x86_64.tar.gz 2015-10-29 3.3 MB
Totals: 3 Items   3.4 MB 0
PAPA is a flexible tool for pleiotropic pathway analysis utilizing GWAS summary results.


1.Installing and Running PAPA
PAPA is developed by C to interface with R for efficient data analysis. Please make sure that R (http://www.r-project.org/) has been installed on your system. PAPA is a command line based program. Unzip the downloaded "PAPA" package and run "PAPA" program at your terminal window to start analysis.  
   Example:
   ./PAPA


2.Parameter Setting
PAPA will ask users to input a set of parameters before starting analysis:
[1]SNP-gene annotation file name: Please input the storage path and name of SNP-gene annotation file. We provide a SNP-gene annotation file, named ¡°SNPmap.txt¡± in PAPA package. ¡°SNPmap.txt¡± contains 419,513 entries. In this file, SNPs are assigned to a gene, if they located within the gene or < 500 kb upstream or downstream of the gene. Users¡¯ SNP-gene annotation files are also acceptable in PAPA. Please make your SNP-gene annotation files using following format:
   Example:
   rs2101576	AJAP1
   rs10915493	AJAP1
   rs10799221	AJAP1
   ....
        
   Each line of this file records a SNP (the first column) and corresponding gene (the second column). 

[2]Gene-pathway annotation file name: Please input the storage path and name of gene-pathway annotation file. We provide two pathway-gene annotation files in PAPA package. The first one (¡°PathmapGSEA.txt¡±) contains the pathways and gene ontology categories, derived from GSEA Molecular Signatures Database. The second one (¡°PathmapKEGG.txt¡±) contains the pathways and gene ontology terms, collected from KEGG, BioCarta, Ambion GeneAssist Pathway Atlas and Gene Ontology (GO) database. 
Users¡¯ gene-pathway annotation files are also acceptable in PAPA. Users can obtain pathway annotation information from public database, such as KEGG (http://www.genome.ad.jp/kegg/pathway.html), GSEA Molecular Signatures Database (http://www.broadinstitute.org/gsea/msigdb/ index.jsp), Gene Ontology (http://www.geneontology.org), Reactome (http://www.reactome.org/). Please make your gene-pathway annotation files using following format:
    Example:
    KEGG_GALACTOSE_METABOLISM	LALBA
    KEGG_GALACTOSE_METABOLISM	HK2
    KEGG_GALACTOSE_METABOLISM	HK1
    KEGG_GALACTOSE_METABOLISM	GLB1
    ....
    Each line of this file records a pathway (the first column) and corresponding gene (the second column)

[3]GWAS summary file name of phenotype 1: Please input the storage path  and name of GWAS summary file of phenotype 1. This file records the GWAS statistics of each SNP of phenotype 1. Each line of this file records a SNP (the first column) and corresponding association testing statistic (the second column) , for instance, chi-square values for qualitative traits.
   Example:
   rs3748597	0.524
   rs2465136	0.119
   rs13129	1.095
   rs9442372	0.919
   ....

[4]GWAS summary file name of phenotype 2: Please input the storage path and name of GWAS summary file of phenotype 2. This file records the GWAS statistics of each SNP of phenotype 2. Each line of this file records a SNP (the first column) and corresponding association testing statistic (the second column).
   Example:
   rs3748597	0.223
   rs2465136	0.321
   rs13129	1.019
   rs9442372	2.108
   ....

[5]Permuation times: Permutations are used for P value calculation in PAPA. More than 1,000 permutations are recommended for obtaining accurate P values. Note, too large permutation times will make PAPA taking a long time to complete data analysis.

[6]Max and min sizes of analyzed pathways: Users need to define the maximum (fault value = 200 genes) and minimum (fault value = 5 genes) sizes of pathways analyzed by PAPA. 

[7]Weight parameter of phenotype 1 and phenotype 2: Please input the weighting parameters of phenotype 1 and phenotype 2. 


3.Output files
PAPA will output two result files:
[1]Pathway_result.txt: Pleiotropic pathway analysis result files. Each line of this file records a pathway and corresponding pleitropy analysis results, including normalized centered enrichment score (NCES) value and P value.
   Example:
   Pathway name	NCES	P value
   BIOCARTA_ATM_PATHWAY	0.290	0.636
   BIOCARTA_CELLCYCLE_PATHWAY	0.353	0.363
   ....	
   NCES can be used to measure the strength of association between pathways and disease phenotypes. In PAPA, circular permutation approach (Cabrera, et al., 2012, G3, 2, 1067-1075) is used to calculate the emperical P value of NCES  of each pathway. The pathways with P values < 0.05 are considered as significant pathway associated with disease phenotypes. 

[2]Rplots.pdf: Plot of pleitropic pathway analysis results. In the generated figure, each point denotes a pathway. X-axis presents the total number of pathways analyzed by PAPA software. Y-axis shows ¨Clog10 (P values) calculated by PAPA.

 






Source: Readme.txt, updated 2015-11-03