| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Readme.txt | 2015-11-03 | 4.9 kB | |
| User guide.pdf | 2015-11-03 | 117.1 kB | |
| PAPA1.0_x86_64.tar.gz | 2015-10-29 | 3.3 MB | |
| Totals: 3 Items | 3.4 MB | 0 |
PAPA is a flexible tool for pleiotropic pathway analysis utilizing GWAS summary results.
1.Installing and Running PAPA
PAPA is developed by C to interface with R for efficient data analysis. Please make sure that R (http://www.r-project.org/) has been installed on your system. PAPA is a command line based program. Unzip the downloaded "PAPA" package and run "PAPA" program at your terminal window to start analysis.
Example:
./PAPA
2.Parameter Setting
PAPA will ask users to input a set of parameters before starting analysis:
[1]SNP-gene annotation file name: Please input the storage path and name of SNP-gene annotation file. We provide a SNP-gene annotation file, named ¡°SNPmap.txt¡± in PAPA package. ¡°SNPmap.txt¡± contains 419,513 entries. In this file, SNPs are assigned to a gene, if they located within the gene or < 500 kb upstream or downstream of the gene. Users¡¯ SNP-gene annotation files are also acceptable in PAPA. Please make your SNP-gene annotation files using following format:
Example:
rs2101576 AJAP1
rs10915493 AJAP1
rs10799221 AJAP1
....
Each line of this file records a SNP (the first column) and corresponding gene (the second column).
[2]Gene-pathway annotation file name: Please input the storage path and name of gene-pathway annotation file. We provide two pathway-gene annotation files in PAPA package. The first one (¡°PathmapGSEA.txt¡±) contains the pathways and gene ontology categories, derived from GSEA Molecular Signatures Database. The second one (¡°PathmapKEGG.txt¡±) contains the pathways and gene ontology terms, collected from KEGG, BioCarta, Ambion GeneAssist Pathway Atlas and Gene Ontology (GO) database.
Users¡¯ gene-pathway annotation files are also acceptable in PAPA. Users can obtain pathway annotation information from public database, such as KEGG (http://www.genome.ad.jp/kegg/pathway.html), GSEA Molecular Signatures Database (http://www.broadinstitute.org/gsea/msigdb/ index.jsp), Gene Ontology (http://www.geneontology.org), Reactome (http://www.reactome.org/). Please make your gene-pathway annotation files using following format:
Example:
KEGG_GALACTOSE_METABOLISM LALBA
KEGG_GALACTOSE_METABOLISM HK2
KEGG_GALACTOSE_METABOLISM HK1
KEGG_GALACTOSE_METABOLISM GLB1
....
Each line of this file records a pathway (the first column) and corresponding gene (the second column)
[3]GWAS summary file name of phenotype 1: Please input the storage path and name of GWAS summary file of phenotype 1. This file records the GWAS statistics of each SNP of phenotype 1. Each line of this file records a SNP (the first column) and corresponding association testing statistic (the second column) , for instance, chi-square values for qualitative traits.
Example:
rs3748597 0.524
rs2465136 0.119
rs13129 1.095
rs9442372 0.919
....
[4]GWAS summary file name of phenotype 2: Please input the storage path and name of GWAS summary file of phenotype 2. This file records the GWAS statistics of each SNP of phenotype 2. Each line of this file records a SNP (the first column) and corresponding association testing statistic (the second column).
Example:
rs3748597 0.223
rs2465136 0.321
rs13129 1.019
rs9442372 2.108
....
[5]Permuation times: Permutations are used for P value calculation in PAPA. More than 1,000 permutations are recommended for obtaining accurate P values. Note, too large permutation times will make PAPA taking a long time to complete data analysis.
[6]Max and min sizes of analyzed pathways: Users need to define the maximum (fault value = 200 genes) and minimum (fault value = 5 genes) sizes of pathways analyzed by PAPA.
[7]Weight parameter of phenotype 1 and phenotype 2: Please input the weighting parameters of phenotype 1 and phenotype 2.
3.Output files
PAPA will output two result files:
[1]Pathway_result.txt: Pleiotropic pathway analysis result files. Each line of this file records a pathway and corresponding pleitropy analysis results, including normalized centered enrichment score (NCES) value and P value.
Example:
Pathway name NCES P value
BIOCARTA_ATM_PATHWAY 0.290 0.636
BIOCARTA_CELLCYCLE_PATHWAY 0.353 0.363
....
NCES can be used to measure the strength of association between pathways and disease phenotypes. In PAPA, circular permutation approach (Cabrera, et al., 2012, G3, 2, 1067-1075) is used to calculate the emperical P value of NCES of each pathway. The pathways with P values < 0.05 are considered as significant pathway associated with disease phenotypes.
[2]Rplots.pdf: Plot of pleitropic pathway analysis results. In the generated figure, each point denotes a pathway. X-axis presents the total number of pathways analyzed by PAPA software. Y-axis shows ¨Clog10 (P values) calculated by PAPA.