Menu

README

readme (1)
Denis Bertrand Burton Chia Chayaporn Suphavilai

What is it?

OncoIMPACT is a first-in-class algorithmic framework that nominates patient-specific driver genes by integrative modeling of genomic mutations (point, structural and copy-number) and the resulting perturbations in transcriptional programs via defined molecular networks.

OncoIMPACT is configured to run in two modes: (1) a database mode that allows it to determine parameter settings from the data sets provided and (2) a discovery mode where information in the provided database is used to predict driver genes for samples in an additional data set from the same cancer type.


Installation

Download and uncompress the latest version
tar -zxvf oncoIMPACT.tgz
chmod +x oncoIMPACT_v0.9.3/*.pl


Usage

Before running the pipeline, please ensure that you have the following files ready.

Input file format

  • CNV_data: a tab-separated file with samples as columns and genes as rows. Each entry of this matrix must hold either a value of -1/0/1. A value of -1 would indicate a DELETION event while a value of 1 will indicate an AMPLIFICATION event, and a value of 0 if the is not affected by copy number gene.

  • SNP_data (point mutations and short indels): a tab-separated file with samples as columns and genes as rows. Each entry of this matrix must hold either the binary values of 0/1. A value of 1 would indicate the presence of a SNP in the gene of that particular sample while a value of 0 would indicate otherwise.

  • EXP_data: a tab-separated file with samples as columns and genes as rows. Each entry of this matrix would represent the log2 fold-change of the gene expression of the tumor compared to a normal control.

Config file
You will need to create a configuration file for your specific project. The configuration file needs to contain the following parameters (you may refer to the sample configuration file provided with the scripts under <Scripts Directory/sampleConfig.cfg>).

  • outDir: Full path to destination folder

  • scriptDir: Full path to folder where oncoIMPACT is installed

  • numThreads: Number of threads to use

  • cnv: Full path to CNV data matrix

  • exp: Full path to expression data matrix

  • dataType: Flag for expression data type. Valid options: ARRAY (default), RNA_SEQ

  • snp: Full path to SNP data matrix

  • dataBase: Full path to the pre-computed database (implies discovery mode only)

  • databaseExport: Full path where the database will be exported (implies database + discovery mode)

  • testMode: Boolean flag to toggle test mode (valid options: 0 / 1)

When you are ready to run the OncoIMPACT pipeline, simply enter the following command
oncoIMPACT.pl <path to configuration file>


Sample data

We have provided 1 sample datasets for to test the OncoIMPACT pipeline;

  • Glioblastoma.tgz: Glioblastoma (GBM) TCGA dataset

  • Glioblastoma_database.tgz: pre-computed database of the TCGA GBM dataset

  • single_patient.tgz: data for a single GBM patient (to test the discovery mode)


Change Log

v0.9.3:

  • NEW: Enable the processing of RNA-seq data

  • NEW: Enable the construction of databases

  • NEW: Enable the discovery mode using a pre-computed database

v0.9.2:

  • NEW: An option in configuration file to run OncoIMPACT in test mode which performs the simulation with fewer iterations and fixed seed. In this mode, OncoIMPACT should complete in less than 2 hours using a single thread.

  • NEW: Sanity checks to ensure validity of parameters provided by user

  • FIX: Improved disk space utilization

  • FIX: Improved compatibility with Mac OS.

v0.9.1:

  • oncoIMPACT will now avoid reproducing the input files if COMPLETE_SAMPLES folder exists

  • fix for bugs introduced in last version


Licensing

The MIT License (MIT)
Copyright (c) 2014 Genome Institute of Singapore


Citing OncoIMPACT

Bertrand et. al.. Patient-specific driver gene prediction and risk assessment through integrated network analysis of cancer omics profiles. Nucleic Acid Research 2015, 43 (7): e44


Contacts

If you want community-driven support, please visit the forum at https://sourceforge.net/p/oncoimpact/discussion/.

If you have a bug to report, you may raise a ticket at https://sourceforge.net/p/oncoimpact/tickets/.

If you have other questions or feedback, you may direct them to Denis Bertrand (bertrandd@gis.a-star.edu.sg) and Burton Chia (chiakhb@gis.a-star.edu.sg).


MongoDB Logo MongoDB