SOAPfuse Wiki
a tool for identifying fusion transcripts from paired-end RNA-Seq data
Brought to you by:
jwl890427
To run SOAPfuse, we need to prepare the config file, and SOAPfuse will run based on the configuration.
$ cd /PATH_WHERE_YOU_PUT_THE_PACKAGE/SOAPfuse-vX.X/config/
$ less -S config.txt
DB_db_dir = /DATABASE_DIR/
PG_pg_dir = /TOOL_DIR/source/bin
PS_ps_dir = /TOOL_DIR/source
PD_all_out = /out_directory/
PA_all_fq_postfix = PostFix
* PD_all_out is the directory which you prepared to store all results of SOAPfuse. You can set it via
the option ('-o') of main program which is introduced below, and it has the higher priority. SOAPfuse
will creat the sub-directories of each step in out_directory automatically when it runs.
$ perl SOAPfuse-RUN.pl -c <config_file>
-fd <WHOLE_SEQ-DATA_DIR>
-l <sample_list>
-o <out_directory>
[Other Options]
-c [s] Config File for run this pipeline. <required>
-fd [s] Directory which stores Paired-end Sequenced Read Files. <required>
Sequenced Reads Format can be fastq or fasta.
Files could be compressed by gzip or just readable text-format.
-l [s] The information list of sample(s) you want to deal. <required>
This list can include infomation of one or more samples.
It is suggested to include one sample/patient in each sample list file.
-o [s] Directory which will store all results.
It has the first priority, or you should set 'PD_all_out' in config file.
-fs [i] The step you want to start from. [1]
-es [i] The step you want to end at. [9]
Step 9 is the last step of the SOAPfuse pipeline.
-tp [s] The name-postfix of temp directory*. [data +%s.'_'.int(rand(1000)+1)]
Donot set same string for different Sample-info-list files.
It is suggested to set this parameter as same as SampleID for distinguishing
the scripts of different samples easily in the general case that one
sample-info-list file just includes one sample.
-fm Sign to enable perl fork management. [disabled]
-h Display this help info.
Other Command:
$ perl SOAPfuse-RUN.pl -c version