Menu

Tree [r16] /
 History

HTTPS access


File Date Author Commit
 PosCount.java 2011-04-12 andrealtmann [r11] updated readme
 README.txt 2013-02-23 andrealtmann [r15] fixed small coding bug
 pileup2vipr.java 2011-01-12 andrealtmann [r3] speedup for java version, and support for .gz i...
 pileup2vipr.py 2011-01-12 andrealtmann [r2] added Java implementation for pileup2vipr
 skellam.R 2014-08-08 andrealtmann [r16] added skellam.R
 vipR.R 2014-08-08 andrealtmann [r16] added skellam.R

Read Me

Welcome to vipR!

This is the current Beta version of vipR. vipR
comprises two parts
 I) conversion of the simple pileup file generated by SAM tools
    into vipR's input format
II) detection of sequence variants.


For part I) we offer two implementations:
a) pileup2vipr.py is a python implementation, and a bit slow
b) pileup2vipr.java is a much faster Java implementation. Prior
   to execution, the java code must be compiled via
   > javac pileup2vipr.java

NOTE: some JAVA flavours might require a javac -C pileup2vipr.java

   then the program can be executed using
   > java pileup2vipr

Currently vipR can only handle the simple pileup format produced by samtools.
For achieving best results, use the latest version of samtools and execute 
following command:
samtools mpileup -d 1000000 -s -f REFERENCE.fasta INPUTFILE 
the option '-s' ensures that the 'simple' pileup format is generated, the option
'-d 1000000' sets the read depth limit to 1 million. The standard parameter is 250,
which might be a bit low for targeted resequencing experiments.

For part II) we offer a R/S script. In order to execute it, a
working R/S environment installed on your computer is required.
Simply execute 'source(vipR.R)' in your R/S environment, and
do the variant calling using the commands
vipr.loadData and vipr.run for loading your data and searching
for variants, respectively.

Basically, using vipR.R is a three step process:
> source("vipR.R")
> my.data <- vipr.loadData(c("POOL1","POOL2",...,"POOLn")) 
> vipr.run(my.data, nhap = #haplotypes, ofname="myoutputfile.vcf")
where POOL1,...,POOLn correspond to the vipR input files, generated in step I) and
#haplotypes is the number of haplotypes per pool.

If there are questions feel free to contact me:
altmann@mpipsykl.mpg.de or
altmann@stanford.edu


Have fun!

Andre
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.