Currently next generation sequencing (NGS) technologies are mainly used to sequence individuals. However, the high coverage required and the resulting costs may be
prohibitive for population scale studies. Sequencing pools of individuals instead may often be more cost effective and more accurate than sequencing individuals. PoPoolation is a pipeline for analysing pooled next generation sequencing data. PoPoolation builds upon open source tools (bwa, samtools) and uses standard file formats (gtf, sam, pileup) to ensure a wide compatibility. Currently PoPoolation allows to calculate Tajima’s Pi, Watterson’s Theta and Tajima’s D for reference sequences using a sliding window approach. Alternatively these population genetic estimators may be calculated for a set of genes (provided as gtf). One of the main challenges in population genomics is to identify regions of intererest on a genome wide scale. We believe that PoPoolation will greatly aid this task by allowing a fast and user friendly analysis of NGS data from DNA pools.
Please cite the following two paper
You may also be interested in our Pool-seq review (Nature Reviews Genetics) where we provide some recommendations for the analysis of Pool-seq data:
Gowinda: unbiased analysis of gene set enrichement (e.g: Gene Ontology) for Genome Wide Association Studies. Gowinda may thus be used for biological interpretation of the results of PoPoolation and PoPoolation2:
PoPoolation2: Allows analyzing the population frequencies of SNPs from two or more populations. It may be used to identify differentiation between populations or to analyze data from genome wide association studies.
PoPoolation TE2 A tool for comparing the transposable element (TE) abundance between sample, where samples could be pooled populations, tissues or sequenced individuals. It identifies novel as well as known TE insertions and reports the population frequencies of all TEs in all samples.
PoPoolation TE: A quick and simple pipeline for the analysis of transposable element insertion frequencies in populations from pooled next generation sequencing data. PoPoolation TE identifies TE insertions that are present in the reference genome as well as novel TE insertions and estimates their population frequencies. This also allows for an comparision of TE insertion frequencies between different populations
PoPoolation DB: A user-friendly web-based database for the retrieval of natural variation in Drosophila melanogaster
Wiki: Manual
Wiki: PoPOOLationWalkthrough
Wiki: TeachingPoPoolation