Download Latest Version cpdbtutorial-code-ro.2023.zip (13.3 MB)
Email in envelope

Get an email when there's a new version of CpDB tutorial

Home
Name Modified Size InfoDownloads / Week
cpdbtutorial-code-ro.2023.zip 2023-10-03 13.3 MB
README.md 2023-05-22 1.8 kB
Totals: 2 Items   13.3 MB 0

Introduction

This software allows us to create a relational database in PostgreSQL hosting full bacterial genomes. Besides the database, there is software, like a parser, to convert EMBL or GBK files to the CpDB relational schema. Once in the CpDB, one can extract unlimited reports from bacterial genomes using SQL. This software is part of the Ph.D. in Bioinformatics from Anderson Santos developed under the Corynebacterium pseudotuberculosis (Cp) pangenome project. The Cp pangenome delivered to the scientific community fifteen bacterial strains deposited at the GenBank database between the years of 2009 and 2012. The thesis was written in Brazilian Portuguese. However, an English book chapter explaining the software is available at this address. CpDB is the backbone for the Pannotator software. Both software still is alive and kicking.

Downloading

svn checkout svn://svn.code.sf.net/p/cpdbtutorial/code/ cpdbtutorial-code

Note: The download zip is a copy of this above checkout result.

Installing

For Ubuntu 10 OS or higher: 1. Install flex:

sudo apt install flex
  1. Install bison
sudo apt install bison
  1. Install build-essential package:
sudo apt install build-essential

Compiling

  1. This project has four different parsers: GO, GBK, and two first-hit parsers. To compile each one goes to the respective directory and type:
./make

Enjoy it.

BUG 1) Large EMBL/Genbank text qualifiers are not supported (>255 characters). They certainly will throw a stack overflow. To walk around: just remove from the EMBL/Genbank target file those text qualifiers containing texts greater than 255 characters.

Source: README.md, updated 2023-05-22