A national virtual bridge engineering contest for kids of all ages.
**Unfortunately, the Bridge Contest is no longer operational. We hope the code base will be of use to others with similar projects in mind.** A national virtual bridge engineering contest with prizes for middle and high school kids, grades 6 to 12, but anyone can enter for fun. Based on the award-wining Bridge Designer client software, now open source through this project. Web site infrastructure supports worldwide participation in qualifying and semifinal rounds with a tiny administrative staff. The new Rails back end is a RubyMine project. Thanks to JetBrains for this mega development environment!
A set of tools for working with VCF files, such as those generated by the 1000 Genomes Project. This project is migrating to github: https://vcftools.github.io/
MSLB is a crossplatform Simple Logbook program / application to write work logs or even a diary. MSLB can also use a bunch of presets that you can add with a simple doubleclick :) Logs get saved in a simple folder system. And its all portable.
CSBB - Computational Suite for Bioinformaticians and Biologists
CSBB is a command line-based bioinformatics suite to analyze biological data acquired through varied avenues of biological experiments. CSBB is implemented in Perl, while it also leverages the use of R, java, python and ruby in background for specific modules. Major focus of CSBB is to allow users from biology and bioinformatics community, to get benefited by performing down-stream analysis tasks while eliminating the need to write programming code. CSBB is currently available on Linux, UNIX. Currently CSBB provides 18 modules focused on analytical tasks like performing upper-quantile normalization, interactive visualization and next generation sequencing pipelines. CSBB now also has capability to process public data. Providing User with End to End pipeline experience.
A multi-purpose extensible self-adaptive evolutionary algorithm
MicroGP (uGP) is a versatile optimizer able to outperform both human experts and conventional heuristics in finding the optimal solution of hard problems. It is an evolutionary algorithm since it mimics some principles of the Neo-Darwinian paradigm.
Perl Web Scraping Project
Web scraping (web harvesting or web data extraction) is data scraping used for extracting data from websites. Web scraping software may access the World Wide Web directly using the Hypertext Transfer Protocol, or through a web browser. While web scraping can be done manually by a software user, the term typically refers to automated processes implemented using a bot or web crawler. It is a form of copying, in which specific data is gathered and copied from the web, typically into a central local database or spreadsheet, for later retrieval or analysis. Web scraping a web page involves fetching it and extracting from it. Fetching is the downloading of a page (which a browser does when you view the page). Therefore, web crawling is a main component of web scraping, to fetch pages for later processing. Once fetched, then extraction can take place. The content of a page may be parsed, searched, reformatted, its data copied into a spreadsheet, and so on.
Pan-genome Ortholog Clustering Tool
PanOCT, Pan-genome Ortholog Clustering Tool, is a program written in PERL for pan-genomic analysis of closely related prokaryotic species or strains. Unlike traditional graph-based ortholog detection programs, it uses micro synteny or conserved gene neighborhood (CGN) in addition to homology to accurately place proteins into orthologous clusters.
ForeRunner X01 toolkit provides an easy access to Forerunner 201/301/305/305E GPS-units. There are multiple command line tools to read data from GPS into database and tools to export data in different formats like gpx and tcx.
Ensembl Easy Sequence Retriever
Maldonado E, Khan I, Philip S, Vasconcelos V, Antunes A (2013) EASER: Ensembl Easy Sequence Retriever. Evolutionary Bioinformatics, 9:487-490. doi: 10.4137/EBO.S11335.
Plasmodium Typing Utility Software
The goal of PlaTypUS is to establish a unified Plasmodium whole genome analysis tool, that aligns short read sequences in an agreed upon manner, with many quality control steps, and calls both SNVs and CNVs in a community agreed standard way. We also seek to provide this analysis in a stand-alone graphic user interface so that labs around the world can analyze their own whole genome sequencing data, rather than relying on outside institutes.
Simple Scientific Workflow System for CAGE Analysis
Cap analysis of gene expression (CAGE) is a sequencing based technology to capture the 5’ ends of RNAs in a biological sample. After mapping, a CAGE peak on the genome indicates the position of an active transcriptional start site (TSS) and the number of reads correspond to its expression level. CAGE is prominently used in both the FANTOM and ENCODE project. MOIRAI is a compact yet flexible workflow system designed to carry out the main steps in data processing and analysis of CAGE data. MOIRAI has a graphical interface allowing wet-lab researchers to create, modify and run analysis workflows. Embedded within the workflows are graphical quality control indicators allowing users assess data quality and to quickly spot potential problems. MOIRAI package comes with three main workflows allowing users to map, annotate and perform an expression analysis over multiple samples.
State Dependency deconstructor and mapper for C source code.
Flowcharts not telling you what you want to know about some C source code? Try a State Dependency map! Every C source file is an implemention of a state machine. Implicit in the definition of a state machine is a network of dependencies between the states. The Automalator * deconstructs the C code into the core state machine * collates the transactions into the dependency net * generates a diagram-format file with the states and dependencies as the nodes and edges. Pre-requisites * Perl - tested with versions 5.12.3, 5.14.2 and 5.20.2, Windows, Linux and MAC * Get the Automalator by downloading the zip, or checking out or exporting the SVN trunk * copy the source code of interest into "project.c" * windows - double-click "src2map.bat" * linux & MAC - navigate the command line to the "project" folder, then run "src2map.sh" * open "project.gml" file with diagramming software.
Lightweight Multigene Analyses in PAML
Maldonado E, Almeida D, Escalona T, Khan I, Vasconcelos V and Antunes A (2016) LMAP: Lightweight Multigene Analyses in PAML. BMC Bioinformatics, 17:354. doi: 10.1186/s12859-016-1204-5
Various tools for creating annotated parallel corpora including pre-trained tagging and parsing models for various languages, sentence alignment tools and word alignment tools. Uplug also includes a web-based interface for interactive sentence and word alignment and scripts for indexing and querying parallel corpora using the Corpus Work Bench CWB. Download 'uplug-main' first and then add other packages.
PheMaDB is a web-based data management system to store and analyze OmniLog Phenotype Microarray data. The manuscript can be accessed here: http://www.biomedcentral.com/1471-2105/12/109. Chang WE et al. BMC Bioinformatics. 2011 Apr 20;12(1):109.
A pipeline to define allele-specific genomic features
Detecting allelic biases from high-throughput sequencing data requires an approach that maximises sensitivity while minimizing false positives. Here we present Allelome.PRO, an automated userfriendly bioinformatics pipeline, which uses high-throughput sequencing data from reciprocal crosses of two genetically distinct mouse strains to detect allele-specific expression and chromatin modifications. Allelome.PRO extends approaches used in previous studies that exclusively analysed imprinted expression to give a complete picture of the “allelome” by automatically categorising the allelic expression of all genes in a given cell type into imprinted, strain-biased biallelic or noninformative. Allelome.PRO offers increased sensitivity to analyse lowly expressed transcripts, together with a robust false discovery rate empirically calculated from variation in the sequencing data.
A reference implementation of the Polish geobotanical grid
This project is a reference, open source implementation of the coordinate conversion algorithms for the Polish geobotanical grid called ATPOL.
Beer festival cellar management database and related software.
The BeerFest DB project will be designed as a reusable software package to help organize beer festivals and record information concerning past events. Initially it will focus on tracking cask information (e.g., ordering, sales, quality control).
A national virtual bridge engineering contest for kids of all ages.
A national virtual bridge engineering contest with prizes for middle and high school kids, grades 6 to 12, but anyone can enter for fun. Based on the award-wining Bridge Designer client software, now open source through this project. Web site infrastructure supports worldwide participation in qualifying and semifinal rounds with a tiny administrative staff. The new Rails back end is a RubyMine project. Thanks to JetBrains for this awesome development environment!
Bot para unir 2 canales del IRC en 2 servidores distintos. Uso: ./ConnectBot.pl -s1 <servidor1> -s2 <servidor2> -c1 <canal1> -c2 <canal2> -n <nick>
CORe microBiome Analysis Tools
Corbata is a set of statistical tools that can be used to analyze the core microbiome across a set of samples.
Bioinformatic tool to improve draft genome assemblies by closing gaps
Eng-DB-2 is a light-weight engineering database. It allows to manage components/assemblies and their associated AVLs and technical documentation, assemble BOMs for finished goods and annotate these with quotations received from suppliers.
FastAlign is a perl script which uses the heuristic method of tfasty
FastAlign.pl provides a more intuitive output to find exon-intron junctions. The query string is in amino acids and the hit string is in nucleotides. There are extra nucleotides at the end of the hit string (option -diff and by default = 10), that allow to verify if the intron start with common rules (5'-GTGCGA-... for group II intron and after an exonic T for group I intron). If you have Emboss, you can genarate a graphic with option -graph 1. Dependencies: - Perl - Bioperl - UNIX system - tfasty - Emboss (if you want a graphic)