CD-HIT is a very widely used program for clustering and comparing protein or nucleotide sequences. CD-HIT was originally developed by Dr. Weizhong Li. CD-HIT is currently maintained by Dr. Li's group at UCSD (http://weizhong-lab.ucsd.edu/).

CD-HIT is very fast and can handle extremely large databases. CD-HIT helps to significantly reduce the computational and manual efforts in many sequence analysis tasks and aids in understanding the data structure and correct the bias within a dataset.

The CD-HIT package has cd-hit, cd-hit-2d, cd-hit-est, cd-hit-est-2d, cd-hit-454, psi-cd-hit, cd-hit-otu, cd-hit-lap, cd-hit-dup and over a dozen scripts for various clustering needs.

Project Activity

See All Activity >

Follow cd-hit

cd-hit Web Site

Other Useful Business Software
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
Try Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of cd-hit !

Additional Project Details

Registered

2013-09-17