CD-HIT is a very widely used program for clustering and comparing protein or nucleotide sequences. CD-HIT was originally developed by Dr. Weizhong Li. CD-HIT is currently maintained by Dr. Li's group at UCSD (http://weizhong-lab.ucsd.edu/).
CD-HIT is very fast and can handle extremely large databases. CD-HIT helps to significantly reduce the computational and manual efforts in many sequence analysis tasks and aids in understanding the data structure and correct the bias within a dataset.
The CD-HIT package has cd-hit, cd-hit-2d, cd-hit-est, cd-hit-est-2d, cd-hit-454, psi-cd-hit, cd-hit-otu, cd-hit-lap, cd-hit-dup and over a dozen scripts for various clustering needs.
Follow cd-hit
Other Useful Business Software
AI-generated apps that pass security review
Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of cd-hit !