Samudra Manthan uses C and MPI to find interesting n-grams (terms) in a large text corpus. We use the GigaWord corpus to find the top m interesting n-grams, ranked by the TF*IDF measure.

License

GNU General Public License version 2.0 (GPLv2)

Additional Project Details

Operating Systems

Linux

Intended Audience

Science/Research

Programming Language

C

Related Categories

C Distributed Computing Software

Registered

2008-10-24