Samudra Manthan uses C and MPI to find interesting n-grams (terms) in a large text corpus. We use the GigaWord corpus to find the top m interesting n-grams, ranked by the TF*IDF measure.
Categories
Distributed Computing
License
GNU General Public License version 2.0 (GPLv2)