We are using a large archive of newspaper stories(GigaWordCorpus) as input to a parallel MPI program, and produce from that a list of top R terms of varying lengths M through N that are especially interesting.
The program is done in C using MPI.

Project Activity

See All Activity >

License

GNU General Public License version 2.0 (GPLv2)

Follow GigaWordCorpus

GigaWordCorpus Web Site

Other Useful Business Software
Full-stack observability with actually useful AI | Grafana Cloud Icon
Full-stack observability with actually useful AI | Grafana Cloud

Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Create free account
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of GigaWordCorpus!

Additional Project Details

Operating Systems

BSD, Linux

Intended Audience

Science/Research

Programming Language

C

Related Categories

C Text Processing Software, C Information Analysis Software

Registered

2008-10-23