• Implemented the project using PageRank algorithm for Wikipedia pages on Amazon Elastic MapReduce.
• Designed MapReduce jobs for red links removal, outlink adjacency graph, compute the total number of pages, PageRank calculation, sorting of PageRanks.
• To run the project on amazon Elastic MapReduce specify jar location. Pass the directory locations as an argument of input and output respectively.

Features

  • PageRank Algorithm on Amazon Elastic MapReduce

Project Activity

See All Activity >

Follow PageRank for wikipedia

PageRank for wikipedia Web Site

Other Useful Business Software
Try Google Cloud Risk-Free With $300 in Credit Icon
Try Google Cloud Risk-Free With $300 in Credit

No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of PageRank for wikipedia!

Additional Project Details

Registered

2014-02-12