• Implemented the project using PageRank algorithm for Wikipedia pages on Amazon Elastic MapReduce.
• Designed MapReduce jobs for red links removal, outlink adjacency graph, compute the total number of pages, PageRank calculation, sorting of PageRanks.
• To run the project on amazon Elastic MapReduce specify jar location. Pass the directory locations as an argument of input and output respectively.
Features
- PageRank Algorithm on Amazon Elastic MapReduce
Follow PageRank for wikipedia
You Might Also Like
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of PageRank for wikipedia!