• Implemented the project using PageRank algorithm for Wikipedia pages on Amazon Elastic MapReduce.
• Designed MapReduce jobs for red links removal, outlink adjacency graph, compute the total number of pages, PageRank calculation, sorting of PageRanks.
• To run the project on amazon Elastic MapReduce specify jar location. Pass the directory locations as an argument of input and output respectively.
Features
- PageRank Algorithm on Amazon Elastic MapReduce
Follow PageRank for wikipedia
Other Useful Business Software
Build Securely on Azure with Proven Frameworks
Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of PageRank for wikipedia!