• Implemented the project using PageRank algorithm for Wikipedia pages on Amazon Elastic MapReduce.
• Designed MapReduce jobs for red links removal, outlink adjacency graph, compute the total number of pages, PageRank calculation, sorting of PageRanks.
• To run the project on amazon Elastic MapReduce specify jar location. Pass the directory locations as an argument of input and output respectively.
Features
- PageRank Algorithm on Amazon Elastic MapReduce
Follow PageRank for wikipedia
Other Useful Business Software
Try Google Cloud Risk-Free With $300 in Credit
Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of PageRank for wikipedia!