newsscrape is web scraping for news headline to analyse on how it relates to a news category.
- It extracts RSS feed from Google News.
- Each news headline is matched against Google News category like Entertainment, Sports, etc.
- Called from scheduler to collect this data at 5 minutes interval and be accumulated in a database.
- It contains R statistical computing scripts to learn the pattern on words in the headline resulting a particular category.
- To test its accuracy in predicting the category from a news headline, select a news title from other sources - e.g. http://rss.news.yahoo.com/rss/entertainment - and incorporate it into the R script for outputting a news category it assumes on the news title.
Categories
Web ScrapersFollow newsscrape
You Might Also Like
With the world of work changed forever, it’s essential to manage your workplace and assets together to effectively create a high-performing environment. The Eptura experience combines the power of workplace management software with asset management, enabling you to effectively operate your building and facilitate hybrid work.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of newsscrape!