WebNews Crawler is a specific web crawler (spider, fetcher) designed to acquire and clean news articles from RSS and HTML pages. It can do a site specific extraction to extract the actual news content only, filtering out the advertising and other cruft.

Project Activity

See All Activity >

License

GNU General Public License version 2.0 (GPLv2)

Follow WebNews Crawler

WebNews Crawler Web Site

Other Useful Business Software
Earn up to 16% annual interest with Nexo. Icon
Earn up to 16% annual interest with Nexo.

Access competitive interest rates on your digital assets.

Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
Get started with Nexo.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of WebNews Crawler!

Additional Project Details

Intended Audience

Advanced End Users

User Interface

Command-line

Programming Language

Java

Related Categories

Java Search Engines, Java Web Scrapers

Registered

2006-05-19