Java library for working with real-world HTML
Free tool that extracts emails, phone numbers, and custom text from the web using Java regex
ACHE is a web crawler for domain-specific search
Open source web crawler for Java
DSTK - DataScience ToolKit for All of Us
Open source Search Engine and Enterprise Search