216,000+ customers in over 135 countries grow their businesses with HubSpot
HubSpot is an AI-powered customer platform with all the software, integrations, and resources you need to connect your marketing, sales, and customer service. HubSpot's connected platform enables you to grow your business faster by focusing on what matters most: your customers.
For SMBs, enterprises, call centers, and governments
PBXware is the world’s first and most mature IP PBX Professional Open Standards Turnkey Telephony Platform. Since 2004, PBXware has deployed flexible, reliable, and scalable Next Generation Communication Systems and VoIP solutions to Small and Medium sized Businesses (SMBs), enterprises, Internet Telephony Service Providers (ITSPs), Contact Centers and governments worldwide by combining the most advanced of the latest technologies.
Web Textual eXtraction Tools C++ Parallel web crawler, noun phrase idenification, Multi-lingual Part of Speech Tagging, Tarjan's Algorithm, Co-RelationShip Mappings...
Wadsworth is a java based web scripting engine. It uses user-defined XML scripts to define its actions. It can be used as a web testing tool, or as a web scraper, or to automate any web actions you wish. It can also be invoked and controlled by another
A Java implementation of a flexible and extensible web spider engine.
Optional modules allow functionality to be added (searching dead links, testing the performance and scalability of a site, creating a sitemap, etc ..
Arachnid is a Java-based web spider framework. It includes a simple HTML parser object that parses an input stream containing HTML content. Simple Web spiders can be created by sub-classing Arachnid and adding a few lines of code called after each page
For companies that want to elevate their reporting journey effortlessly with dReveal’s desktop authoring tools.
A powerful, white-label, seamlessly embedded, and easily integrated reporting and dashboard solution, resulting from 20+ years of design development, targeted user feedback from internal auditors, and extensive GRC/audit management expertise. Thousands of users, from thousands of companies, benefit from using dReveal's highly customizable state-of-the-art reporting tools to easily transform their unrelated data into valuable internal audit management reports and dashboards.
WebSPHINX is a web crawler (robot, spider) Java class library, originally developed by Robert Miller of Carnegie Mellon University. Multithreaded, tollerant HTML parsing, URL filtering and page classification, pattern matching, mirroring, and more.
We are integrating existing communication systems including Wiki, IRC, Instant Messaging, e-mail, and even static web sites. We write web scrapers and servers for managing events, IRC bots, logs, local names, templates, and groups.
ApeSmit is a very simple Python module to create XML sitemaps as defined at http://www.sitemaps.org. ApeSmit doesnt contain any web spider or something like that, it just writes the data you provide to a file using the proper syntax.
Purpose of SAWS is to facilitate process of web scraping by - 1) providing a pattern specification mechanism on top of normal regular expressions 2) and implementation of common matching algorithm to run specified pattern on given source for any matches.
Web Crawling, Web Testing, and Web Scraping application
Blackfire Player is a powerful Web Crawling, Web Testing, and Web Scraper application. It provides a nice DSL to crawl HTTP services, assert responses, and extract data from HTML/XML/JSON responses.
Some Blackfire Player use cases:
Crawl a website/API and check expectations -- aka Acceptance Tests;
Scrape a website/API and extract values;
Monitor a website;
Test code with unit test integration (PHPUnit, Behat, Codeception, ...);
Test code behavior from the outside thanks to the...