FMiner
FMiner is a software for web scraping, web data extraction, screen scraping, web harvesting, web crawling and web macro support for windows and Mac OS X. It is an easy to use web data extraction tool that combines best-in-class features with an intuitive visual project design tool, to make your next data mining project a breeze. Whether faced with routine web scrapping tasks, or highly complex data extraction projects requiring form inputs, proxy server lists, ajax handling and multi-layered multi-table crawls, FMiner is the web scrapping tool for you. With FMiner, you can quickly master data mining techniques to harvest data from a variety of websites ranging from online product catalogs and real estate classifieds sites to popular search engines and yellow page directories. Simply select your output file format and record your steps on FMiner as you walk through your data extraction steps on your target web site.
Learn more
Screaming Frog SEO Spider
The Screaming Frog SEO Spider is a website crawler that helps you improve onsite SEO, by extracting data & auditing for common SEO issues. Download & crawl 500 URLs for free, or buy a license to remove the limit & access advanced features. The SEO Spider is a powerful and flexible site crawler, able to crawl both small and very large websites efficiently while allowing you to analyze the results in real-time. It gathers key onsite data to allow SEOs to make informed decisions. Crawl a website instantly and find broken links (404s) and server errors. Bulk export the errors and source URLs to fix, or send to a developer. Find temporary and permanent redirects, identify redirect chains and loops, or upload a list of URLs to audit in a site migration. Analyze page titles and meta descriptions during a crawl and identify those that are too long, short, missing, or duplicated across your site.
Learn more
Google Cloud Natural Language API
Get insightful text analysis with machine learning that extracts, analyzes, and stores text. Train high-quality machine learning custom models without a single line of code with AutoML. Apply natural language understanding (NLU) to apps with Natural Language API. Use entity analysis to find and label fields within a document, including emails, chat, and social media, and then sentiment analysis to understand customer opinions to find actionable product and UX insights. Natural Language with speech-to-text API extracts insights from audio. Vision API adds optical character recognition (OCR) for scanned docs. Translation API understands sentiments in multiple languages. Use custom entity extraction to identify domain-specific entities within documents, many of which don’t appear in standard language models, without having to spend time or money on manual analysis. Train your own high-quality machine learning custom models to classify, extract, and detect sentiment.
Learn more
Webbee SEO Spider
Webbee is a desktop based SEO spider that crawl your website following the pattern of major search engine bots. It searches every nook and corner of your website and collects data for you to spot fruitful opportunities and critical issues that can be turned into major benefits. Download it today to find out the exact steps to turn your site into a traffic magnet. Webbee SEO Spider is an ultimate web spider that crawls your website with respect to major search engine’s guidelines. It gathers everything from your website that can be used to form a perfect search engine strategy for your website. Our spider is capable of crawling titles, headings (h1 to h6 with their frequency), http and https URLs, status codes (200 OK, Redirects, 404 pages, server errors), page types (images, html, css, JS, flash, PDF), GA codes, robots denied webpages, meta robots, all internal links, all external links, links frequency to internally linked pages, all anchor texts and their frequency.
Learn more