Related Products

  • Oxylabs (1,156 ratings)
  • AddSearch (138 ratings)
  • UnForm (18 ratings)
  • Apify (1,051 ratings)
  • Dynamo Software (68 ratings)
  • Square 9 (400 ratings)
  • Guru (3,636 ratings)
  • ARGOS Identity (8 ratings)
  • LM-Kit.NET (23 ratings)
  • Nutrient SDK (104 ratings)

About (Openindex)

Openindex is a web data and search solutions platform that helps organizations collect, extract, crawl, analyze, and integrate information from the internet or internal sources into applications, research workflows, or search experiences. Its data extraction tools automatically gather and parse web content, detecting languages, main text, images, prices, and structured elements. Entity extraction, available via API or demos, identifies people, companies, locations, and other named entities in text or documents, so text can be analyzed without manual work. Openindex's data crawling and scraping services use enhanced web spiders and customized software to traverse and index sites at scale, avoid spider traps, and harvest specific datasets for research, market analysis, competitive insights, and integration-ready data feeds.

About (jsoup)

jsoup is a Java library that simplifies working with real-world HTML and XML. It offers an easy-to-use API for URL fetching, data parsing, extraction, and manipulation using DOM API methods, CSS, and XPath selectors. jsoup implements the WHATWG HTML5 specification and parses HTML to the same DOM as modern browsers. With jsoup, you can scrape and parse HTML from a URL, file, or string; find and extract data using DOM traversal or CSS selectors; manipulate HTML elements, attributes, and text; clean user-submitted content against a safelist to prevent XSS attacks; and output tidy HTML. jsoup is designed to deal with all varieties of HTML found in the wild, from pristine and validating to invalid tag-soup, creating a sensible parse tree. For example, you can fetch the Wikipedia homepage, parse it to a DOM, and select the headlines from the "In the news" section into a list of elements.
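
The selection and cleaning steps described above can be sketched roughly as follows. This is a minimal illustration rather than an official jsoup sample: the class name JsoupSketch, the helper methods headlineTexts and sanitize, the "#news" id, and the sample markup are all hypothetical, and the HTML is parsed from a string instead of being fetched from a URL so that no network access is needed. It assumes a jsoup release recent enough to provide the Safelist class on the classpath.

```java
import java.util.List;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.safety.Safelist;

public class JsoupSketch {

    // Parse an HTML string into a DOM and return the text of every link
    // matched by a CSS selector. The "#news" id is a made-up example.
    public static List<String> headlineTexts(String html) {
        Document doc = Jsoup.parse(html);
        return doc.select("#news b a").eachText();
    }

    // Remove everything outside jsoup's basic safelist (for example
    // <script> elements) from untrusted, user-submitted HTML.
    public static String sanitize(String untrustedHtml) {
        return Jsoup.clean(untrustedHtml, Safelist.basic());
    }

    public static void main(String[] args) {
        // Deliberately sloppy markup: the <li> elements are never closed.
        String html = "<div id=\"news\"><ul>"
                + "<li><b><a href=\"/one\">First headline</a></b>"
                + "<li><b><a href=\"/two\">Second headline</a></b>"
                + "</ul></div>";

        for (String headline : headlineTexts(html)) {
            System.out.println(headline);
        }

        System.out.println(sanitize("<p>Hello <script>alert(1)</script><b>world</b></p>"));
    }
}
```

To work against a live page instead, Jsoup.connect(url).get() returns the same kind of Document, so the selector logic would stay unchanged.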

Platforms Supported (Openindex and jsoup)

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience (Openindex)

Developers, data engineers, analysts, and businesses that need scalable web data solutions, site search, and text analytics to build insights, search features, and data-driven applications

Audience (jsoup)

Java developers in search of a tool to parse, extract, and manipulate data from HTML and XML documents

Support (Openindex and jsoup)

Phone Support
24/7 Live Support
Online

API (Openindex and jsoup)

Offers API

Pricing (Openindex)

€100 per month
Free Version
Free Trial

Pricing (jsoup)

No information available.
Free Version
Free Trial

Reviews/Ratings (Openindex and jsoup)

Neither product has been reviewed yet; overall, ease, features, design, and support are all listed as 0.0 / 5.

Training (Openindex and jsoup)

Documentation
Webinars
Live Online
In Person

Company Information (Openindex)

Openindex
Founded: 2010
Netherlands
www.openindex.io

Company Information (jsoup)

jsoup
jsoup.org

Alternatives

  • parsel (Python Software Foundation)
  • Netpeak Spider (Netpeak Software)

Integrations (Openindex and jsoup)

CSS
GitHub
HTML
JavaScript
PHP