Audience

Java developers in search of a tool to parse, extract, and manipulate data from HTML and XML documents

About jsoup

jsoup is a Java library that simplifies working with real-world HTML and XML. It offers an easy-to-use API for URL fetching, data parsing, extraction, and manipulation using DOM API methods, CSS, and XPath selectors. jsoup implements the WHATWG HTML5 specification and parses HTML to the same DOM as modern browsers. With jsoup, you can scrape and parse HTML from a URL, file, or string; find and extract data using DOM traversal or CSS selectors; manipulate HTML elements, attributes, and text; clean user-submitted content against a safelist to prevent XSS attacks; and output tidy HTML. jsoup is designed to deal with all varieties of HTML found in the wild, from pristine and validating to invalid tag-soup, creating a sensible parse tree. For example, you can fetch the Wikipedia homepage, parse it to a DOM, and select the headlines from the "In the news" section into a list of elements.

Integrations

API:
Yes, jsoup offers API access

Ratings/Reviews

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Company Information

jsoup
jsoup.org

Videos and Screen Captures

jsoup Screenshot 1
Other Useful Business Software
Full-stack observability with actually useful AI | Grafana Cloud Icon
Full-stack observability with actually useful AI | Grafana Cloud

Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Create free account

Product Details

Platforms Supported
Windows
Mac
Linux
Training
Documentation
Support
Online

jsoup Frequently Asked Questions

Q: What kinds of users and organization types does jsoup work with?
Q: What languages does jsoup support in their product?
Q: What other applications or services does jsoup integrate with?
Q: Does jsoup have an API?
Q: What type of training does jsoup provide?

jsoup Product Features

Web Design

Templates
Element Libraries
Drag & Drop
Content Management
Syntax Highlighting
Autocompletion
Collaborative Editing
Programming Language Support