pdfGoHTML

pdfGoHTML

callas software GmbH
+
+

Related Products

  • Crowdin
    867 Ratings
    Visit Website
  • Nutrient SDK
    104 Ratings
    Visit Website
  • CirrusPrint
    2 Ratings
    Visit Website
  • ManageEngine EventLog Analyzer
    203 Ratings
    Visit Website
  • BrandMail
    313 Ratings
    Visit Website
  • Boozang
    15 Ratings
    Visit Website
  • FrontFace
    49 Ratings
    Visit Website
  • Popl
    6,810 Ratings
    Visit Website
  • Oxylabs
    1,151 Ratings
    Visit Website
  • LALAL.AI
    4,805 Ratings
    Visit Website

About

jsoup is a Java library that simplifies working with real-world HTML and XML. It offers an easy-to-use API for URL fetching, data parsing, extraction, and manipulation using DOM API methods, CSS, and XPath selectors. jsoup implements the WHATWG HTML5 specification and parses HTML to the same DOM as modern browsers. With jsoup, you can scrape and parse HTML from a URL, file, or string; find and extract data using DOM traversal or CSS selectors; manipulate HTML elements, attributes, and text; clean user-submitted content against a safelist to prevent XSS attacks; and output tidy HTML. jsoup is designed to deal with all varieties of HTML found in the wild, from pristine and validating to invalid tag-soup, creating a sensible parse tree. For example, you can fetch the Wikipedia homepage, parse it to a DOM, and select the headlines from the "In the news" section into a list of elements.

About

pdfGoHTML substantially speeds up the creation and evaluation of tagged PDFs and ensures a much higher degree of usability of tagged PDF files. One simple click on the plug-in button converts the tagged PDF into HTML making it easy to examine the tagging structure, have a more flexible reading experience, or make the document accessible for people with visual disabilities or dyslexia. pdfGoHTML is a free Acrobat plug-in (Acrobat DC Standard and Pro only) converting tagged PDF files into HTML, supporting PDF/UA. It shows where the tagging structure of the PDF needs improvement. It substantially speeds up the creation and evaluation of tagged PDFs and ensures a much higher degree of usability of tagged PDF files. When opening a PDF file, pdfGoHTML immediately indicates whether or not the file is tagged and allows a one-button conversion into HTML in the default browser. Users can easily switch how the HTML is displayed to adjust it to their specific needs.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Java developers in search of a tool to parse, extract, and manipulate data from HTML and XML documents

Audience

Companies requiring a tool to make their document accessible for people with visual disabilities

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

jsoup
jsoup.org

Company Information

callas software GmbH
Founded: 1995
Germany
www.callassoftware.com/en/products/pdfgohtml

Alternatives

parsel

parsel

Python Software Foundation

Alternatives

PDFix SDK

PDFix SDK

PDFix
Aspose.PDF

Aspose.PDF

Aspose

Categories

Categories

Integrations

HTML
Adobe Acrobat
CSS
GitHub
JavaScript

Integrations

HTML
Adobe Acrobat
CSS
GitHub
JavaScript
Claim jsoup and update features and information
Claim jsoup and update features and information
Claim pdfGoHTML and update features and information
Claim pdfGoHTML and update features and information