pdf2xml is a converter based on the Xpdf library (http://www.foolabs.com/xpdf/home.html). It converts the information contained in a PDF file into XML. First, install xpdf and libxml2 (see the documentation).
Hervé Déjean
Xerox Research Centre Europe

http://www.xrce.xerox.com/About-XRCE/People/Herve-Dejean

Features

  • PDF to XML conversion
  • text extraction
  • extraction of vector drawing instructions

Categories

XML, Topic, Cataloguing

License

GNU General Public License version 2.0 (GPLv2)

User Ratings

★★★★★ 10
★★★★ 1
★★★ 0
★★ 0
★ 1

ease: 4 / 5
features: 5 / 5
design: 5 / 5
support: 5 / 5

User Reviews

  • The link for the SVN code is not working. I want to integrate this functionality into my Java project; please provide a valid link.
  • Thanks, very good project!
  • Used on the IRS f1040.pdf to produce f1040.xml; however, when viewed in Firefox, Firefox indicated it had no styling, so it didn't look anything like the PDF file as viewed in Adobe Reader.
  • Very useful, a must-have program. Great job!
  • Simple, no fuss. Works for all types.

Additional Project Details

Operating Systems

BSD, Linux, MinGW/MSYS2, Windows

Intended Audience

Developers, End Users/Desktop, Information Technology

User Interface

Command-line

Programming Language

C++

Registered

2007-07-11