A Java Application that extracts text from pdf files.
User can select different areas on the pdf file and can extract text from those areas.Extraction of text can be done for single or multiple pages.
Generate Bookmarks on the basis of Font Heights entered by the user.
Features
- Uses two pdf libraries Pdfbox and JPedal(LGPL version)
- Can convert PdfText to HTML. User can convert HTML to E book by using softwares like Calibre.
- Can Extract text from the user selected area.
- Can Generate Bookmarks on the basis of Font Heights.
- Developed using Java.
Categories
Desktop EnvironmentLicense
Apache License V2.0, Creative Commons Attribution LicenseFollow Pdf Text Extractor
Other Useful Business Software
AI-generated apps that pass security review
Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
Rate This Project
Login To Rate This Project
User Reviews
-
no recognition, why then is all this necessary at all?