A Java application that extracts text from PDF files.
Users can select one or more areas on a PDF page and extract the text inside them. Extraction can run on a single page or across multiple pages.
The application can also generate bookmarks based on font heights entered by the user.
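
The idea of area-based extraction can be sketched in plain Java. This is only an illustration, not the application's code: the `RegionTextSketch` class, the `Fragment` type, and the `textInArea` method are hypothetical names, and the page is modelled as a list of already-positioned text fragments rather than read from a real PDF. In a real PDF library the positions would come from the parser (PDFBox, for example, ships a `PDFTextStripperByArea` class for region extraction; whether this project uses it is not stated here).

```java
import java.awt.geom.Rectangle2D;
import java.util.Arrays;
import java.util.List;

class RegionTextSketch {

    /** Hypothetical model of one positioned piece of text on a page. */
    static final class Fragment {
        final String text;
        final double x; // horizontal position of the fragment's anchor point
        final double y; // vertical position of the fragment's anchor point

        Fragment(String text, double x, double y) {
            this.text = text;
            this.x = x;
            this.y = y;
        }
    }

    /**
     * Joins, in input order, the text of every fragment whose anchor point
     * lies inside the selected area. Fragments are separated by one space.
     */
    static String textInArea(List<Fragment> fragments, Rectangle2D area) {
        StringBuilder out = new StringBuilder();
        for (Fragment f : fragments) {
            if (area.contains(f.x, f.y)) {
                if (out.length() > 0) {
                    out.append(' ');
                }
                out.append(f.text);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Made-up page content for illustration only.
        List<Fragment> page = Arrays.asList(
                new Fragment("Title", 50, 40),
                new Fragment("Body", 50, 120),
                new Fragment("text", 90, 120),
                new Fragment("Footer", 50, 780));

        // A selection covering x in [0, 300) and y in [100, 150).
        Rectangle2D selection = new Rectangle2D.Double(0, 100, 300, 50);
        System.out.println(textInArea(page, selection)); // prints "Body text"
    }
}
```

Running the same filter over the fragment lists of several pages would cover the multi-page case.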

Features

  • Uses two PDF libraries: PDFBox and JPedal (LGPL version).
  • Converts PDF text to HTML. The HTML can then be converted to an e-book with software such as Calibre.
  • Extracts text from a user-selected area.
  • Generates bookmarks based on font heights.
  • Developed in Java.
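
One way bookmark generation from font heights could work is sketched below in plain Java. Everything here is an assumption made for illustration: the class `FontHeightBookmarkSketch`, the `Line` and `Bookmark` types, the `bookmarksFor` method, and the rule that the first height entered becomes level 1, the second level 2, and so on. The project's actual matching rule is not documented in this listing.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class FontHeightBookmarkSketch {

    /** Hypothetical model of one text line and the font height it was set in. */
    static final class Line {
        final String text;
        final double fontHeight;

        Line(String text, double fontHeight) {
            this.text = text;
            this.fontHeight = fontHeight;
        }
    }

    /** A bookmark entry: nesting level (1 = top) and title. */
    static final class Bookmark {
        final int level;
        final String title;

        Bookmark(int level, String title) {
            this.level = level;
            this.title = title;
        }
    }

    /**
     * Turns every line whose font height matches one of the user-entered
     * heights (within the given tolerance) into a bookmark. The index of the
     * matching height, plus one, is used as the bookmark level (an assumption).
     * Lines that match no height are treated as body text and skipped.
     */
    static List<Bookmark> bookmarksFor(List<Line> lines, double[] headingHeights, double tolerance) {
        List<Bookmark> result = new ArrayList<>();
        for (Line line : lines) {
            for (int i = 0; i < headingHeights.length; i++) {
                if (Math.abs(line.fontHeight - headingHeights[i]) <= tolerance) {
                    result.add(new Bookmark(i + 1, line.text));
                    break; // first matching height wins
                }
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up document lines for illustration only.
        List<Line> lines = Arrays.asList(
                new Line("Chapter 1", 18.0),
                new Line("Some body text", 10.0),
                new Line("Section 1.1", 14.0),
                new Line("Chapter 2", 18.0));

        // Heights as a user might enter them: 18 for chapters, 14 for sections.
        for (Bookmark b : bookmarksFor(lines, new double[] {18.0, 14.0}, 0.1)) {
            StringBuilder indent = new StringBuilder();
            for (int i = 1; i < b.level; i++) {
                indent.append("  ");
            }
            System.out.println(indent + b.title);
        }
    }
}
```

The tolerance parameter is there because font heights reported by a PDF parser are floating-point values and may not match the entered number exactly.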

License

Apache License V2.0, Creative Commons Attribution License

User Ratings

  • 5 stars: 0
  • 4 stars: 0
  • 3 stars: 0
  • 2 stars: 0
  • 1 star: 1
  • Ease, features, design, support: 0 / 5 each

User Reviews

  • no recognition, why then is all this necessary at all?

Additional Project Details

Operating Systems

Windows

Intended Audience

End Users/Desktop

User Interface

Java Swing

Programming Language

Java

Related Categories

Java Desktop Environment Software

Registered

2015-02-03