Donate Share

PDFBox

Tracker: Bugs

5 PDFTextStripper - unwanted removal of spaces between words - ID: 1775060
Last Update: Comment added ( nobody )

I have been trying to extract text from a pdf document but the output is
fine except that no spaces are left between many of the words in the
original document. Can you advise please?

An example of the source document I am using can be found heree.

https://www.hcrregister.com/ReportDownload?key=8493-6026-4010-7237-5096

Regards

Geoff


Nobody/Anonymous ( nobody ) - 2007-08-15 23:47

5

Open

None

Ben Litchfield

text extraction

None

Public


Comments ( 3 )




Date: 2009-06-09 18:40
Sender: nobody

I have a similar issue where line breaks should be - the last word on one
line is combined with the first word on the line below it.


Date: 2007-11-09 21:20
Sender: carlemac_2007


I've similar problem, except multiple spaces are replaced by single
space...


Date: 2007-09-21 14:41
Sender: kameroliefant


I have the same problem...


Log in to comment.

Attached File

No Files Currently Attached

Change

No changes have been made to this artifact.