#18 text extraction, but still preserve pdf formatting

open
nobody
None
1
2005-03-23
2005-03-18
perspicuous
No

I use pdftohtml for text processing purposes, but the
<br> in the <div>s causes me discontinuity for the
paragraphs. I could just edit the html and remove the
<br>, but then I'd lose the orginal pdf layout, which I
don't want to do.

Would it be possible set an option to use the width
property in style to set the width, rather than use <br>?

Have a look here: www.jumpdemo.com to get an idea of
what I'm trying to acheive.

cheers.

Discussion

  • perspicuous

    perspicuous - 2005-03-23

    Logged In: YES
    user_id=1241882

    scrap that ... found a work around for my purposes.

     
  • perspicuous

    perspicuous - 2005-03-23
    • priority: 5 --> 1
     

Log in to post a comment.