Share

pdftohtml

Code

Programming Languages: C++

License: GNU General Public License (GPL)

Repositories

browse code, statistics, last commit on 2003-08-29 cvs -d:pserver:anonymous@pdftohtml.cvs.sourceforge.net:/cvsroot/pdftohtml login

cvs -z3 -d:pserver:anonymous@pdftohtml.cvs.sourceforge.net:/cvsroot/pdftohtml co -P modulename

Show:

What's happening?

  • Followup: RE: Plzzz Where is Linux compiled of this project

    I should compile it to run in Linux, but is there any precompiled version done by anyone, if there please suggest and post me the link.

    2009-11-07 06:19:29 UTC by rajesh4source

  • Plzzz Where is Linux compiled of this project

    I used the windows binary compiled software of this version in my localhost on Windows Machine, i have to run this at server of LINUX hosting.What shall i do now for getting to work this on Linux server?.

    2009-11-07 06:17:38 UTC by rajesh4source

  • Followup: RE: Compiling 0.40 on Windows

    A bit late, but thank you Michael, exactly what I needed.

    2009-10-22 20:26:32 UTC by brasmusson

  • Followup: RE: Extracting Words from PDF

    Have you heard anything or figured anything out regarding this? I know there is a tool "pdftoxml" which uses pdftohtml but I can't seem to compile it on any of my Macs. Pdftohtml works but it does line by line rather than word by word.

    2009-08-18 18:40:10 UTC by askmp

  • pdfripper

    Check this This product pdfripper ( http://www.pdfpdf.com/pdfconverter.html ) use pdf2html !!! Does he violate your licence ?.

    2009-07-22 12:05:43 UTC by manatlan

  • Output based on document outline

    Using pdftohtml: Is it possible to output html files based on document outline (bookmarks)? One html file for each bookmark instead one for each page? Thanks, Marius.

    2009-05-04 17:47:48 UTC by matanasescu

  • Comment: Cannot compile 0.39 on Debian 4.0

    Hi, It seems the error comes from gcc version 4 not supporting some c++ syntax previously supported. (http://forum.ubuntu-fr.org/viewtopic.php?id=83373) I could compile by removing the class name in isEqualDest prototype (HtmlLinks.h). Below is the diff: # diff ./pdftohtml-0.39/src/HtmlLinks.h.orig ./pdftohtml-0.39/src/HtmlLinks.h 22c22,24 < GBool HtmlLink::isEqualDest(const HtmlLink& x...

    2009-04-27 08:42:09 UTC by nobody

  • My html page isnt pretty :/

    Hey, I installed PDFtoHTML on Debian through apt-get. when i use the command: pdf2html -c demo1.pdf i get this output: http://pdf.shop.internet.dk/pdf/html/demo1.html How can i get a nice output like the one in the projects demo? Best regards Jonas.

    2009-04-21 11:18:53 UTC by jonashsdk

  • extracting non-textual objects

    Hi, how are you guys? I'm using pdftohtml and am very happy about it, but for my project it's pretty essential I extract from the pdf file non-textual objects as well - lines most importantly. Pdf lines are not graphics, i.e. they are not jpeg's or something of that sort, but are not textual, of course. Is there a way to catch them with pdftohtml? Can I make any modifications to get me on...

    2009-03-15 18:08:33 UTC by raki200582

  • Comment: Cannot compile 0.39 on Debian 4.0

    I'm having the same problem, please help!

    2009-03-09 10:51:04 UTC by nobody

Our Numbers