Menu

Tree [r1] /
 History

HTTPS access


File Date Author Commit
 CHANGELOG 2009-10-02 kylecronan [r1] import code into svn
 LICENSE 2009-10-02 kylecronan [r1] import code into svn
 MANIFEST 2009-10-02 kylecronan [r1] import code into svn
 README 2009-10-02 kylecronan [r1] import code into svn
 pdftable 2009-10-02 kylecronan [r1] import code into svn
 pdftable.py 2009-10-02 kylecronan [r1] import code into svn
 setup.py 2009-10-02 kylecronan [r1] import code into svn

Read Me

Python module and command line utility that analyzes XML output from the
program pdftohtml in order to extract tables from PDF files. Outputs CSV.

For example:

pdftohtml -xml -stdout file.pdf | pdftable -f file%d.csv


See also 'pdftable -h' and http://sourceforge.net/projects/pdftable

Author: Kyle Cronan <kyle@pbx.org>
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.