Menu

Parsing PPT using ppt2x

GregG
2012-06-06
2013-02-20
  • GregG

    GregG - 2012-06-06

    How can I retrieve body text from PPT document using StructuredStorageReader, PowerpointDocument classes?
    Appreciate any help.

     
  • Markus Kölzer

    Markus Kölzer - 2012-06-11

    Hello,

    StructuredStorageReader is a class for parsing the "Structured Storage" container format.
    PowerpointDocument is a class for parsing the ppt file format.
    You can create an instance of PowerpointDocument as follows:

    PowerpointDocument  ppt = new PowerpointDocument (new StructuredStorageReader("path/to/your/file.ppt"))

    In order to read text from a PowerPoint document you need to have knowledge of the PPT file format.
    I highly recommend to read the PPT file format specification: http://www.microsoft.com/interop/docs/OfficeBinaryFormats.mspx

    Best Regards
    Markus

     

Log in to post a comment.