The tutorials show that the way to get the content of a PDF, Word document, etc. is to call DataObject.getMetadata().getString(NIE.plainTextContent).


Many of the underlying filtering libraries (PDFBox, POI) can also return HTML, XHTML, or XML.


Is there any way to get the content in those forms from Aperture?




Greg Holmberg