From: Antoni M. <ant...@gm...> - 2009-12-16 23:55:42
|
Greg Holmberg pisze: > I see in tutorials that the way to get the content from the > PDF/Word/etc. is to use > DataObject.getMetadata().getString(NIE.plainTextContent). > > > > Many of the underlying filtering libraries (PDFBox, POI) are also > capable of returning HTML, XHTML, or XML. > > > > Is there any way to get those forms of content strings from Aperture? The simplest answer is no. Aperture doesn't extract multiple versions of the content string, it is always plain text with all markup removed. Antoni Mylka ant...@gm... |