From: Grant I. <gra...@gm...> - 2007-11-02 12:39:12
|
On Nov 2, 2007, at 5:13 AM, Christiaan Fluit wrote: > Antoni Mylka wrote: >> On 11/1/07, Grant Ingersoll <gra...@gm...> wrote: >>> Is there somewhere that documents what keys the crawlers put into >>> the >>> RDFContainer? For example, I am using the IMap Crawler, which seems >>> to be invoking the PlainTextExtractor (PTE). > > FYI: none of the Crawlers invoke (or should invoke) any Extractors. > They > may use the same properties from the NIE namespace and subnamespaces, > but code-wise they are completely independent. > > What you're probably seeing here is that ImapCrawler is using the same > full-text property as PlainTextExtractor. > Yes, you are right, I misspoke. The content that is coming back from the IMap Crawler is text/plain, and thus the PlainTextExtractor is chosen. |