From: Adam F. <a....@dc...> - 2010-12-17 16:14:32
|
[17/12/10 15:57] Benson Margulies wrote: > I've seen Factory.newDocument(String) decide that the string is XML, > try to parse it, and end up with ... How can I force the factory to > treat the input as plain text, no markup? I think you'll have to use the slightly more complicated Factory.createResource(String resourceClassName, FeatureMap parameterValues) method. If you are doing this a lot, you probably want to write your own convenience method along these lines (untested code with the usual disclaimers, will need some try/catch, etc.): public Document plainTextDocument(String stuff) { FeatureMap parameters = Factory.newFeatureMap(); parameters.put(Document.DOCUMENT_STRING_CONTENT_PARAMETER_NAME, stuff); // might not need both of the following // I can't remember parameters.put(Document.DOCUMENT_MIME_TYPE_PARAMETER_NAME, "text/plain"); parameters.put(Document.DOCUMENT_MARKUP_AWARE_PARAMETER_NAME, false); String rcn = "gate.corpora.DocumentImpl"; Resource document = Factory.createResource(rcn, parameters); return (Document) document; } |