From: Benson M. <bim...@gm...> - 2010-12-17 15:57:18
I've seen Factory.newDocument(String) decide that the string is XML, try to parse it, and end up with ...

How can I force the factory to treat the input as plain text, no markup?

    at gate.xml.SimpleErrorHandler.fatalError(SimpleErrorHandler.java:61)
    at gate.xml.XmlDocumentHandler.fatalError(XmlDocumentHandler.java:492)
    at org.apache.xerces.util.ErrorHandlerWrapper.fatalError(Unknown Source)
    at org.apache.xerces.impl.XMLErrorReporter.reportError(Unknown Source)
    at org.apache.xerces.impl.XMLErrorReporter.reportError(Unknown Source)
    at org.apache.xerces.impl.XMLErrorReporter.reportError(Unknown Source)
    at org.apache.xerces.impl.XMLScanner.reportFatalError(Unknown Source)
    at org.apache.xerces.impl.XMLDocumentScannerImpl$PrologDispatcher.dispatch(Unknown Source)
    at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
    at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
    at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
    at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
    at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
    at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source)
    at gate.corpora.XmlDocumentFormat.unpackGeneralXmlMarkup(XmlDocumentFormat.java:306)
    at gate.corpora.XmlDocumentFormat.unpackMarkup(XmlDocumentFormat.java:132)
    at gate.corpora.XmlDocumentFormat.unpackMarkup(XmlDocumentFormat.java:83)
    at gate.corpora.DocumentImpl.init(DocumentImpl.java:245)
    at gate.Factory.createResource(Factory.java:385)
    at gate.Factory.createResource(Factory.java:106)
    at gate.Factory.newDocument(Factory.java:462)