From: Giulia H. <gh...@li...> - 2007-04-10 18:42:50
|
The executive decision made above me, given that renaming would create several problems, has been to just avoid them. I'll check into excluding that directory from the indexing since the root point of the indexing is above it and the file structure can't be changed nor modified. Giulia At 11:33 AM 4/10/2007, Martin Haye wrote: >Hi Giulia, > >I just played around with ampersands and spaces in filenames. Some >changes to the code allow it to index them, and crossQuery to query >them. However, problems arise when you try to make a URL containing >an ampersand, and I think it'll take some extensive stylesheet >changes to make that work right. > >So I got to wondering... is this worth it? Things work okay with >spaces in the filename for me, it's just the ampersand that's >problematic. If it's only a few files, perhaps you could just rename them? > >Let me know what you think, > >--Martin > >On 4/5/07, Giulia Hill ><<mailto:gh...@li...>gh...@li... > wrote: > >I have found the problem: filenames including spaces and '&' signs. >Not sure yet about the solution, but at least I know what's going on. > >Giulia > >At 09:38 AM 4/5/2007, Giulia Hill wrote: > > >Hi, > > > >I'm running into a parsing problem while scanning the directory in > >the indexing process. What surprises me is that it happens before > >handling the individual files, so it's not cleat to me where the > >problem is generated. > > > >Suggestions? thanks > > > >Giulia > > > >/home/ghill 2: /data/_w/xtf/bin/textIndexer -config > >/data/_w/xtf/conf/textIndexerATM.conf -trace info -index atm > > > >TextIndexer v1.8 > > > > > > Purging Incomplete Documents From Indexes: > > Index: [/data/_w/xtfIndexes/index1.8.ATM/] > > No Incomplete Documents Found. > > Done. > > > > Indexing New/Updated Documents: > > Index: "atm" > > Scanning Data > >Directories........................................................ > .........................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................! > >.! > > ...... > >. > >................................................................... > .........................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................! > >.! > > ...... > >. > >................................................................... > ................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................. > > > Saxon Error on line 5 column 28 : Error reported by XML > >parser: The entity name must immediately follow the '&' in the > > entity reference. > >*** Error: class net.sf.saxon.trans.DynamicError > >net.sf.saxon.trans.DynamicError: org.xml.sax.SAXParseException: The > >entity name must immediately follow the '&' in the entity reference. > > at net.sf.saxon.event.Sender.sendSAXSource (Sender.java:247) > > at net.sf.saxon.event.Sender.send(Sender.java:106) > > at net.sf.saxon.event.Sender.send(Sender.java:45) > > at net.sf.saxon.Controller.transform(Controller.java :971) > > at > >org.cdlib.xtf.textIndexer.SrcTreeProcessor.processDir(SrcTreeProces > sor.java:476) > > at > >org.cdlib.xtf.textIndexer.SrcTreeProcessor.processDir(SrcTreeProces > sor.java:531) > > at > >org.cdlib.xtf.textIndexer.SrcTreeProcessor.processDir(SrcTreeProces > sor.java:531) > > at > >org.cdlib.xtf.textIndexer.SrcTreeProcessor.processDir(SrcTreeProces > sor.java:531) > > at > >org.cdlib.xtf.textIndexer.SrcTreeProcessor.processDir(SrcTreeProces > sor.java:531) > > at > org.cdlib.xtf.textIndexer.TextIndexer.main(TextIndexer.java:328) > >Caused by: org.xml.sax.SAXParseException : The entity name must > >immediately follow the '&' in the entity reference. > > at > >com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.createS > AXParseException(ErrorHandlerWrapper.java :236) > > at > >com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.fatalEr > ror(ErrorHandlerWrapper.java:215) > > at > >com.sun.org.apache.xerces.internal.impl.XMLErrorReporter.reportErro > r (XMLErrorReporter.java:386) > > at > >com.sun.org.apache.xerces.internal.impl.XMLErrorReporter.reportErro > r(XMLErrorReporter.java:316) > > at > >com.sun.org.apache.xerces.internal.impl.XMLScanner.reportFatalError > (XMLScanner.java:1438) > > at > >com.sun.org.apache.xerces.internal.impl.XMLScanner.scanAttributeVal > ue(XMLScanner.java:850) > > at > >com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.sc > anAttribute (XMLNSDocumentScannerImpl.java:572) > > at > >com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.sc > anStartElement(XMLNSDocumentScannerImpl.java:222) > > at > >com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerI > mpl$FragmentContentDispatcher.dispatch > (XMLDocumentFragmentScannerImpl.java:1693) > > at > >com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerI > mpl.scanDocument(XMLDocumentFragmentScannerImpl.java:368) > > at > > > com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:834) > > at > >com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse > (XML11Configuration.java:764) > > at > >com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(XMLParse > r.java:148) > > at > >com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse( > AbstractSAXParser.java:1242) > > at > >org.cdlib.xtf.servletBase.DTDSuppressingXMLReader.parse(DTDSuppress > ingXMLReader.java:121) > > at net.sf.saxon.event.Sender.sendSAXSource(Sender.java:239) > > ... 9 more > >--------- > >org.xml.sax.SAXParseException: The entity name must immediately > >follow the '&' in the entity reference. > > at > >com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.createS > AXParseException (ErrorHandlerWrapper.java:236) > > at > >com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.fatalEr > ror(ErrorHandlerWrapper.java:215) > > at > >com.sun.org.apache.xerces.internal.impl.XMLErrorReporter.reportErro > r (XMLErrorReporter.java:386) > > at > >com.sun.org.apache.xerces.internal.impl.XMLErrorReporter.reportErro > r(XMLErrorReporter.java:316) > > at > >com.sun.org.apache.xerces.internal.impl.XMLScanner.reportFatalError > (XMLScanner.java:1438) > > at > >com.sun.org.apache.xerces.internal.impl.XMLScanner.scanAttributeVal > ue(XMLScanner.java:850) > > at > >com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.sc > anAttribute (XMLNSDocumentScannerImpl.java:572) > > at > >com.sun.org.apache.xerces.internal.impl.XMLNSDocumentScannerImpl.sc > anStartElement(XMLNSDocumentScannerImpl.java:222) > > at > >com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerI > mpl$FragmentContentDispatcher.dispatch > (XMLDocumentFragmentScannerImpl.java:1693) > > at > >com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerI > mpl.scanDocument(XMLDocumentFragmentScannerImpl.java:368) > > at > > > com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:834) > > at > >com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse > (XML11Configuration.java:764) > > at > >com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(XMLParse > r.java:148) > > at > >com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse( > AbstractSAXParser.java:1242) > > at > >org.cdlib.xtf.servletBase.DTDSuppressingXMLReader.parse(DTDSuppress > ingXMLReader.java:121) > > at net.sf.saxon.event.Sender.sendSAXSource(Sender.java:239) > > at net.sf.saxon.event.Sender.send (Sender.java:106) > > at net.sf.saxon.event.Sender.send(Sender.java:45) > > at net.sf.saxon.Controller.transform(Controller.java:971) > > at > >org.cdlib.xtf.textIndexer.SrcTreeProcessor.processDir > (SrcTreeProcessor.java:476) > > at > >org.cdlib.xtf.textIndexer.SrcTreeProcessor.processDir(SrcTreeProces > sor.java:531) > > at > >org.cdlib.xtf.textIndexer.SrcTreeProcessor.processDir(SrcTreeProces > sor.java :531) > > at > >org.cdlib.xtf.textIndexer.SrcTreeProcessor.processDir(SrcTreeProces > sor.java:531) > > at > >org.cdlib.xtf.textIndexer.SrcTreeProcessor.processDir(SrcTreeProces > sor.java:531) > > at > org.cdlib.xtf.textIndexer.TextIndexer.main(TextIndexer.java:328) > > > > > >------------------------------------------------------------------------- > >Take Surveys. Earn Cash. Influence the Future of IT > >Join SourceForge.net's Techsay panel and you'll get the chance to share your > >opinions on IT & business topics through brief surveys-and earn cash > > http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV > >_______________________________________________ > >Xtf-user mailing list > ><mailto:Xtf...@li...> Xtf...@li... > ><https://lists.sourceforge.net/lists/listinfo/xtf-user>https://list > s.sourceforge.net/lists/listinfo/xtf-user > > >------------------------------------------------------------------------- >Take Surveys. Earn Cash. Influence the Future of IT >Join SourceForge.net's Techsay panel and you'll get the chance to share your >opinions on IT & business topics through brief surveys-and earn cash ><http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV>http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV >_______________________________________________ >Xtf-user mailing list ><mailto:Xtf...@li...>Xtf...@li... >https://lists.sourceforge.net/lists/listinfo/xtf-user > > >------------------------------------------------------------------------- >Take Surveys. Earn Cash. Influence the Future of IT >Join SourceForge.net's Techsay panel and you'll get the chance to share your >opinions on IT & business topics through brief surveys-and earn cash >http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV >_______________________________________________ >Xtf-user mailing list >Xtf...@li... >https://lists.sourceforge.net/lists/listinfo/xtf-user |