From: Martin H. <m1...@sn...> - 2006-08-29 19:44:56
|
Jakob, You make a good point. It's a bit arbitrary for the Java code to be making this decision, when it should be the docSelector. I'm not sure quite how it should be accomplished, but I have two ideas: (1) XTF could pass an extra attribute for each <directory> tag it passes to the docSelector. The attribute would indicate whether any files had been processed in the parent directory, and the default docSelector would simply skip such directories, but you could then change that behavior. or (2) The docSelector could specify an extra attribute on its output <indexFIles> tag indicating whether sub-directories should be traversed. I'm leaning toward option 2, but am open to anybody's preferences. --Martin On 8/29/06, Jakob Saternus <ja...@uu...> wrote: > > > Hello, > > I've found something rather confusing about the TextIndexer. > > This fragment is taken from > org\cdlib\xtf\textIndexer\SrcTreeProcessor.java > > line 520 > // If we found any files to process, the convention is that > subdirectories > // contain file related to the ones we processed, and that they > shouldn't > // be processed individually. > // > > if( anyProcessed ) > return; > > // Didn't find any files to process. Try sub-directories. > > Which means that the tree walker /never/ looks into any subdirectories of > those directories that contain any files that already have been processed. > This makes it impossible to index a structure like this: > > /dir1/series1.xml > /dir1/series1/subseries1.xml > /dir1/series1/subseries2.xml > > ..and it should be possible to control such behaviour in the > documentSelector.xsl: > > <quote from=" > http://xtf.sourceforge.net/WebDocs/HTML/XTF_Programming_Guide/XTFProgGuide.html#textIndexer_DocSelector_Prog > "> > It is the responsibility of the Document Selector XSLT code to output > an XML fragment that identifies which of the files in the directory should > be indexed. > </quote> > > > /Jakob > > ------------------------------------------------------------------------- > Using Tomcat but need to do more? Need to support web services, security? > Get stuff done quickly with pre-integrated technology to make your job > easier > Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo > http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 > _______________________________________________ > Xtf-user mailing list > Xtf...@li... > https://lists.sourceforge.net/lists/listinfo/xtf-user > |