From: <go...@us...> - 2003-09-03 01:51:11
Update of /cvsroot/archive-crawler/ArchiveOpenCrawler/src/org/archive/crawler/extractor
In directory sc8-pr-cvs1:/tmp/cvs-serv27258/src/org/archive/crawler/extractor

Modified Files:
    ExtractorHTML.java
Log Message:
added proper NOT, adjusted substring begin index

Index: ExtractorHTML.java
===================================================================
RCS file: /cvsroot/archive-crawler/ArchiveOpenCrawler/src/org/archive/crawler/extractor/ExtractorHTML.java,v
retrieving revision 1.11
retrieving revision 1.12
diff -C2 -d -r1.11 -r1.12
*** ExtractorHTML.java 26 Aug 2003 00:16:51 -0000 1.11
--- ExtractorHTML.java 3 Sep 2003 01:51:05 -0000 1.12
***************
*** 299,303 ****
          return true;
      }
!     return NON_HTML_PATH_EXTENSION.matcher(path.substring(dot)).matches();
  }
--- 299,304 ----
          return true;
      }
!     String ext = path.substring(dot+1);
!     return ! NON_HTML_PATH_EXTENSION.matcher(ext).matches();
  }
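
For readers following the log message, here is a minimal, self-contained sketch of the two changes in revision 1.12. It is not the actual ExtractorHTML code. The class name, the method name isHtmlExpected, the lastIndexOf lookup, and the pattern value are all assumptions made for illustration, since the diff shows neither the enclosing method nor the definition of NON_HTML_PATH_EXTENSION. What it does show is that substring(dot+1) yields the extension without its leading dot, and that the result of matches() is negated, so a path whose extension is on the non-HTML list gives false.

```java
import java.util.regex.Pattern;

class ExtensionCheckSketch {
    // Hypothetical pattern: the real NON_HTML_PATH_EXTENSION is not shown in the diff.
    static final Pattern NON_HTML_PATH_EXTENSION =
            Pattern.compile("(?i)(gif|jpe?g|png|pdf|zip)");

    // Hypothetical method name; returns true when the path may still be HTML.
    static boolean isHtmlExpected(String path) {
        int dot = path.lastIndexOf('.'); // assumed way of locating the extension
        if (dot < 0) {
            return true; // no extension at all: assume HTML is possible
        }
        // substring(dot) would yield ".gif"; dot + 1 skips the dot and yields "gif".
        String ext = path.substring(dot + 1);
        // Negated: a match against the non-HTML list means HTML is NOT expected.
        return !NON_HTML_PATH_EXTENSION.matcher(ext).matches();
    }

    public static void main(String[] args) {
        System.out.println(isHtmlExpected("/images/logo.gif")); // false
        System.out.println(isHtmlExpected("/index.html"));      // true
    }
}
```

Because Matcher.matches() requires the whole input to match, whether the old substring(dot) form could ever match depended on whether the real pattern allowed a leading dot, which this diff does not reveal.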