[Htmlparser-cvs] htmlparser/docs changes.txt,1.204,1.205 release.txt,1.67,1.68
Brought to you by:
derrickoswald
From: Derrick O. <der...@us...> - 2005-03-13 15:36:26
|
Update of /cvsroot/htmlparser/htmlparser/docs In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv28181/docs Modified Files: changes.txt release.txt Log Message: Update version to 1.5-20050313. Index: release.txt =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/release.txt,v retrieving revision 1.67 retrieving revision 1.68 diff -C2 -d -r1.67 -r1.68 *** release.txt 7 Mar 2005 02:57:34 -0000 1.67 --- release.txt 13 Mar 2005 15:36:10 -0000 1.68 *************** *** 1,3 **** ! HTMLParser Version 1.5 (Integration Build Mar 06, 2005) ********************************************* --- 1,3 ---- ! HTMLParser Version 1.5 (Integration Build Mar 13, 2005) ********************************************* *************** *** 5,11 **** ---------------------------- (i) jar files - lib directory ! HTML Parser jars: htmlparser.jar, lexer.jar and thumbelina.jar. ! Also thirdparty jar files checkstyle-all-3.1.jar, commons-logging.jar, ! fit.jar and junit.jar. (ii) source code - src.zip --- 5,11 ---- ---------------------------- (i) jar files - lib directory ! HTML Parser jars: htmlparser.jar, lexer.jar, thumbelina.jar and ! filterbuilder.jar. ! Also thirdparty jar files checkstyle-all-3.1.jar, fit.jar and junit.jar. (ii) source code - src.zip *************** *** 41,45 **** --- 41,54 ---- Updated the logo and included the LGPL license. Fixed the Windows batch files. + Added optional "classes" property to build.xml. This directory is where + class files are put. It defaults to src. + To use: + ant -Dclasses=classdir <target> + where classdir is/will-be a peer directory to src. Refactoring + Added static STRICT flag to ScriptScanner to revert to legacy handling of + broken ETAGO (</). If STRICT is true, scan according to HTML specification, + else if false, scan with quote smart state machine which heuristically + yields the correct parse. Obviated LinkProcessor and moved it's functionality to the Page class. Added Tag, Text and Remark interfaces and moved concrete node *************** *** 63,77 **** Enhancement Requests -------------------- ! 943593 LinkProcessor.extract(link,base) weird behaviour? ! 943197 Accept gzip / deflate content encodings ! 874000 Remove specialized tag signatures from NodeVisitor ! 1000063 FilterBean 1017249 HTML Client Doesn't Support Cookies but will follow redirect 1010586 Add support for password protected URL 1000739 Add support for proxy scenario Bug Fixes --------- 1153508 CVS sources do not compile 1104627 Parser Crash reading javascript 1061869 Crashing when trying to capture link to XLS document --- 72,90 ---- Enhancement Requests -------------------- ! 1160345 NodeList.visitAllNodesWith 1017249 HTML Client Doesn't Support Cookies but will follow redirect 1010586 Add support for password protected URL 1000739 Add support for proxy scenario + 1000063 FilterBean + 943593 LinkProcessor.extract(link,base) weird behaviour? + 943197 Accept gzip / deflate content encodings + 874000 Remove specialized tag signatures from NodeVisitor Bug Fixes --------- + 1161137 Non English Character web page + 1160010 NullPointerException in addCookies 1153508 CVS sources do not compile + 1121401 No Parsing with yahoo! 1104627 Parser Crash reading javascript 1061869 Crashing when trying to capture link to XLS document *************** *** 80,83 **** --- 93,97 ---- 1024045 StringBean crashes on an URL 1021925 StyleTag with missing linefeed prevents page from parsing + 1018884 'compile' ant task from build.xml messes up ./src directory 1005409 Input file not free by parser. 998195 SiteCatpurer just crashed Index: changes.txt =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/changes.txt,v retrieving revision 1.204 retrieving revision 1.205 diff -C2 -d -r1.204 -r1.205 *** changes.txt 7 Mar 2005 02:57:34 -0000 1.204 --- changes.txt 13 Mar 2005 15:36:08 -0000 1.205 *************** *** 16,19 **** --- 16,88 ---- ******************************************************************************* + Integration Build 1.5 - 20050313 + -------------------------------- + + 2005-03-13 09:51 derrickoswald + + * src/org/htmlparser/: lexer/Lexer.java, lexer/Page.java, + lexer/Source.java, lexerapplications/tabby/Tabby.java, + scanners/ScriptDecoder.java, tests/lexerTests/TagTests.java, + util/IteratorImpl.java: + + Bug #1121401 No Parsing with yahoo! + By default nio.charset.CharsetDecoder replaces characters it cannot + represent in the current encoding with zero, which was the value + returned by the page when the Stream reached EOF. + This changes the Page return value to (char)Source.EOF (-1) when + the end of stream is encountered. + + 2005-03-12 16:39 derrickoswald + + * src/org/htmlparser/beans/: BeanyBaby.java, LinkBean.java: + + Fix bean example, stop sharing connections. + + 2005-03-12 15:27 derrickoswald + + * build.xml, lib/commons-logging.jar: + + Bug #1018884 'compile' ant task from build.xml messes up ./src directory + Added optional "classes" property to build.xml. + This directory is where class files are put. It defaults to src. + To use: + build -Dclasses=classdir <target> + where classdir is a peer directory to src. + Removed unused commons-logging.jar while I was in there. + + 2005-03-12 12:53 derrickoswald + + * src/org/htmlparser/: lexer/Lexer.java, + scanners/ScriptScanner.java, + tests/scannersTests/ScriptScannerTest.java: + + Add STRICT flag to ScriptScanner to revert to legacy handling of broken ETAGO (</). + If STRICT is true, scan according to HTML specification, else if false, scan with + quote smart state machine which heuristically yields the correct parse. + + 2005-03-12 08:39 derrickoswald + + * src/org/htmlparser/: + tests/visitorsTests/UrlModifyingVisitorTest.java, + util/NodeList.java: + + RFE #1160345 NodeList.visitAllNodesWith + Added visitAllNodesWith to the NodeList class. + + 2005-03-12 07:52 derrickoswald + + * src/org/htmlparser/: beans/StringBean.java, + tests/utilTests/AllTests.java, tests/utilTests/NonEnglishTest.java: + + Bug #1161137 Non English Character web page + Reinitialize the string buffer after encoding change exception processing. + + 2005-03-12 06:52 derrickoswald + + * src/org/htmlparser/http/ConnectionManager.java: + + Bug #1160010 NullPointerException in addCookies + Add test for null expiry date. + Integration Build 1.5 - 20050306 -------------------------------- |