Update of /cvsroot/htmlparser/htmlparser/docs
In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv14464/docs
Modified Files:
changes.txt release.txt
Log Message:
Update version to 1.5-20050306
Index: release.txt
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/docs/release.txt,v
retrieving revision 1.66
retrieving revision 1.67
diff -C2 -d -r1.66 -r1.67
*** release.txt 13 Feb 2005 22:45:46 -0000 1.66
--- release.txt 7 Mar 2005 02:57:34 -0000 1.67
***************
*** 1,3 ****
! HTMLParser Version 1.5 (Integration Build Feb 13, 2005)
*********************************************
--- 1,3 ----
! HTMLParser Version 1.5 (Integration Build Mar 06, 2005)
*********************************************
***************
*** 32,35 ****
--- 32,39 ----
handles proxies, passwords and cookies. Some testing still needed.
Also removed some line separator cruft.
+ Added parseCDATA to the Lexer, used in script and style scanners.
+ Note that this is significantly new behaviour that now adheres to appendix
+ B.3.2 Specifying non-HTML data of the HTML reference:
+ http://www.w3.org/TR/html4/appendix/notes.html#notes-specifying-data
Configuration Management
Removed the need for the Translate class to be packaged with htmllexer.jar.
***************
*** 69,72 ****
--- 73,83 ----
Bug Fixes
---------
+ 1153508 CVS sources do not compile
+ 1104627 Parser Crash reading javascript
+ 1061869 Crashing when trying to capture link to XLS document
+ 1056438 Byte Order Mark
+ 1044707 mark()/reset() issues
+ 1024045 StringBean crashes on an URL
+ 1021925 StyleTag with missing linefeed prevents page from parsing
1005409 Input file not free by parser.
998195 SiteCatpurer just crashed
***************
*** 115,118 ****
--- 126,130 ----
[34] Jon Gillette
[35] Enrico Triolo
+ [36] Dave Anderson
If you find any bugs, please go to
Index: changes.txt
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/docs/changes.txt,v
retrieving revision 1.203
retrieving revision 1.204
diff -C2 -d -r1.203 -r1.204
*** changes.txt 13 Feb 2005 22:45:46 -0000 1.203
--- changes.txt 7 Mar 2005 02:57:34 -0000 1.204
***************
*** 16,19 ****
--- 16,76 ----
*******************************************************************************
+ Integration Build 1.5 - 20050306
+ --------------------------------
+ 2005-03-06 21:18 derrickoswald
+
+ * src/org/htmlparser/: lexer/Lexer.java, lexer/Page.java,
+ scanners/ScriptScanner.java, scanners/StyleScanner.java,
+ tests/scannersTests/ScriptScannerTest.java:
+
+ Bug #1104627 Parser Crash reading javascript
+ Bug #1024045 StringBean crashes on an URL
+ Bug #1021925 StyleTag with missing linefeed prevents page from parsing
+ Corrected operation with script and style scanners to recognize the ETAGO
+ when parsing CDATA -- see http://www.w3.org/TR/html4/appendix/notes.html#notes-specifying-data.
+ Original solution to bug #741769 ScriptScanner doesn't handle quoted </script> tags,
+ was erroneous; it should have been recognized as faulty HTML.
+ Several test cases changed to follow this advice:
+ "Authors should therefore escape "</" within the content."
+
+ 2005-03-06 16:46 derrickoswald
+
+ * src/org/htmlparser/: lexer/InputStreamSource.java,
+ tests/lexerTests/LexerTests.java:
+
+ Bug #1044707 mark()/reset() issues
+ Added wrapping with a org.htmlparser.lexer.Stream if markSupported
+ returns false on the InputStream passed to InputStreamSource constructor.
+ Added better error message when reset fails in setEncoding().
+
+ 2005-03-04 10:57 derrickoswald
+
+ *
+ src/org/htmlparser/parserapplications/filterbuilder/FilterBuilder.java:
+
+ Bug #1153508 CVS sources do not compile
+ Repaired sources so it would compile with Java 1.4.
+
+ 2005-02-14 19:41 derrickoswald
+
+ * src/org/htmlparser/lexer/InputStreamSource.java:
+
+ Bug #1056438 Byte Order Mark
+ Not a solution, just a better error message.
+
+ 2005-02-14 18:54 derrickoswald
+
+ * docs/contributors.html:
+
+ Add David Anderson to contributors list.
+
+ 2005-02-14 18:49 derrickoswald
+
+ * src/org/htmlparser/parserapplications/SiteCapturer.java:
+
+ Implement suggested change for
+ bug #1061869 Crashing when trying to capture link to XLS document
+ checking for null from getContentType().
+
Integration Build 1.5 - 20050213
--------------------------------
|