[Htmlparser-cvs] htmlparser/docs main.html,1.6,1.7 changes.txt,1.179,1.180
Brought to you by:
derrickoswald
From: <der...@us...> - 2003-05-25 22:19:47
|
Update of /cvsroot/htmlparser/htmlparser/docs In directory sc8-pr-cvs1:/tmp/cvs-serv10357/htmlparser/docs Modified Files: main.html changes.txt Log Message: Update main html page and changelog. Index: main.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/main.html,v retrieving revision 1.6 retrieving revision 1.7 diff -C2 -d -r1.6 -r1.7 *** main.html 24 Dec 2002 06:05:37 -0000 1.6 --- main.html 25 May 2003 22:19:44 -0000 1.7 *************** *** 3,30 **** <head> <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <meta name="Author" content="Somik Raha"> ! <meta name="GENERATOR" content="Mozilla/4.61 [en] (WinNT; I) [Netscape]"> <title>HTMLParser Main</title> </head> ! <body text="#000000" bgcolor="#FFFFFF" link="#3333FF" vlink="#FF6600" alink="#FFCC00"> ! ! <h1> <span ! style="FONT-SIZE: 18pt; FONT-WEIGHT: normal; mso-bidi-font-size: 16.0pt">HTMLParser ! v 1.2<O:P></O:P></span></h1> ! ! <div ! class=MsoNormal><O:P></O:P></div> ! <div class=MsoNormal>Welcome to the homepage of HTMLParser - a super-fast real-time parser for real-world HTML. What has attracted most users to HTMLParser has been its simplicity in design, speed and ability to handle streaming real-world html. ! <p>The production release of HTMLParser 1.2 is out! ! <p>You can download it from the <a href="http://sourceforge.net/project/showfiles.php?group_id=24399&release_id=129477">download ! page</a>. <p>Before you bother downloading the parser, you would probably want to check our "simple design" claim. Check <a href="samples/index.html">sample programs</a> to see how simple it is to parse HTML using HTMLParser. ! <p> <a href="http://sourceforge.net"><img src="http://sourceforge.net/sflogo.php?group_id=24399&type=1" width="88" height="31" border="0" alt="SourceForge.net Logo"></a> </div> </body> --- 3,23 ---- <head> <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <meta name="Author" content="Derrick Oswald"> <title>HTMLParser Main</title> </head> ! <body> ! <h1>HTMLParser v 1.3</span></h1> ! <div>Welcome to the homepage of HTMLParser - a super-fast real-time parser for real-world HTML. What has attracted most users to HTMLParser has been its simplicity in design, speed and ability to handle streaming real-world html. ! <p>The production release of HTMLParser 1.3 is available! ! You can download it from the <a href="http://sourceforge.net/project/showfiles.php?group_id=24399&release_id=161563">download page</a>. <p>Before you bother downloading the parser, you would probably want to check our "simple design" claim. Check <a href="samples/index.html">sample programs</a> to see how simple it is to parse HTML using HTMLParser. ! <p> <a href="http://sourceforge.net/projects/htmlparser"> ! <img src="http://sourceforge.net/sflogo.php?group_id=24399&type=1" width="88" height="31" border="0" alt="SourceForge.net Logo"></a> </div> </body> Index: changes.txt =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/changes.txt,v retrieving revision 1.179 retrieving revision 1.180 diff -C2 -d -r1.179 -r1.180 *** changes.txt 19 May 2003 02:49:56 -0000 1.179 --- changes.txt 25 May 2003 22:19:44 -0000 1.180 *************** *** 13,16 **** --- 13,98 ---- ******************************************************************************* + Release Build 1.3 - 20030525 + -------------------------------- + 2003-05-25 15:30 derrickoswald + + * docs/docs/: CustomTagExtraction.html, + FrequentlyAskedQuestions.html, PatternStories.html, + TagFindingVisitor.html, WritingYourOwnScanners.html: + + Updated Wiki docs. + + 2003-05-25 14:58 derrickoswald + + * src/org/htmlparser/tests/utilTests/NodeListTest.java: + + Removed bogus assert in NodeListTest. + + 2003-05-25 09:12 derrickoswald + + * src/org/htmlparser/util/NodeList.java: + + Fixed off-by-one error in NodeList.remove(). + + 2003-05-24 17:04 derrickoswald + + * src/org/htmlparser/: NodeReader.java, + parserHelper/CompositeTagScannerHelper.java, + parserHelper/StringParser.java, scanners/CompositeTagScanner.java, + scanners/ScriptScanner.java, + tests/parserHelperTests/CompositeTagScannerHelperTest.java, + tests/scannersTests/ScriptScannerTest.java: + + Fixed bug #741769 ScriptScanner doesn't handle quoted </script> tags + Major overhaul of ScriptScanner. + It now uses the scan() method of CompositeTagScanner (i.e. doesn't override). + CompositeTagScanner now has a balance_quotes member field that dictates + whether strings tags are scanned honouring single and double quotes. + This affected the call chain through NodeReader and StringScanner which + now have this parameter. + StringScanner now correctly handles quotes if asked. The ignoreState stuff is removed, + it didn't work anyway since a single StringScanner is used recursively by the NodeReader, + and the member field would have been tromped. + Sorry to all those who have broken code because of this, but it's for the better. Really. + + 2003-05-24 06:15 derrickoswald + + * src/org/htmlparser/: tests/scannersTests/TableScannerTest.java, + scanners/TableColumnScanner.java, scanners/TableRowScanner.java: + + Fixed bug #742254 Nested <TR> &<TD> tags should not be allowed. + + 2003-05-23 22:05 derrickoswald + + * src/org/htmlparser/tests/ParserTestCase.java: + + Correct a typo. + + 2003-05-21 20:36 polarys + + * src/org/htmlparser/: parserHelper/TagParser.java, tags/Tag.java: + + - Tag now remembers (in tagLines) *all* the lines spanned by the tag (not just the last one). + - Tag now remembers line number on which tag starts (from TagData). + - Tag now has new public methods: + int getTagStartLine() + int getTagEndLine() + String[] getTagLines() + - TagParser now gives Tag (via constructor and setTagLine) sufficient info to support the above. + + 2003-05-21 08:00 derrickoswald + + * src/org/htmlparser/tests/tagTests/TagTest.java: + + See bug #740411 setParsed() has no effect on output. + Seems to work in the current 1.3 code base. + + 2003-05-21 07:13 derrickoswald + + * src/org/htmlparser/tests/tagTests/TagTest.java: + + See bug #741026 registerScanners() mangles output HTML badly. + Seems to work in the current 1.3 code base. + Integration Build 1.3 - 20030518 -------------------------------- |