htmlparser-cvs Mailing List for HTML Parser (Page 33)
Brought to you by:
derrickoswald
You can subscribe to this list here.
2003 |
Jan
|
Feb
|
Mar
|
Apr
|
May
(141) |
Jun
(108) |
Jul
(66) |
Aug
(127) |
Sep
(155) |
Oct
(149) |
Nov
(72) |
Dec
(72) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2004 |
Jan
(100) |
Feb
(36) |
Mar
(21) |
Apr
(3) |
May
(87) |
Jun
(28) |
Jul
(84) |
Aug
(5) |
Sep
(14) |
Oct
|
Nov
|
Dec
|
2005 |
Jan
(1) |
Feb
(39) |
Mar
(26) |
Apr
(38) |
May
(14) |
Jun
(10) |
Jul
|
Aug
|
Sep
(13) |
Oct
(8) |
Nov
(10) |
Dec
|
2006 |
Jan
|
Feb
(1) |
Mar
(17) |
Apr
(20) |
May
(28) |
Jun
(24) |
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2015 |
Jan
|
Feb
|
Mar
(1) |
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
From: <der...@us...> - 2003-10-27 02:18:23
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer In directory sc8-pr-cvs1:/tmp/cvs-serv25308/lexer Modified Files: Page.java Log Message: Some speed improvements; passing tags to evaluate, creating strings without string buffers, etc. Index: Page.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/Page.java,v retrieving revision 1.21 retrieving revision 1.22 diff -C2 -d -r1.21 -r1.22 *** Page.java 26 Oct 2003 19:46:18 -0000 1.21 --- Page.java 27 Oct 2003 02:18:04 -0000 1.22 *************** *** 715,724 **** public String getText (int start, int end) { ! StringBuffer ret; ! ! ret = new StringBuffer (Math.abs (end - start)); ! getText (ret, start, end); ! ! return (ret.toString ()); } --- 715,719 ---- public String getText (int start, int end) { ! return (new String (mSource.mBuffer, start, end - start)); } *************** *** 756,765 **** public String getText () { ! StringBuffer ret; ! ! ret = new StringBuffer (mSource.mOffset); ! getText (ret); ! ! return (ret.toString ()); } --- 751,755 ---- public String getText () { ! return (new String (mSource.mBuffer, 0, mSource.mOffset)); } |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/nodeDecoratorTests In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/tests/nodeDecoratorTests Modified Files: AllTests.java DecodingNodeTest.java EscapeCharacterRemovingNodeTest.java NonBreakingSpaceConvertingNodeTest.java Log Message: Update version headers to 1.4-20031026 and update changelog. Index: AllTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/nodeDecoratorTests/AllTests.java,v retrieving revision 1.12 retrieving revision 1.13 diff -C2 -d -r1.12 -r1.13 *** AllTests.java 21 Oct 2003 02:24:00 -0000 1.12 --- AllTests.java 26 Oct 2003 19:46:25 -0000 1.13 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: DecodingNodeTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/nodeDecoratorTests/DecodingNodeTest.java,v retrieving revision 1.13 retrieving revision 1.14 diff -C2 -d -r1.13 -r1.14 *** DecodingNodeTest.java 21 Oct 2003 02:24:00 -0000 1.13 --- DecodingNodeTest.java 26 Oct 2003 19:46:26 -0000 1.14 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: EscapeCharacterRemovingNodeTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/nodeDecoratorTests/EscapeCharacterRemovingNodeTest.java,v retrieving revision 1.13 retrieving revision 1.14 diff -C2 -d -r1.13 -r1.14 *** EscapeCharacterRemovingNodeTest.java 21 Oct 2003 02:24:00 -0000 1.13 --- EscapeCharacterRemovingNodeTest.java 26 Oct 2003 19:46:26 -0000 1.14 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: NonBreakingSpaceConvertingNodeTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/nodeDecoratorTests/NonBreakingSpaceConvertingNodeTest.java,v retrieving revision 1.12 retrieving revision 1.13 diff -C2 -d -r1.12 -r1.13 *** NonBreakingSpaceConvertingNodeTest.java 21 Oct 2003 02:24:00 -0000 1.12 --- NonBreakingSpaceConvertingNodeTest.java 26 Oct 2003 19:46:26 -0000 1.13 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // |
From: <der...@us...> - 2003-10-26 20:06:17
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/visitors Modified Files: HtmlPage.java LinkFindingVisitor.java NodeVisitor.java ObjectFindingVisitor.java StringFindingVisitor.java TagFindingVisitor.java TextExtractingVisitor.java UrlModifyingVisitor.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: HtmlPage.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/HtmlPage.java,v retrieving revision 1.36 retrieving revision 1.37 diff -C2 -d -r1.36 -r1.37 *** HtmlPage.java 28 Sep 2003 19:30:04 -0000 1.36 --- HtmlPage.java 26 Oct 2003 19:46:28 -0000 1.37 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: LinkFindingVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/LinkFindingVisitor.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** LinkFindingVisitor.java 26 Oct 2003 03:53:33 -0000 1.30 --- LinkFindingVisitor.java 26 Oct 2003 19:46:28 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: NodeVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/NodeVisitor.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** NodeVisitor.java 28 Sep 2003 19:30:04 -0000 1.31 --- NodeVisitor.java 26 Oct 2003 19:46:28 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ObjectFindingVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/ObjectFindingVisitor.java,v retrieving revision 1.34 retrieving revision 1.35 diff -C2 -d -r1.34 -r1.35 *** ObjectFindingVisitor.java 22 Sep 2003 02:40:16 -0000 1.34 --- ObjectFindingVisitor.java 26 Oct 2003 19:46:28 -0000 1.35 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: StringFindingVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/StringFindingVisitor.java,v retrieving revision 1.34 retrieving revision 1.35 diff -C2 -d -r1.34 -r1.35 *** StringFindingVisitor.java 22 Sep 2003 02:40:16 -0000 1.34 --- StringFindingVisitor.java 26 Oct 2003 19:46:28 -0000 1.35 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TagFindingVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/TagFindingVisitor.java,v retrieving revision 1.37 retrieving revision 1.38 diff -C2 -d -r1.37 -r1.38 *** TagFindingVisitor.java 28 Sep 2003 19:30:04 -0000 1.37 --- TagFindingVisitor.java 26 Oct 2003 19:46:28 -0000 1.38 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TextExtractingVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/TextExtractingVisitor.java,v retrieving revision 1.35 retrieving revision 1.36 diff -C2 -d -r1.35 -r1.36 *** TextExtractingVisitor.java 28 Sep 2003 19:30:04 -0000 1.35 --- TextExtractingVisitor.java 26 Oct 2003 19:46:29 -0000 1.36 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: UrlModifyingVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/UrlModifyingVisitor.java,v retrieving revision 1.34 retrieving revision 1.35 diff -C2 -d -r1.34 -r1.35 *** UrlModifyingVisitor.java 28 Sep 2003 19:30:05 -0000 1.34 --- UrlModifyingVisitor.java 26 Oct 2003 19:46:29 -0000 1.35 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/package.html,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** package.html 22 Sep 2003 02:40:16 -0000 1.15 --- package.html 26 Oct 2003 19:46:29 -0000 1.16 *************** *** 6,10 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 6,10 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
From: <der...@us...> - 2003-10-26 20:05:51
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/utilTests In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/tests/utilTests Modified Files: AllTests.java BeanTest.java CharacterTranslationTest.java HTMLLinkProcessorTest.java HTMLParserUtilsTest.java NodeListTest.java SortTest.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: AllTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/utilTests/AllTests.java,v retrieving revision 1.50 retrieving revision 1.51 diff -C2 -d -r1.50 -r1.51 *** AllTests.java 21 Oct 2003 02:24:01 -0000 1.50 --- AllTests.java 26 Oct 2003 19:46:27 -0000 1.51 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BeanTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/utilTests/BeanTest.java,v retrieving revision 1.44 retrieving revision 1.45 diff -C2 -d -r1.44 -r1.45 *** BeanTest.java 21 Oct 2003 02:24:01 -0000 1.44 --- BeanTest.java 26 Oct 2003 19:46:27 -0000 1.45 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: CharacterTranslationTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/utilTests/CharacterTranslationTest.java,v retrieving revision 1.33 retrieving revision 1.34 diff -C2 -d -r1.33 -r1.34 *** CharacterTranslationTest.java 21 Oct 2003 02:24:01 -0000 1.33 --- CharacterTranslationTest.java 26 Oct 2003 19:46:27 -0000 1.34 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: HTMLLinkProcessorTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/utilTests/HTMLLinkProcessorTest.java,v retrieving revision 1.47 retrieving revision 1.48 diff -C2 -d -r1.47 -r1.48 *** HTMLLinkProcessorTest.java 21 Oct 2003 02:24:01 -0000 1.47 --- HTMLLinkProcessorTest.java 26 Oct 2003 19:46:27 -0000 1.48 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: HTMLParserUtilsTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/utilTests/HTMLParserUtilsTest.java,v retrieving revision 1.11 retrieving revision 1.12 diff -C2 -d -r1.11 -r1.12 *** HTMLParserUtilsTest.java 21 Oct 2003 02:24:01 -0000 1.11 --- HTMLParserUtilsTest.java 26 Oct 2003 19:46:27 -0000 1.12 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: NodeListTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/utilTests/NodeListTest.java,v retrieving revision 1.20 retrieving revision 1.21 diff -C2 -d -r1.20 -r1.21 *** NodeListTest.java 21 Oct 2003 02:24:01 -0000 1.20 --- NodeListTest.java 26 Oct 2003 19:46:27 -0000 1.21 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: SortTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/utilTests/SortTest.java,v retrieving revision 1.7 retrieving revision 1.8 diff -C2 -d -r1.7 -r1.8 *** SortTest.java 21 Oct 2003 02:24:01 -0000 1.7 --- SortTest.java 26 Oct 2003 19:46:27 -0000 1.8 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/utilTests/package.html,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** package.html 22 Sep 2003 02:40:14 -0000 1.15 --- package.html 26 Oct 2003 19:46:28 -0000 1.16 *************** *** 6,10 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 6,10 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/tests/tagTests Modified Files: AllTests.java AppletTagTest.java BaseHrefTagTest.java BodyTagTest.java CompositeTagTest.java DoctypeTagTest.java EndTagTest.java FormTagTest.java FrameSetTagTest.java FrameTagTest.java ImageTagTest.java InputTagTest.java JspTagTest.java LinkTagTest.java MetaTagTest.java ObjectCollectionTest.java OptionTagTest.java ScriptTagTest.java SelectTagTest.java StyleTagTest.java TagTest.java TextareaTagTest.java TitleTagTest.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: AllTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/AllTests.java,v retrieving revision 1.45 retrieving revision 1.46 diff -C2 -d -r1.45 -r1.46 *** AllTests.java 21 Oct 2003 02:24:01 -0000 1.45 --- AllTests.java 26 Oct 2003 19:46:27 -0000 1.46 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: AppletTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/AppletTagTest.java,v retrieving revision 1.32 retrieving revision 1.33 diff -C2 -d -r1.32 -r1.33 *** AppletTagTest.java 21 Oct 2003 02:24:01 -0000 1.32 --- AppletTagTest.java 26 Oct 2003 19:46:27 -0000 1.33 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BaseHrefTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/BaseHrefTagTest.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** BaseHrefTagTest.java 21 Oct 2003 02:24:01 -0000 1.31 --- BaseHrefTagTest.java 26 Oct 2003 19:46:27 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BodyTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/BodyTagTest.java,v retrieving revision 1.14 retrieving revision 1.15 diff -C2 -d -r1.14 -r1.15 *** BodyTagTest.java 21 Oct 2003 02:24:01 -0000 1.14 --- BodyTagTest.java 26 Oct 2003 19:46:27 -0000 1.15 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: CompositeTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/CompositeTagTest.java,v retrieving revision 1.10 retrieving revision 1.11 diff -C2 -d -r1.10 -r1.11 *** CompositeTagTest.java 21 Oct 2003 02:24:01 -0000 1.10 --- CompositeTagTest.java 26 Oct 2003 19:46:27 -0000 1.11 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: DoctypeTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/DoctypeTagTest.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** DoctypeTagTest.java 21 Oct 2003 02:24:01 -0000 1.30 --- DoctypeTagTest.java 26 Oct 2003 19:46:27 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: EndTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/EndTagTest.java,v retrieving revision 1.32 retrieving revision 1.33 diff -C2 -d -r1.32 -r1.33 *** EndTagTest.java 21 Oct 2003 02:24:01 -0000 1.32 --- EndTagTest.java 26 Oct 2003 19:46:27 -0000 1.33 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FormTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/FormTagTest.java,v retrieving revision 1.36 retrieving revision 1.37 diff -C2 -d -r1.36 -r1.37 *** FormTagTest.java 21 Oct 2003 02:24:01 -0000 1.36 --- FormTagTest.java 26 Oct 2003 19:46:27 -0000 1.37 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FrameSetTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/FrameSetTagTest.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** FrameSetTagTest.java 21 Oct 2003 02:24:01 -0000 1.31 --- FrameSetTagTest.java 26 Oct 2003 19:46:27 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FrameTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/FrameTagTest.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** FrameTagTest.java 21 Oct 2003 02:24:01 -0000 1.31 --- FrameTagTest.java 26 Oct 2003 19:46:27 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ImageTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/ImageTagTest.java,v retrieving revision 1.33 retrieving revision 1.34 diff -C2 -d -r1.33 -r1.34 *** ImageTagTest.java 21 Oct 2003 02:24:01 -0000 1.33 --- ImageTagTest.java 26 Oct 2003 19:46:27 -0000 1.34 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: InputTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/InputTagTest.java,v retrieving revision 1.34 retrieving revision 1.35 diff -C2 -d -r1.34 -r1.35 *** InputTagTest.java 21 Oct 2003 02:24:01 -0000 1.34 --- InputTagTest.java 26 Oct 2003 19:46:27 -0000 1.35 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: JspTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/JspTagTest.java,v retrieving revision 1.36 retrieving revision 1.37 diff -C2 -d -r1.36 -r1.37 *** JspTagTest.java 25 Oct 2003 20:19:44 -0000 1.36 --- JspTagTest.java 26 Oct 2003 19:46:27 -0000 1.37 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: LinkTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/LinkTagTest.java,v retrieving revision 1.39 retrieving revision 1.40 diff -C2 -d -r1.39 -r1.40 *** LinkTagTest.java 21 Oct 2003 02:24:01 -0000 1.39 --- LinkTagTest.java 26 Oct 2003 19:46:27 -0000 1.40 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: MetaTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/MetaTagTest.java,v retrieving revision 1.33 retrieving revision 1.34 diff -C2 -d -r1.33 -r1.34 *** MetaTagTest.java 21 Oct 2003 02:24:01 -0000 1.33 --- MetaTagTest.java 26 Oct 2003 19:46:27 -0000 1.34 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ObjectCollectionTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/ObjectCollectionTest.java,v retrieving revision 1.14 retrieving revision 1.15 diff -C2 -d -r1.14 -r1.15 *** ObjectCollectionTest.java 21 Oct 2003 02:24:01 -0000 1.14 --- ObjectCollectionTest.java 26 Oct 2003 19:46:27 -0000 1.15 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: OptionTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/OptionTagTest.java,v retrieving revision 1.34 retrieving revision 1.35 diff -C2 -d -r1.34 -r1.35 *** OptionTagTest.java 26 Oct 2003 03:53:33 -0000 1.34 --- OptionTagTest.java 26 Oct 2003 19:46:27 -0000 1.35 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ScriptTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/ScriptTagTest.java,v retrieving revision 1.35 retrieving revision 1.36 diff -C2 -d -r1.35 -r1.36 *** ScriptTagTest.java 21 Oct 2003 02:24:01 -0000 1.35 --- ScriptTagTest.java 26 Oct 2003 19:46:27 -0000 1.36 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: SelectTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/SelectTagTest.java,v retrieving revision 1.34 retrieving revision 1.35 diff -C2 -d -r1.34 -r1.35 *** SelectTagTest.java 25 Oct 2003 20:19:44 -0000 1.34 --- SelectTagTest.java 26 Oct 2003 19:46:27 -0000 1.35 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: StyleTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/StyleTagTest.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** StyleTagTest.java 21 Oct 2003 02:24:01 -0000 1.30 --- StyleTagTest.java 26 Oct 2003 19:46:27 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/TagTest.java,v retrieving revision 1.50 retrieving revision 1.51 diff -C2 -d -r1.50 -r1.51 *** TagTest.java 25 Oct 2003 20:19:44 -0000 1.50 --- TagTest.java 26 Oct 2003 19:46:27 -0000 1.51 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TextareaTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/TextareaTagTest.java,v retrieving revision 1.32 retrieving revision 1.33 diff -C2 -d -r1.32 -r1.33 *** TextareaTagTest.java 21 Oct 2003 02:24:01 -0000 1.32 --- TextareaTagTest.java 26 Oct 2003 19:46:27 -0000 1.33 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TitleTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/TitleTagTest.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** TitleTagTest.java 21 Oct 2003 02:24:01 -0000 1.29 --- TitleTagTest.java 26 Oct 2003 19:46:27 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/package.html,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** package.html 22 Sep 2003 02:40:13 -0000 1.15 --- package.html 26 Oct 2003 19:46:27 -0000 1.16 *************** *** 6,10 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 6,10 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/tests/scannersTests Modified Files: AllTests.java AppletScannerTest.java BaseHREFScannerTest.java BodyScannerTest.java BulletListScannerTest.java BulletScannerTest.java CompositeTagScannerTest.java DivScannerTest.java FormScannerTest.java FrameScannerTest.java FrameSetScannerTest.java HeadScannerTest.java HtmlTest.java ImageScannerTest.java InputTagScannerTest.java JspScannerTest.java LabelScannerTest.java LinkScannerTest.java MetaTagScannerTest.java OptionTagScannerTest.java ScriptScannerTest.java SelectTagScannerTest.java SpanScannerTest.java StyleScannerTest.java TableScannerTest.java TagScannerTest.java TextareaTagScannerTest.java TitleScannerTest.java XmlEndTagScanningTest.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: AllTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/AllTests.java,v retrieving revision 1.50 retrieving revision 1.51 diff -C2 -d -r1.50 -r1.51 *** AllTests.java 21 Oct 2003 02:24:01 -0000 1.50 --- AllTests.java 26 Oct 2003 19:46:26 -0000 1.51 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // *************** *** 19,23 **** // Email :so...@ki... // ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 19,23 ---- // Email :so...@ki... // ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: AppletScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/AppletScannerTest.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** AppletScannerTest.java 21 Oct 2003 02:24:01 -0000 1.29 --- AppletScannerTest.java 26 Oct 2003 19:46:26 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BaseHREFScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/BaseHREFScannerTest.java,v retrieving revision 1.28 retrieving revision 1.29 diff -C2 -d -r1.28 -r1.29 *** BaseHREFScannerTest.java 21 Oct 2003 02:24:01 -0000 1.28 --- BaseHREFScannerTest.java 26 Oct 2003 19:46:26 -0000 1.29 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BodyScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/BodyScannerTest.java,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** BodyScannerTest.java 21 Oct 2003 02:24:01 -0000 1.15 --- BodyScannerTest.java 26 Oct 2003 19:46:26 -0000 1.16 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BulletListScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/BulletListScannerTest.java,v retrieving revision 1.11 retrieving revision 1.12 diff -C2 -d -r1.11 -r1.12 *** BulletListScannerTest.java 21 Oct 2003 02:24:01 -0000 1.11 --- BulletListScannerTest.java 26 Oct 2003 19:46:26 -0000 1.12 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BulletScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/BulletScannerTest.java,v retrieving revision 1.12 retrieving revision 1.13 diff -C2 -d -r1.12 -r1.13 *** BulletScannerTest.java 26 Oct 2003 03:53:33 -0000 1.12 --- BulletScannerTest.java 26 Oct 2003 19:46:26 -0000 1.13 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: CompositeTagScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/CompositeTagScannerTest.java,v retrieving revision 1.44 retrieving revision 1.45 diff -C2 -d -r1.44 -r1.45 *** CompositeTagScannerTest.java 25 Oct 2003 20:19:44 -0000 1.44 --- CompositeTagScannerTest.java 26 Oct 2003 19:46:26 -0000 1.45 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: DivScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/DivScannerTest.java,v retrieving revision 1.34 retrieving revision 1.35 diff -C2 -d -r1.34 -r1.35 *** DivScannerTest.java 21 Oct 2003 02:24:01 -0000 1.34 --- DivScannerTest.java 26 Oct 2003 19:46:26 -0000 1.35 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FormScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/FormScannerTest.java,v retrieving revision 1.37 retrieving revision 1.38 diff -C2 -d -r1.37 -r1.38 *** FormScannerTest.java 21 Oct 2003 02:24:01 -0000 1.37 --- FormScannerTest.java 26 Oct 2003 19:46:26 -0000 1.38 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FrameScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/FrameScannerTest.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** FrameScannerTest.java 21 Oct 2003 02:24:01 -0000 1.29 --- FrameScannerTest.java 26 Oct 2003 19:46:26 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FrameSetScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/FrameSetScannerTest.java,v retrieving revision 1.28 retrieving revision 1.29 diff -C2 -d -r1.28 -r1.29 *** FrameSetScannerTest.java 21 Oct 2003 02:24:01 -0000 1.28 --- FrameSetScannerTest.java 26 Oct 2003 19:46:26 -0000 1.29 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: HeadScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/HeadScannerTest.java,v retrieving revision 1.18 retrieving revision 1.19 diff -C2 -d -r1.18 -r1.19 *** HeadScannerTest.java 21 Oct 2003 02:24:01 -0000 1.18 --- HeadScannerTest.java 26 Oct 2003 19:46:26 -0000 1.19 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: HtmlTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/HtmlTest.java,v retrieving revision 1.12 retrieving revision 1.13 diff -C2 -d -r1.12 -r1.13 *** HtmlTest.java 21 Oct 2003 02:24:01 -0000 1.12 --- HtmlTest.java 26 Oct 2003 19:46:26 -0000 1.13 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ImageScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/ImageScannerTest.java,v retrieving revision 1.35 retrieving revision 1.36 diff -C2 -d -r1.35 -r1.36 *** ImageScannerTest.java 21 Oct 2003 02:24:01 -0000 1.35 --- ImageScannerTest.java 26 Oct 2003 19:46:26 -0000 1.36 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: InputTagScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/InputTagScannerTest.java,v retrieving revision 1.28 retrieving revision 1.29 diff -C2 -d -r1.28 -r1.29 *** InputTagScannerTest.java 21 Oct 2003 02:24:01 -0000 1.28 --- InputTagScannerTest.java 26 Oct 2003 19:46:26 -0000 1.29 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: JspScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/JspScannerTest.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** JspScannerTest.java 21 Oct 2003 02:24:01 -0000 1.31 --- JspScannerTest.java 26 Oct 2003 19:46:26 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: LabelScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/LabelScannerTest.java,v retrieving revision 1.39 retrieving revision 1.40 diff -C2 -d -r1.39 -r1.40 *** LabelScannerTest.java 21 Oct 2003 02:24:01 -0000 1.39 --- LabelScannerTest.java 26 Oct 2003 19:46:26 -0000 1.40 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: LinkScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/LinkScannerTest.java,v retrieving revision 1.44 retrieving revision 1.45 diff -C2 -d -r1.44 -r1.45 *** LinkScannerTest.java 25 Oct 2003 20:19:44 -0000 1.44 --- LinkScannerTest.java 26 Oct 2003 19:46:26 -0000 1.45 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: MetaTagScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/MetaTagScannerTest.java,v retrieving revision 1.33 retrieving revision 1.34 diff -C2 -d -r1.33 -r1.34 *** MetaTagScannerTest.java 21 Oct 2003 02:24:01 -0000 1.33 --- MetaTagScannerTest.java 26 Oct 2003 19:46:26 -0000 1.34 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: OptionTagScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/OptionTagScannerTest.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** OptionTagScannerTest.java 21 Oct 2003 02:24:01 -0000 1.31 --- OptionTagScannerTest.java 26 Oct 2003 19:46:27 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ScriptScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/ScriptScannerTest.java,v retrieving revision 1.43 retrieving revision 1.44 diff -C2 -d -r1.43 -r1.44 *** ScriptScannerTest.java 21 Oct 2003 02:24:01 -0000 1.43 --- ScriptScannerTest.java 26 Oct 2003 19:46:27 -0000 1.44 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: SelectTagScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/SelectTagScannerTest.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** SelectTagScannerTest.java 21 Oct 2003 02:24:01 -0000 1.30 --- SelectTagScannerTest.java 26 Oct 2003 19:46:27 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: SpanScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/SpanScannerTest.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** SpanScannerTest.java 21 Oct 2003 02:24:01 -0000 1.31 --- SpanScannerTest.java 26 Oct 2003 19:46:27 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: StyleScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/StyleScannerTest.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** StyleScannerTest.java 21 Oct 2003 02:24:01 -0000 1.30 --- StyleScannerTest.java 26 Oct 2003 19:46:27 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TableScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/TableScannerTest.java,v retrieving revision 1.38 retrieving revision 1.39 diff -C2 -d -r1.38 -r1.39 *** TableScannerTest.java 21 Oct 2003 02:24:01 -0000 1.38 --- TableScannerTest.java 26 Oct 2003 19:46:27 -0000 1.39 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TagScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/TagScannerTest.java,v retrieving revision 1.33 retrieving revision 1.34 diff -C2 -d -r1.33 -r1.34 *** TagScannerTest.java 21 Oct 2003 02:24:01 -0000 1.33 --- TagScannerTest.java 26 Oct 2003 19:46:27 -0000 1.34 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TextareaTagScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/TextareaTagScannerTest.java,v retrieving revision 1.28 retrieving revision 1.29 diff -C2 -d -r1.28 -r1.29 *** TextareaTagScannerTest.java 21 Oct 2003 02:24:01 -0000 1.28 --- TextareaTagScannerTest.java 26 Oct 2003 19:46:27 -0000 1.29 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TitleScannerTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/TitleScannerTest.java,v retrieving revision 1.32 retrieving revision 1.33 diff -C2 -d -r1.32 -r1.33 *** TitleScannerTest.java 25 Oct 2003 15:46:03 -0000 1.32 --- TitleScannerTest.java 26 Oct 2003 19:46:27 -0000 1.33 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: XmlEndTagScanningTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/XmlEndTagScanningTest.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** XmlEndTagScanningTest.java 21 Oct 2003 02:24:01 -0000 1.31 --- XmlEndTagScanningTest.java 26 Oct 2003 19:46:27 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/scannersTests/package.html,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** package.html 22 Sep 2003 02:40:11 -0000 1.15 --- package.html 26 Oct 2003 19:46:27 -0000 1.16 *************** *** 6,10 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 6,10 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/util Modified Files: ChainedException.java CommandLine.java DefaultParserFeedback.java FeedbackManager.java Generate.java IteratorImpl.java LinkProcessor.java NodeIterator.java NodeList.java ParserException.java ParserFeedback.java ParserUtils.java PeekingIterator.java SimpleNodeIterator.java SpecialHashtable.java Translate.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: ChainedException.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/ChainedException.java,v retrieving revision 1.40 retrieving revision 1.41 diff -C2 -d -r1.40 -r1.41 *** ChainedException.java 22 Sep 2003 02:40:15 -0000 1.40 --- ChainedException.java 26 Oct 2003 19:46:28 -0000 1.41 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: CommandLine.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/CommandLine.java,v retrieving revision 1.38 retrieving revision 1.39 diff -C2 -d -r1.38 -r1.39 *** CommandLine.java 22 Sep 2003 02:40:15 -0000 1.38 --- CommandLine.java 26 Oct 2003 19:46:28 -0000 1.39 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: DefaultParserFeedback.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/DefaultParserFeedback.java,v retrieving revision 1.27 retrieving revision 1.28 diff -C2 -d -r1.27 -r1.28 *** DefaultParserFeedback.java 22 Sep 2003 02:40:15 -0000 1.27 --- DefaultParserFeedback.java 26 Oct 2003 19:46:28 -0000 1.28 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FeedbackManager.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/FeedbackManager.java,v retrieving revision 1.40 retrieving revision 1.41 diff -C2 -d -r1.40 -r1.41 *** FeedbackManager.java 22 Sep 2003 02:40:15 -0000 1.40 --- FeedbackManager.java 26 Oct 2003 19:46:28 -0000 1.41 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Generate.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/Generate.java,v retrieving revision 1.43 retrieving revision 1.44 diff -C2 -d -r1.43 -r1.44 *** Generate.java 28 Sep 2003 15:33:59 -0000 1.43 --- Generate.java 26 Oct 2003 19:46:28 -0000 1.44 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: IteratorImpl.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/IteratorImpl.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** IteratorImpl.java 5 Oct 2003 13:49:54 -0000 1.30 --- IteratorImpl.java 26 Oct 2003 19:46:28 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: LinkProcessor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/LinkProcessor.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** LinkProcessor.java 20 Oct 2003 01:28:04 -0000 1.29 --- LinkProcessor.java 26 Oct 2003 19:46:28 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: NodeIterator.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/NodeIterator.java,v retrieving revision 1.28 retrieving revision 1.29 diff -C2 -d -r1.28 -r1.29 *** NodeIterator.java 22 Sep 2003 02:40:15 -0000 1.28 --- NodeIterator.java 26 Oct 2003 19:46:28 -0000 1.29 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: NodeList.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/NodeList.java,v retrieving revision 1.47 retrieving revision 1.48 diff -C2 -d -r1.47 -r1.48 *** NodeList.java 20 Oct 2003 01:28:04 -0000 1.47 --- NodeList.java 26 Oct 2003 19:46:28 -0000 1.48 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ParserException.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/ParserException.java,v retrieving revision 1.25 retrieving revision 1.26 diff -C2 -d -r1.25 -r1.26 *** ParserException.java 22 Sep 2003 02:40:15 -0000 1.25 --- ParserException.java 26 Oct 2003 19:46:28 -0000 1.26 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ParserFeedback.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/ParserFeedback.java,v retrieving revision 1.26 retrieving revision 1.27 diff -C2 -d -r1.26 -r1.27 *** ParserFeedback.java 22 Sep 2003 02:40:15 -0000 1.26 --- ParserFeedback.java 26 Oct 2003 19:46:28 -0000 1.27 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ParserUtils.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/ParserUtils.java,v retrieving revision 1.32 retrieving revision 1.33 diff -C2 -d -r1.32 -r1.33 *** ParserUtils.java 2 Oct 2003 23:48:54 -0000 1.32 --- ParserUtils.java 26 Oct 2003 19:46:28 -0000 1.33 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: PeekingIterator.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/PeekingIterator.java,v retrieving revision 1.16 retrieving revision 1.17 diff -C2 -d -r1.16 -r1.17 *** PeekingIterator.java 22 Sep 2003 02:40:15 -0000 1.16 --- PeekingIterator.java 26 Oct 2003 19:46:28 -0000 1.17 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: SimpleNodeIterator.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/SimpleNodeIterator.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** SimpleNodeIterator.java 22 Sep 2003 02:40:15 -0000 1.30 --- SimpleNodeIterator.java 26 Oct 2003 19:46:28 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: SpecialHashtable.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/SpecialHashtable.java,v retrieving revision 1.1 retrieving revision 1.2 diff -C2 -d -r1.1 -r1.2 *** SpecialHashtable.java 2 Oct 2003 23:48:54 -0000 1.1 --- SpecialHashtable.java 26 Oct 2003 19:46:28 -0000 1.2 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Translate.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/Translate.java,v retrieving revision 1.36 retrieving revision 1.37 diff -C2 -d -r1.36 -r1.37 *** Translate.java 22 Sep 2003 02:40:15 -0000 1.36 --- Translate.java 26 Oct 2003 19:46:28 -0000 1.37 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/package.html,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** package.html 22 Sep 2003 02:40:15 -0000 1.15 --- package.html 26 Oct 2003 19:46:28 -0000 1.16 *************** *** 6,10 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 6,10 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/parserHelperTests In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/tests/parserHelperTests Modified Files: AllTests.java CompositeTagScannerHelperTest.java RemarkNodeParserTest.java StringParserTest.java Log Message: Update version headers to 1.4-20031026 and update changelog. Index: AllTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/parserHelperTests/AllTests.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** AllTests.java 21 Oct 2003 02:24:00 -0000 1.29 --- AllTests.java 26 Oct 2003 19:46:26 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: CompositeTagScannerHelperTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/parserHelperTests/CompositeTagScannerHelperTest.java,v retrieving revision 1.26 retrieving revision 1.27 diff -C2 -d -r1.26 -r1.27 *** CompositeTagScannerHelperTest.java 26 Oct 2003 16:04:27 -0000 1.26 --- CompositeTagScannerHelperTest.java 26 Oct 2003 19:46:26 -0000 1.27 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: RemarkNodeParserTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/parserHelperTests/RemarkNodeParserTest.java,v retrieving revision 1.38 retrieving revision 1.39 diff -C2 -d -r1.38 -r1.39 *** RemarkNodeParserTest.java 21 Oct 2003 02:24:00 -0000 1.38 --- RemarkNodeParserTest.java 26 Oct 2003 19:46:26 -0000 1.39 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: StringParserTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/parserHelperTests/StringParserTest.java,v retrieving revision 1.41 retrieving revision 1.42 diff -C2 -d -r1.41 -r1.42 *** StringParserTest.java 25 Oct 2003 20:19:44 -0000 1.41 --- StringParserTest.java 26 Oct 2003 19:46:26 -0000 1.42 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // |
From: <der...@us...> - 2003-10-26 20:05:19
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/sort In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/util/sort Modified Files: Ordered.java Sort.java Sortable.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: Ordered.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/sort/Ordered.java,v retrieving revision 1.7 retrieving revision 1.8 diff -C2 -d -r1.7 -r1.8 *** Ordered.java 22 Sep 2003 02:40:16 -0000 1.7 --- Ordered.java 26 Oct 2003 19:46:28 -0000 1.8 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Sort.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/sort/Sort.java,v retrieving revision 1.7 retrieving revision 1.8 diff -C2 -d -r1.7 -r1.8 *** Sort.java 22 Sep 2003 02:40:16 -0000 1.7 --- Sort.java 26 Oct 2003 19:46:28 -0000 1.8 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Sortable.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/sort/Sortable.java,v retrieving revision 1.7 retrieving revision 1.8 diff -C2 -d -r1.7 -r1.8 *** Sortable.java 22 Sep 2003 02:40:16 -0000 1.7 --- Sortable.java 26 Oct 2003 19:46:28 -0000 1.8 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/sort/package.html,v retrieving revision 1.6 retrieving revision 1.7 diff -C2 -d -r1.6 -r1.7 *** package.html 22 Sep 2003 02:40:16 -0000 1.6 --- package.html 26 Oct 2003 19:46:28 -0000 1.7 *************** *** 7,11 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 7,11 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
Update of /cvsroot/htmlparser/htmlparser/docs/docs In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/docs/docs Modified Files: BlockFeedback.html CollectingParameter.html CompositePattern.html CustomTagExtraction.html EmailExtraction.html EnableFeedback.html ExternalIterators.html FactoryMethod.html FeedbackMechanism.html FirstName.html FrequentlyAskedQuestions.html FullName.html ImageExtraction.html InternalIterators.html IteratorPattern.html JavaBeans.html LastName.html LinkExtraction.html ParserDesign.html ParsingXml.html PatternStories.html PostOperation.html ReverseHtml.html SamplePrograms.html SearchingForData.html SomikRaha.html StrategyPattern.html StringExtraction.html TagFindingVisitor.html TagScanner.html TemplateMethod.html TestDrivenDevelopment.html TextExtractingVisitor.html UnitTestingPdf.html UnitTestingXsl.html UsingCookiesWithParser.html VisitorPattern.html WebCrawler.html WebRipper.html WritingYourOwnScanners.html index.html Added Files: Benchmarks.html Log Message: Update version headers to 1.4-20031026 and update changelog. --- NEW FILE: Benchmarks.html --- <html><head><title>Benchmarks</title></head><body> <div class="wikitext"> <p>Peter Lin, who works on the <a href="http://jakarta.apache.org/jmeter/index.html" class="namedurl"><span style="white-space: nowrap">JMeter</span></a> project has performed some benchmarks that indicate htmlparser is 40% to 600% faster than JTidy: <pre> 10 20 30 40 50 100 500 Yahoo Cnet Htmlparser 80.0 126.4 160.4 200.4 236.4 400.6 1630.2 474.4 1251.8 Tidy 498.6 531.0 626.8 658.8 687.0 849.4 2319.4 965.2 2049.0 Delta 6.23 4.2 3.91 3.29 2.91 2.12 1.42 2.03 1.64 <p>Full details are available in a <a href="http://htmlparser.sourceforge.net/benchmarks.zip" class="namedurl"><span style="white-space: nowrap">zip</span> file</a>. <div id="actionbar" class="toolbar"> <hr class="printer" noshade="noshade" /> <p class="editdate">Last edited on Wednesday, October 1, 2003 6:54:10 am. <hr class="toolbar" noshade="noshade" /> </body></html> Index: BlockFeedback.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/BlockFeedback.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** BlockFeedback.html 24 Aug 2003 18:44:10 -0000 1.3 --- BlockFeedback.html 26 Oct 2003 19:46:17 -0000 1.4 *************** *** 1,6 **** ! <html><head><title>Block Feedback</title></head><body><DIV class="wikitext"> ! <P><B>Block Feedback</B></P> ! <P>The parser sends warning and error messages to standard output by default. You might want to block that. To achieve this, use a different feedback object, like so:</P> ! <PRE> Parser parser = new Parser( "http://...", --- 1,13 ---- ! <html><head><title>Block Feedback</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Block Feedback ! ! <p>The parser sends warning and error messages to standard output by default. You might want to block that. To achieve this, use a different feedback object, like so: ! ! <pre> ! Parser parser = new Parser( "http://...", *************** *** 8,12 **** DefaultParserFeedback.QUIET ) ! );</PRE> ! <P>You can also switch the feedback to DEBUG mode, to get extra details. Check <A class="wiki" HREF="EnableFeedback.html">EnableFeedback</A>.</P> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Sunday, February 23, 2003 5:40:45 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 15,32 ---- DefaultParserFeedback.QUIET ) ! ); ! ! <p>You can also switch the feedback to DEBUG mode, to get extra details. Check <a HREF=EnableFeedback.html class="wiki">EnableFeedback</a>. ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a> ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Sunday, February 23, 2003 5:40:45 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: CollectingParameter.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/CollectingParameter.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** CollectingParameter.html 24 Aug 2003 18:44:10 -0000 1.3 --- CollectingParameter.html 26 Oct 2003 19:46:17 -0000 1.4 *************** *** 1,8 **** ! <html><head><title>Collecting Parameter</title></head><body><DIV class="wikitext"> ! <P><B>Collecting Parameter</B></P> ! <P>The parser allows the use of a collecting parameter in two modes</P> ! <UL> ! <LI>a direct call to <I>extractAllNodesThatAre()</I></LI> ! <LI>Node.collectInto() during external iteration</LI></UL> ! <P>Either way, nodes are collected into the collecting parameter object if they satisfy a match criterion (usually the type).</P> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Sunday, February 23, 2003 5:40:12 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 1,30 ---- ! <html><head><title>Collecting Parameter</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Collecting Parameter ! ! <p>The parser allows the use of a collecting parameter in two modes ! ! <ul> ! ! <li>a direct call to <i>extractAllNodesThatAre() ! ! <li>Node.collectInto() during external iteration ! ! ! <p>Either way, nodes are collected into the collecting parameter object if they satisfy a match criterion (usually the type). ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a> ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Sunday, February 23, 2003 5:40:12 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: CompositePattern.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/CompositePattern.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** CompositePattern.html 24 Aug 2003 18:44:10 -0000 1.3 --- CompositePattern.html 26 Oct 2003 19:46:17 -0000 1.4 *************** *** 1,4 **** ! <html><head><title>Composite Pattern</title></head><body><DIV class="wikitext"> ! <P><B>Composite Pattern</B></P> ! <P>The Composite can be seen in action in the <I><SPAN class="wikiunknown"><U>CompositeTag</U></SPAN></I> class. All tags that can have children subclass <I><SPAN class="wikiunknown"><U>CompositeTag</U></SPAN></I>, which contains methods for iterating over these children in a uniform way. A <SPAN class="wikiunknown"><U>CompositeTag</U></SPAN> can be composed of leaf nodes or <I><SPAN class="wikiunknown"><U>CompositeTag</U></SPAN></I>s.</P> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Sunday, February 16, 2003 4:52:03 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 1,21 ---- ! <html><head><title>Composite Pattern</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Composite Pattern ! ! <p>The Composite can be seen in action in the <i><span class="wikiunknown"><u>CompositeTag class. All tags that can have children subclass <i><span class="wikiunknown"><u>CompositeTag, which contains methods for iterating over these children in a uniform way. A <span class="wikiunknown"><u>CompositeTag can be composed of leaf nodes or <i><span class="wikiunknown"><u>CompositeTags. ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a> ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Sunday, February 16, 2003 4:52:03 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: CustomTagExtraction.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/CustomTagExtraction.html,v retrieving revision 1.5 retrieving revision 1.6 diff -C2 -d -r1.5 -r1.6 *** CustomTagExtraction.html 24 Aug 2003 18:44:10 -0000 1.5 --- CustomTagExtraction.html 26 Oct 2003 19:46:17 -0000 1.6 *************** *** 1,6 **** ! <html><head><title>Custom Tag Extraction</title></head><body><DIV class="wikitext"> ! <P><B>Custom Tag Extraction</B></P> ! <P>Custom tag extraction is easy. Simply create an array of tag names that you want to extract from a page, and pass it in to <A class="wiki" HREF="TagFindingVisitor.html">TagFindingVisitor</A>, like so :</P> ! <PRE>Parser parser = new Parser(..); String [] tagsToBeFound = {"P","BR","MYTAG"}; TagFindingVisitor visitor = new TagFindingVisitor(tagsToBeFound); --- 1,13 ---- ! <html><head><title>Custom Tag Extraction</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Custom Tag Extraction ! ! <p>Custom tag extraction is easy. Simply create an array of tag names that you want to extract from a page, and pass it in to <a HREF=TagFindingVisitor.html class="wiki">TagFindingVisitor</a>, like so : ! ! <pre> ! Parser parser = new Parser(..); String [] tagsToBeFound = {"P","BR","MYTAG"}; TagFindingVisitor visitor = new TagFindingVisitor(tagsToBeFound); *************** *** 11,14 **** Node [] allBRTags = visitor.getTags(1); // Third tag specified in search ! Node [] allMyTags = visitor.getTags(2);</PRE> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A>// Just a test of wiki</P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Wednesday, April 2, 2003 1:38:24 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 18,34 ---- Node [] allBRTags = visitor.getTags(1); // Third tag specified in search ! Node [] allMyTags = visitor.getTags(2); ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a> ! // Just a test of wiki ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Wednesday, April 2, 2003 1:38:24 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: EmailExtraction.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/EmailExtraction.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** EmailExtraction.html 24 Aug 2003 18:44:10 -0000 1.3 --- EmailExtraction.html 26 Oct 2003 19:46:17 -0000 1.4 *************** *** 1,6 **** ! <html><head><title>Email Extraction</title></head><body><DIV class="wikitext"> ! <P><B>Email Extraction</B></P> ! <P>This is very similar to link extraction. You have to extract links from a page and verify that they are email addresses. Link tags have a method - <I>isMailLink()</I></P> ! <PRE> Parser parser = new Parser(..); parser.registerScanners(); Node links [] = parser.extractAllNodesThatAre(LinkTag.class); --- 1,13 ---- ! <html><head><title>Email Extraction</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Email Extraction ! ! <p>This is very similar to link extraction. You have to extract links from a page and verify that they are email addresses. Link tags have a method - <i>isMailLink() ! ! <pre> ! Parser parser = new Parser(..); parser.registerScanners(); Node links [] = parser.extractAllNodesThatAre(LinkTag.class); *************** *** 11,14 **** System.out.println("Email address: "+linkTag.getLink()); } ! }</PRE> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A>, February 16, 2003 11:41 am</P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Sunday, February 23, 2003 5:24:25 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 18,33 ---- System.out.println("Email address: "+linkTag.getLink()); } ! } ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a>, February 16, 2003 11:41 am ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Sunday, February 23, 2003 5:24:25 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: EnableFeedback.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/EnableFeedback.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** EnableFeedback.html 24 Aug 2003 18:44:10 -0000 1.3 --- EnableFeedback.html 26 Oct 2003 19:46:17 -0000 1.4 *************** *** 1,6 **** ! <html><head><title>Enable Feedback</title></head><body><DIV class="wikitext"> ! <P><B>Enable Feedback</B></P> ! <P>If the parser needs to be switched to normal or debug mode, you can do this like so:</P> ! <PRE> Parser parser = new Parser( "http://...", --- 1,13 ---- ! <html><head><title>Enable Feedback</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Enable Feedback ! ! <p>If the parser needs to be switched to normal or debug mode, you can do this like so: ! ! <pre> ! Parser parser = new Parser( "http://...", *************** *** 17,21 **** ) ); ! </PRE> ! <P>You can also turn the feedback to QUIET mode (none of the events will be triggered), to get extra details. Check <A class="wiki" HREF="BlockFeedback.html">BlockFeedback</A>. To handle the feedback yourself, without displaying it to standard output, subclass <SPAN class="wikiunknown"><U>ParserFeedback</U></SPAN>, and override <I>info()</I>, <I>warning()</I> and <I>error()</I>.</P> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Sunday, February 23, 2003 5:41:24 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 24,41 ---- ) ); ! ! ! <p>You can also turn the feedback to QUIET mode (none of the events will be triggered), to get extra details. Check <a HREF=BlockFeedback.html class="wiki">BlockFeedback</a>. To handle the feedback yourself, without displaying it to standard output, subclass <span class="wikiunknown"><u>ParserFeedback, and override <i>info(), <i>warning() and <i>error(). ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a> ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Sunday, February 23, 2003 5:41:24 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: ExternalIterators.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/ExternalIterators.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** ExternalIterators.html 24 Aug 2003 18:44:10 -0000 1.3 --- ExternalIterators.html 26 Oct 2003 19:46:17 -0000 1.4 *************** *** 1,6 **** ! <html><head><title>External Iterators</title></head><body><DIV class="wikitext"> ! <P><B>External Iterators</B></P> ! <P>You can use external iterators to drive the entire parsing process like so :</P> ! <PRE> for (NodeIterator i = parser.elements();i.hasMoreNodes();) { Node node = e.nextNode(); if (node instanceof LinkTag) { --- 1,13 ---- ! <html><head><title>External Iterators</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>External Iterators ! ! <p>You can use external iterators to drive the entire parsing process like so : ! ! <pre> ! for (NodeIterator i = parser.elements();i.hasMoreNodes();) { Node node = e.nextNode(); if (node instanceof LinkTag) { *************** *** 8,12 **** if (node instanceof ImageTag) { } ! }</PRE> ! <P>You should think of this only when you want to conduct a really quick search, and the moment you've found what you've wanted, you want to stop parsing. The iterator here drives the parsing.</P> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Sunday, February 23, 2003 5:36:09 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 15,32 ---- if (node instanceof ImageTag) { } ! } ! ! <p>You should think of this only when you want to conduct a really quick search, and the moment you've found what you've wanted, you want to stop parsing. The iterator here drives the parsing. ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a> ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Sunday, February 23, 2003 5:36:09 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: FactoryMethod.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/FactoryMethod.html,v retrieving revision 1.4 retrieving revision 1.5 diff -C2 -d -r1.4 -r1.5 *** FactoryMethod.html 24 Aug 2003 18:44:10 -0000 1.4 --- FactoryMethod.html 26 Oct 2003 19:46:17 -0000 1.5 *************** *** 1,9 **** ! <html><head><title>Factory Method</title></head><body><DIV class="wikitext"> ! <P><B>Factory Method</B></P> ! <P><I><A class="wiki" HREF="TagScanner.html">TagScanner</A></I> possess an FM for the creation of a tag.</P> ! <PRE> protected Tag createTag(TagData tagData);</PRE> ! <P>Scanner subclasses override this to specify the type of tag to be constructed.</P> ! <P><I><SPAN class="wikiunknown"><U>CompositeTagScanner</U></SPAN></I> possesses an FM for the creation of a tag.</P> ! <PRE> protected Tag createTag(TagData tagData,CompositeTagData compositeTagData);</PRE> ! <P>Composite scanners override this to specify the type of tag to be constructed.</P> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Sunday, February 23, 2003 5:37:36 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 1,33 ---- ! <html><head><title>Factory Method</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Factory Method ! ! <p><i><a HREF=TagScanner.html class="wiki">TagScanner</a> possess an FM for the creation of a tag. ! ! <pre> ! protected Tag createTag(TagData tagData); ! ! <p>Scanner subclasses override this to specify the type of tag to be constructed. ! ! <p><i><span class="wikiunknown"><u>CompositeTagScanner possesses an FM for the creation of a tag. ! ! <pre> ! protected Tag createTag(TagData tagData,CompositeTagData compositeTagData); ! ! <p>Composite scanners override this to specify the type of tag to be constructed. ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a> ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Sunday, February 23, 2003 5:37:36 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: FeedbackMechanism.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/FeedbackMechanism.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** FeedbackMechanism.html 24 Aug 2003 18:44:10 -0000 1.3 --- FeedbackMechanism.html 26 Oct 2003 19:46:17 -0000 1.4 *************** *** 1,18 **** ! <html><head><title>Feedback Mechanism</title></head><body><DIV class="wikitext"> ! <P><B>Feedback Mechanism</B></P> ! <P>The parser has a feedback mechanism that allows you to obtain feedback about the parsing process. You can get to know if there were any errors, or any warnings, or any general information. Warnings occur when the parser has encountered dirty html, but was able to fix it and continue. Errors occur when the parser was not able to handle the html.</P> ! <P>An understanding of the feedback mechanism is useful if you wish to perform logging, or turn off the default feedback and incorporate your own.</P> ! <P>When you create a parser object without specifying any feedback object, the parser creates a default feedback object - DefaultHTMLParserFeedback. This works in three modes - NORMAL, QUIET and DEBUG, and when no feedback object is specified, it defaults to normal. In this mode, all information, warnings and errors are sent to standard output.</P> ! <PRE>HTMLParser parser = new HTMLParser(someUrl);</PRE> ! <P>The above code snippet shows the default configuration - the feedback object is created in the normal mode. You can turn off the messages by turning the feedback mechanism to the quiet mode. This can be done in two ways :</P> ! <PRE>HTMLParser parser = new HTMLParser(someUrl,null); ! Java2html</PRE> ! <P>or</P> ! <PRE>HTMLParser parser = new HTMLParser(someUrl,new DefaultHTMLParserFeedback(DefaultHTMLParserFeedback.QUIET));</PRE> ! <P>In this mode, there is no feedback on standard output. ! For debugging purposes, you can use the debug mode to receive all stack traces of exceptions that are thrown.</P> ! <PRE>HTMLParser parser = new HTMLParser(someUrl,new DefaultHTMLParserFeedback(DefaultHTMLParserFeedback.DEBUG));</PRE> ! <P>If you wish to add a file logger- you can write your own custom feedback class like this :</P> ! <PRE>public class FileFeedback implements HTMLParserFeedback{ public FileFeedback(String file) { // .. Initialize the file for logging --- 1,39 ---- ! <html><head><title>Feedback Mechanism</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Feedback Mechanism ! ! <p>The parser has a feedback mechanism that allows you to obtain feedback about the parsing process. You can get to know if there were any errors, or any warnings, or any general information. Warnings occur when the parser has encountered dirty html, but was able to fix it and continue. Errors occur when the parser was not able to handle the html. ! ! <p>An understanding of the feedback mechanism is useful if you wish to perform logging, or turn off the default feedback and incorporate your own. ! ! <p>When you create a parser object without specifying any feedback object, the parser creates a default feedback object - DefaultHTMLParserFeedback. This works in three modes - NORMAL, QUIET and DEBUG, and when no feedback object is specified, it defaults to normal. In this mode, all information, warnings and errors are sent to standard output. ! ! <pre> ! HTMLParser parser = new HTMLParser(someUrl); ! ! <p>The above code snippet shows the default configuration - the feedback object is created in the normal mode. You can turn off the messages by turning the feedback mechanism to the quiet mode. This can be done in two ways : ! ! <pre> ! HTMLParser parser = new HTMLParser(someUrl,null); ! Java2html ! ! <p>or ! ! <pre> ! HTMLParser parser = new HTMLParser(someUrl,new DefaultHTMLParserFeedback(DefaultHTMLParserFeedback.QUIET)); ! ! <p>In this mode, there is no feedback on standard output. ! For debugging purposes, you can use the debug mode to receive all stack traces of exceptions that are thrown. ! ! <pre> ! HTMLParser parser = new HTMLParser(someUrl,new DefaultHTMLParserFeedback(DefaultHTMLParserFeedback.DEBUG)); ! ! <p>If you wish to add a file logger- you can write your own custom feedback class like this : ! ! <pre> ! public class FileFeedback implements HTMLParserFeedback{ public FileFeedback(String file) { // .. Initialize the file for logging *************** *** 27,30 **** // .. log the error message } ! }</PRE> ! <P>You can supply an object of this type to the parser in the constructor, and accordingly channel the feedback.</P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Friday, March 21, 2003 11:51:12 am.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 48,63 ---- // .. log the error message } ! } ! ! <p>You can supply an object of this type to the parser in the constructor, and accordingly channel the feedback. ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Friday, March 21, 2003 11:51:12 am. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: FirstName.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/FirstName.html,v retrieving revision 1.2 retrieving revision 1.3 diff -C2 -d -r1.2 -r1.3 *** FirstName.html 24 Aug 2003 18:44:10 -0000 1.2 --- FirstName.html 26 Oct 2003 19:46:17 -0000 1.3 *************** *** 1,2 **** ! <html><head><title>First Name</title></head><body><DIV class="wikitext"> ! <P>Describe <A class="wiki" HREF="FirstName.html">FirstName</A> here.</P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Thursday, July 17, 2003 4:35:59 am.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 1,17 ---- ! <html><head><title>First Name</title></head><body> ! ! ! ! <div class="wikitext"> ! <p>Describe <a HREF=FirstName.html class="wiki">FirstName</a> here. ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Thursday, July 17, 2003 4:35:59 am. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: FrequentlyAskedQuestions.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/FrequentlyAskedQuestions.html,v retrieving revision 1.5 retrieving revision 1.6 diff -C2 -d -r1.5 -r1.6 *** FrequentlyAskedQuestions.html 24 Aug 2003 18:44:10 -0000 1.5 --- FrequentlyAskedQuestions.html 26 Oct 2003 19:46:17 -0000 1.6 *************** *** 1,7 **** ! <html><head><title>Frequently Asked Questions</title></head><body><DIV class="wikitext"> ! <P><B>FAQ</B></P><HR/> ! <P><B>How does the parser deal with tags like <tag/> ?</B></P> ! <P>The parser handles them as a normal Tag object. The Tag class has a method - isEmptyXmlTag() which can be queried to find if this an empty xml tag.</P><HR/> ! <P><B>How does the parser deal with HTML tags which should be terminated with /> but are not, i.e. <BR/> and <HR>? Is there any way to automatically know that some HTML tags are empty?</B></P><HR/> ! <P><B>How is JSP parsed using the HTMLParser?</B></P><HR/> ! <P><B>How do you find the byte offset from the beginning of a document for a tag?</B></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Thursday, June 19, 2003 10:49:11 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 1,32 ---- ! <html><head><title>Frequently Asked Questions</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>FAQ ! ! <hr /> ! <p><b>How does the parser deal with tags like <tag/> ? ! ! <p>The parser handles them as a normal Tag object. The Tag class has a method - isEmptyXmlTag() which can be queried to find if this an empty xml tag. ! ! <hr /> ! <p><b>How does the parser deal with HTML tags which should be terminated with /> but are not, i.e. ! <br /> and <HR>? Is there any way to automatically know that some HTML tags are empty? ! ! <hr /> ! <p><b>How is JSP parsed using the HTMLParser? ! ! <hr /> ! <p><b>How do you find the byte offset from the beginning of a document for a tag? ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Thursday, June 19, 2003 10:49:11 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: FullName.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/FullName.html,v retrieving revision 1.1 retrieving revision 1.2 diff -C2 -d -r1.1 -r1.2 *** FullName.html 24 Aug 2003 18:44:10 -0000 1.1 --- FullName.html 26 Oct 2003 19:46:17 -0000 1.2 *************** *** 1,2 **** ! <html><head><title>Full Name</title></head><body><DIV class="wikitext"> ! <P>Describe [<SPAN class="wikiunknown"><U>FullNa</U></SPAN></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Friday, August 15, 2003 10:24:19 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 1,17 ---- ! <html><head><title>Full Name</title></head><body> ! ! ! ! <div class="wikitext"> ! <p>Describe [<span class="wikiunknown"><u>FullNa ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Friday, August 15, 2003 10:24:19 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: ImageExtraction.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/ImageExtraction.html,v retrieving revision 1.4 retrieving revision 1.5 diff -C2 -d -r1.4 -r1.5 *** ImageExtraction.html 24 Aug 2003 18:44:10 -0000 1.4 --- ImageExtraction.html 26 Oct 2003 19:46:17 -0000 1.5 *************** *** 1,7 **** ! <html><head><title>Image Extraction</title></head><body><DIV class="wikitext"> ! <P><B>Image Extractions</B></P> ! <P>This is very similar to <A class="wiki" HREF="LinkExtraction.html">LinkExtraction</A>.</P> ! <P>1. Use the <I><SPAN class="wikiunknown"><U>ObjectFindingVisitor</U></SPAN></I> like so :</P> ! <PRE>Parser parser = new Parser("http://urlIWantToParse.com"); // Create a visitor, specify that you want to recurse through its children // Recursion is needed only if you register all scanners, and a link tag could be embedded --- 1,15 ---- ! <html><head><title>Image Extraction</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Image Extractions ! ! <p>This is very similar to <a HREF=LinkExtraction.html class="wiki">LinkExtraction</a>. ! ! <p>1. Use the <i><span class="wikiunknown"><u>ObjectFindingVisitor like so : ! ! <pre> ! Parser parser = new Parser("http://urlIWantToParse.com"); // Create a visitor, specify that you want to recurse through its children // Recursion is needed only if you register all scanners, and a link tag could be embedded *************** *** 19,25 **** ImageTag imageTag = (ImageTag)images[i]; System.out.println(imageTag.getImageLocation()); ! }</PRE> ! <P>2: Use <I>extractAllNodesThatAre()</I></P> ! <PRE> Parser parser = new Parser("http://urlIWantToParse.com"); parser.registerScanners(); // Instead of registering all scanners, --- 27,36 ---- ImageTag imageTag = (ImageTag)images[i]; System.out.println(imageTag.getImageLocation()); ! } ! ! <p>2: Use <i>extractAllNodesThatAre() ! ! <pre> ! Parser parser = new Parser("http://urlIWantToParse.com"); parser.registerScanners(); // Instead of registering all scanners, *************** *** 30,33 **** ImageTag imageTag = (ImageTag)images[i]; System.out.println(imageTag.getImageLocation()); ! }</PRE> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A>, Sunday, February 16, 2003 2:02:18 pm.</P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Wednesday, June 25, 2003 9:11:46 am.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 41,56 ---- ImageTag imageTag = (ImageTag)images[i]; System.out.println(imageTag.getImageLocation()); ! } ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a>, Sunday, February 16, 2003 2:02:18 pm. ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Wednesday, June 25, 2003 9:11:46 am. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: InternalIterators.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/InternalIterators.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** InternalIterators.html 24 Aug 2003 18:44:10 -0000 1.3 --- InternalIterators.html 26 Oct 2003 19:46:17 -0000 1.4 *************** *** 1,4 **** ! <html><head><title>Internal Iterators</title></head><body><DIV class="wikitext"> ! <P><B>Internal Iterators</B></P> ! <P>You can use internal iterators by overriding trigger methods that you're interested in. This is done by subclassing HTMLVisitor. An example can be found in <A class="wiki" HREF="LinkExtraction.html">LinkExtraction</A>.</P> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Sunday, February 16, 2003 4:08:46 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 1,21 ---- ! <html><head><title>Internal Iterators</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Internal Iterators ! ! <p>You can use internal iterators by overriding trigger methods that you're interested in. This is done by subclassing HTMLVisitor. An example can be found in <a HREF=LinkExtraction.html class="wiki">LinkExtraction</a>. ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a> ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Sunday, February 16, 2003 4:08:46 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: IteratorPattern.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/IteratorPattern.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** IteratorPattern.html 24 Aug 2003 18:44:10 -0000 1.3 --- IteratorPattern.html 26 Oct 2003 19:46:17 -0000 1.4 *************** *** 1,6 **** ! <html><head><title>Iterator Pattern</title></head><body><DIV class="wikitext"> ! <P><B>Iterator Pattern</B></P> ! <P>The Iterator can be seen in action in two of its flavors - <A class="wiki" HREF="ExternalIterators.html">ExternalIterators</A>, and <A class="wiki" HREF="InternalIterators.html">InternalIterators</A>. ! The <I>HTMLEnumeration</I> class provides the external iteration facility. ! <I><SPAN class="wikiunknown"><U>SimpleEnumeration</U></SPAN></I> allows external iteration over <I><SPAN class="wikiunknown"><U>NodeList</U></SPAN></I>s.</P> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Sunday, February 16, 2003 5:04:10 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 1,23 ---- ! <html><head><title>Iterator Pattern</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Iterator Pattern ! ! <p>The Iterator can be seen in action in two of its flavors - <a HREF=ExternalIterators.html class="wiki">ExternalIterators</a>, and <a HREF=InternalIterators.html class="wiki">InternalIterators</a>. ! The <i>HTMLEnumeration class provides the external iteration facility. ! <i><span class="wikiunknown"><u>SimpleEnumeration allows external iteration over <i><span class="wikiunknown"><u>NodeLists. ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a> ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Sunday, February 16, 2003 5:04:10 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: JavaBeans.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/JavaBeans.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** JavaBeans.html 24 Aug 2003 18:44:10 -0000 1.3 --- JavaBeans.html 26 Oct 2003 19:46:17 -0000 1.4 *************** *** 1,19 **** ! <html><head><title>Java Beans</title></head><body><DIV class="wikitext"> ! <P><B>Quick Introduction</B></P> ! <P>Run the example program that demonstrates the capabilities of the Java Beans that are already included in the htmparser.jar (it's assumed that the htmlparser.jar file from an integration build 1.3 later than April 12, 2003 is in your current directory):</P> ! <PRE>java -classpath htmlparser.jar org.htmlparser.beans.BeanyBaby</PRE> ! <P>What you should see is a split window showing a URL extraction with a list of links on the left and the text on the right.<BR/></P> ! <P><IMG alt="http://htmlparser.sourceforge.net/images/BeanyBaby.jpg" SRC="images/BeanyBaby.jpg" class="inlineimage"><BR/></P> ! <P>The splitter on the left contains a GUI oriented <TT>HTMLLinkBean</TT> (which uses an underlying API <TT>LinkBean</TT>) and the splitter on the right contains a GUI oriented <TT>HTMLStringBean</TT> (which uses an underlying API <TT>StringBean</TT>).<BR/></P> ! <P>Type in a URL or double-click a URL from the list. Use the Go menu to go back to a previous link or step to the next link you already visited.</P> ! <P>The options menu provides access to the binary properties:<BR/></P> ! <P><IMG alt="http://htmlparser.sourceforge.net/images/BeanyBabyOptions.jpg" SRC="images/BeanyBabyOptions.jpg" class="inlineimage"><BR/></P> ! <UL> ! <LI>Links - turn on and off the extraction of hyperlinks with the text</LI> ! <LI>Collapse - turn on and off collapsing whitespace</LI> ! <LI>Non-Breaking Spaces - turn on and off transforming non-break spaces into regular spaces</LI></UL> ! <P><B>Simple Usage</B></P> ! <P>The simplest operation (this shows StringBean use, but LinkBean use is similar) is just to create a new one, set the URL and then get the text:<BR/></P> ! <PRE>#import org.htmlparser.beans.StringBean; public class TryBeans --- 1,47 ---- ! <html><head><title>Java Beans</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Quick Introduction ! ! <p>Run the example program that demonstrates the capabilities of the Java Beans that are already included in the htmparser.jar (it's assumed that the htmlparser.jar file from an integration build 1.3 later than April 12, 2003 is in your current directory): ! ! <pre> ! java -classpath htmlparser.jar org.htmlparser.beans.BeanyBaby ! ! <p>What you should see is a split window showing a URL extraction with a list of links on the left and the text on the right. ! <br /> ! ! <p><img SRC="images/BeanyBaby.jpg" alt="http://htmlparser.sourceforge.net/images/BeanyBaby.jpg" class="inlineimage" /> ! <br /> ! ! <p>The splitter on the left contains a GUI oriented <tt>HTMLLinkBean (which uses an underlying API <tt>LinkBean) and the splitter on the right contains a GUI oriented <tt>HTMLStringBean (which uses an underlying API <tt>StringBean). ! <br /> ! ! <p>Type in a URL or double-click a URL from the list. Use the Go menu to go back to a previous link or step to the next link you already visited. ! ! <p>The options menu provides access to the binary properties: ! <br /> ! ! <p><img SRC="images/BeanyBabyOptions.jpg" alt="http://htmlparser.sourceforge.net/images/BeanyBabyOptions.jpg" class="inlineimage" /> ! <br /> ! ! <ul> ! ! <li>Links - turn on and off the extraction of hyperlinks with the text ! ! <li>Collapse - turn on and off collapsing whitespace ! ! <li>Non-Breaking Spaces - turn on and off transforming non-break spaces into regular spaces ! ! ! <p><b>Simple Usage ! ! <p>The simplest operation (this shows StringBean use, but LinkBean use is similar) is just to create a new one, set the URL and then get the text: ! <br /> ! ! <pre> ! #import org.htmlparser.beans.StringBean; public class TryBeans *************** *** 22,45 **** { StringBean sb = new StringBean (); ! sb.setURL ("<A class="namedurl" href="http://cbc.ca"><SPAN style="white-space: nowrap">http://cbc.ca</SPAN></A>"); System.out.println (sb.getStrings ()); } ! }</PRE> ! <P>Save this in a file called TryBeans.java and then run the following commands:</P> ! <PRE>javac -classpath htmlparser.jar TryBeans.java ! java -classpath htmlparser.jar:. TryBeans</PRE> ! <P>or for Windows:</P> ! <PRE>java -classpath htmlparser.jar;. TryBeans</PRE> ! <P><B>Simple GUI Usage</B></P> ! <P>The following instructions are for the <A class="namedurl" href="http://www.netbeans.org"><SPAN style="white-space: nowrap">NetBeans</SPAN></A> IDE but other environments will have a similar operation.</P> ! <P>You can mount the htmlparser.jar file:<BR/></P> ! <P><IMG alt="http://htmlparser.sourceforge.net/images/Mount.jpg" SRC="images/Mount.jpg" class="inlineimage"><BR/></P> ! <P>and use the bean classes directly or if you want to use them in the Form designer you'll need to install them. Use the Install New Javabean menu item in the Tools menu:<BR/></P> ! <P><IMG alt="http://htmlparser.sourceforge.net/images/InstallBean.jpg" SRC="images/InstallBean.jpg" class="inlineimage"><BR/></P> ! <P>There are a number of beans in the jar, as indicated above the GUI beans are the HTMLStringBean and HTMLLinkBean. You can install them all, but it might clutter up your palette a bit, so I would recomend only install the ones you need for the project at hand. You'll also need to specify the palette that the beans will be added to:</P> ! <P><IMG alt="http://htmlparser.sourceforge.net/images/ChooseBean.jpg" SRC="images/ChooseBean.jpg" class="inlineimage"><IMG alt="http://htmlparser.sourceforge.net/images/ChoosePalette.jpg" SRC="images/ChoosePalette.jpg" class="inlineimage"><BR/></P> ! <P>Once the bean is installed it will show up on the tool palette and you can click it and drop it onto a JFrame or JPanel or whatever:<BR/></P> ! <P><IMG alt="http://htmlparser.sourceforge.net/images/AddingBean.jpg" SRC="images/AddingBean.jpg" class="inlineimage"><BR/></P> ! <P>Once it's in your designer you can set the properties and have it display the text even while designing (assuming you're online):<BR/></P> ! <P><IMG alt="http://htmlparser.sourceforge.net/images/SettingProperties.jpg" SRC="images/SettingProperties.jpg" class="inlineimage"><BR/></P> ! <P>Of course you can subclass the provided beans or write your own.</P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Saturday, April 5, 2003 7:25:55 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 50,113 ---- { StringBean sb = new StringBean (); ! sb.setURL ("<a href="http://cbc.ca" class="namedurl"><span style="white-space: nowrap">http://cbc.ca</span></a>"); System.out.println (sb.getStrings ()); } ! } ! ! <p>Save this in a file called TryBeans.java and then run the following commands: ! ! <pre> ! javac -classpath htmlparser.jar TryBeans.java ! java -classpath htmlparser.jar:. TryBeans ! ! <p>or for Windows: ! ! <pre> ! java -classpath htmlparser.jar;. TryBeans ! ! <p><b>Simple GUI Usage ! ! <p>The following instructions are for the <a href="http://www.netbeans.org" class="namedurl"><span style="white-space: nowrap">NetBeans</span></a> IDE but other environments will have a similar operation. ! ! <p>You can mount the htmlparser.jar file: ! <br /> ! ! <p><img SRC="images/Mount.jpg" alt="http://htmlparser.sourceforge.net/images/Mount.jpg" class="inlineimage" /> ! <br /> ! ! <p>and use the bean classes directly or if you want to use them in the Form designer you'll need to install them. Use the Install New Javabean menu item in the Tools menu: ! <br /> ! ! <p><img SRC="images/InstallBean.jpg" alt="http://htmlparser.sourceforge.net/images/InstallBean.jpg" class="inlineimage" /> ! <br /> ! ! <p>There are a number of beans in the jar, as indicated above the GUI beans are the HTMLStringBean and HTMLLinkBean. You can install them all, but it might clutter up your palette a bit, so I would recomend only install the ones you need for the project at hand. You'll also need to specify the palette that the beans will be added to: ! ! <p><img SRC="images/ChooseBean.jpg" alt="http://htmlparser.sourceforge.net/images/ChooseBean.jpg" class="inlineimage" /> ! <img SRC="images/ChoosePalette.jpg" alt="http://htmlparser.sourceforge.net/images/ChoosePalette.jpg" class="inlineimage" /> ! <br /> ! ! <p>Once the bean is installed it will show up on the tool palette and you can click it and drop it onto a JFrame or JPanel or whatever: ! <br /> ! ! <p><img SRC="images/AddingBean.jpg" alt="http://htmlparser.sourceforge.net/images/AddingBean.jpg" class="inlineimage" /> ! <br /> ! ! <p>Once it's in your designer you can set the properties and have it display the text even while designing (assuming you're online): ! <br /> ! ! <p><img SRC="images/SettingProperties.jpg" alt="http://htmlparser.sourceforge.net/images/SettingProperties.jpg" class="inlineimage" /> ! <br /> ! ! <p>Of course you can subclass the provided beans or write your own. ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Saturday, April 5, 2003 7:25:55 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: LastName.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/LastName.html,v retrieving revision 1.2 retrieving revision 1.3 diff -C2 -d -r1.2 -r1.3 *** LastName.html 24 Aug 2003 18:44:10 -0000 1.2 --- LastName.html 26 Oct 2003 19:46:17 -0000 1.3 *************** *** 1,2 **** ! <html><head><title>Last Name</title></head><body><DIV class="wikitext"> ! <P>Describe <A class="wiki" HREF="LastName.html">LastName</A> here.fdsadfsafdsaf</P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Thursday, July 17, 2003 4:38:05 am.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 1,17 ---- ! <html><head><title>Last Name</title></head><body> ! ! ! ! <div class="wikitext"> ! <p>Describe <a HREF=LastName.html class="wiki">LastName</a> here.fdsadfsafdsaf ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Thursday, July 17, 2003 4:38:05 am. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: LinkExtraction.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/LinkExtraction.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** LinkExtraction.html 24 Aug 2003 18:44:10 -0000 1.3 --- LinkExtraction.html 26 Oct 2003 19:46:17 -0000 1.4 *************** *** 1,7 **** ! <html><head><title>Link Extraction</title></head><body><DIV class="wikitext"> ! <P><B>Link Extraction</B></P> ! <P>There are many ways of extracting links.</P> ! <P>1. Use the <SPAN class="wikiunknown"><U>ObjectFindingVisitor</U></SPAN> to extract links, like so:</P> ! <PRE> Parser parser = new Parser("http://urlIWantToParse.com"); // Create a visitor, specify that you want to recurse through its children // Recursion is needed only if you register all scanners, and a link tag could be embedded --- 1,15 ---- ! <html><head><title>Link Extraction</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Link Extraction ! ! <p>There are many ways of extracting links. ! ! <p>1. Use the <span class="wikiunknown"><u>ObjectFindingVisitor to extract links, like so: ! ! <pre> ! Parser parser = new Parser("http://urlIWantToParse.com"); // Create a visitor, specify that you want to recurse through its children // Recursion is needed only if you register all scanners, and a link tag could be embedded *************** *** 20,26 **** System.out.println(linkTag.getLink()); System.out.println(linkTag.getLinkText()); ! }</PRE> ! <P>2. Use the parser utility method - extractAllNodesThatAre().</P> ! <PRE> Parser parser = new Parser("http://urlIWantToParse.com"); parser.registerScanners(); Node [] links = parser.extractAllNodesThatAre(LinkTag.class); --- 28,37 ---- System.out.println(linkTag.getLink()); System.out.println(linkTag.getLinkText()); ! } ! ! <p>2. Use the parser utility method - extractAllNodesThatAre(). ! ! <pre> ! Parser parser = new Parser("http://urlIWantToParse.com"); parser.registerScanners(); Node [] links = parser.extractAllNodesThatAre(LinkTag.class); *************** *** 31,37 **** System.out.println(linkTag.getLink()); System.out.println(linkTag.getLinkText()); ! }</PRE> ! <P>3. It is possible that you are interested in extracting more than just links. In order to customize extraction, write your own visitor. Extend the Visitor class (in the package org.htmlparser.visitors - Parser v1.3 upwards) like so :</P> ! <PRE> public class MyCustomizedVisitor extends Visitor { public MyCustomizedVisitor(Parser parser) { super(true); /// Its usually a good idea to perform recursion --- 42,51 ---- System.out.println(linkTag.getLink()); System.out.println(linkTag.getLinkText()); ! } ! ! <p>3. It is possible that you are interested in extracting more than just links. In order to customize extraction, write your own visitor. Extend the Visitor class (in the package org.htmlparser.visitors - Parser v1.3 upwards) like so : ! ! <pre> ! public class MyCustomizedVisitor extends Visitor { public MyCustomizedVisitor(Parser parser) { super(true); /// Its usually a good idea to perform recursion *************** *** 78,84 **** In your app.. Parser parser = new Parser(...); ! MyCustomizedVisitor visitor = new MyCustomizedVisitor(); parser.visitAllNodesWith(visitor); // You can now get the data from the visitor interface. ! </PRE> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Sunday, February 23, 2003 5:22:44 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 92,110 ---- In your app.. Parser parser = new Parser(...); ! MyCustomizedVisitor visitor = new MyCustomizedVisitor(parser); parser.visitAllNodesWith(visitor); // You can now get the data from the visitor interface. ! ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a> ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Tuesday, September 2, 2003 1:59:15 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: ParserDesign.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/ParserDesign.html,v retrieving revision 1.4 retrieving revision 1.5 diff -C2 -d -r1.4 -r1.5 *** ParserDesign.html 24 Aug 2003 18:44:10 -0000 1.4 --- ParserDesign.html 26 Oct 2003 19:46:17 -0000 1.5 *************** *** 1,6 **** ! <html><head><title>Parser Design</title></head><body><DIV class="wikitext"> ! <P><B>Parser Design</B></P> ! <P>HTMLParser is a SAX-like parser streaming parser, that has the capability to correct dirty-html on the fly. It is extremely fast and lightweight. The binary distribution of the jar file is around 135 KB only, and it can easily be brought down to 65 KB for a minimal parsing requirement (prior to optimization and obfuscation).</P> ! <P>It is also extensible. The parser provides both <A class="wiki" HREF="InternalIterators.html">InternalIterators</A> and <A class="wiki" HREF="ExternalIterators.html">ExternalIterators</A>. ! The parser has some interesting <A class="wiki" HREF="PatternStories.html">PatternStories</A>..</P> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Monday, March 17, 2003 6:18:45 am.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 1,24 ---- ! <html><head><title>Parser Design</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Parser Design ! ! <p>HTMLParser is a SAX-like parser streaming parser, that has the capability to correct dirty-html on the fly. It is extremely fast and lightweight. The binary distribution of the jar file is around 135 KB only, and it can easily be brought down to 65 KB for a minimal parsing requirement (prior to optimization and obfuscation). ! ! <p>It is also extensible. The parser provides both <a HREF=InternalIterators.html class="wiki">InternalIterators</a> and <a HREF=ExternalIterators.html class="wiki">ExternalIterators</a>. ! The parser has some interesting <a HREF=PatternStories.html class="wiki">PatternStories</a>.. ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a> ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Monday, March 17, 2003 6:18:45 am. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: ParsingXml.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/ParsingXml.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** ParsingXml.html 24 Aug 2003 18:44:10 -0000 1.3 --- ParsingXml.html 26 Oct 2003 19:46:17 -0000 1.4 *************** *** 1,1813 **** ! <html><head><title>Parsing Xml</title></head><body><DIV class="wikitext"> ! <P><?xml version="1.0" encoding="iso-8859-1" ?></P><BLOCKQUOTE style="border-left-width: medium; border-left-color: #0f0; border-left-style: ridge; padding-left: 1em; margin-left: 0em; margin-right: 0em;"> ! <BLOCKQUOTE> ! <P><<SPAN class="wikiunknown"><U>ReviewerInformation</U></SPAN>></P> ! <P><Reviewer></P> ! <P><PeopleID>9</PeopleID></P> ! <P><<A class="wiki" HREF="FirstName.html">FirstName</A>>Niall</<A class="wiki" HREF="FirstName.html">FirstName</A>></P> ! <P><<A class="wiki" HREF="LastName.html">LastName</A>>Adams</<A class="wiki" HREF="LastName.html">LastName</A>></P> ! <P><<A class="wiki" HREF="FullName.html">FullName</A>>Niall Adams</<A class="wiki" HREF="FullName.html">FullName</A>></P> ! <P><Organization>Imperial College</Organization></P> [...5429 lines suppressed...] ! ! <p><Fax>509-479-4522</Fax> ! ! <p></Reviewer> ! ! <p></<span class="wikiunknown"><u>ReviewerInformation> ! ! ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Tuesday, June 24, 2003 1:32:51 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: PatternStories.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/PatternStories.html,v retrieving revision 1.4 retrieving revision 1.5 diff -C2 -d -r1.4 -r1.5 *** PatternStories.html 24 Aug 2003 18:44:10 -0000 1.4 --- PatternStories.html 26 Oct 2003 19:46:17 -0000 1.5 *************** *** 1,12 **** ! <html><head><title>Pattern Stories</title></head><body><DIV class="wikitext"> ! <P><B>Pattern Stories</B></P> ! <P>The parser uses the following patterns:</P> ! <UL> ! <LI><A class="wiki" HREF="FactoryMethod.html">FactoryMethod</A></LI> ! <LI><A class="wiki" HREF="TemplateMethod.html">TemplateMethod</A></LI> ! <LI><A class="wiki" HREF="IteratorPattern.html">IteratorPattern</A></LI> ! <LI><A class="wiki" HREF="VisitorPattern.html">VisitorPattern</A></LI> ! <LI><A class="wiki" HREF="CollectingParameter.html">CollectingParameter</A></LI> ! <LI><A class="wiki" HREF="StrategyPattern.html">StrategyPattern</A></LI> ! <LI><A class="wiki" HREF="CompositePattern.html">CompositePattern</A></LI></UL> ! <P>--<A class="wiki" HREF="SomikRaha.html">SomikRaha</A></P></DIV><DIV id="actionbar" class="toolbar"><HR noshade="noshade" class="printer"/><P class="editdate">Last edited on Friday, May 16, 2003 2:30:12 pm.</P><HR noshade="noshade" class="toolbar"/></body></html> \ No newline at end of file --- 1,38 ---- ! <html><head><title>Pattern Stories</title></head><body> ! ! ! ! <div class="wikitext"> ! <p><b>Pattern Stories ! ! <p>The parser uses the following patterns: ! ! <ul> ! ! <li><a HREF=FactoryMethod.html class="wiki">FactoryMethod</a> ! ! <li><a HREF=TemplateMethod.html class="wiki">TemplateMethod</a> ! ! <li><a HREF=IteratorPattern.html class="wiki">IteratorPattern</a> ! ! <li><a HREF=VisitorPattern.html class="wiki">VisitorPattern</a> ! ! <li><a HREF=CollectingParameter.html class="wiki">CollectingParameter</a> ! ! <li><a HREF=StrategyPattern.html class="wiki">StrategyPattern</a> ! ! <li><a HREF=CompositePattern.html class="wiki">CompositePattern</a> ! ! ! <p>--<a HREF=SomikRaha.html class="wiki">SomikRaha</a> ! ! ! ! <div id="actionbar" class="toolbar"> ! ! <hr class="printer" noshade="noshade" /> ! ! <p class="editdate">Last edited on Friday, May 16, 2003 2:30:12 pm. ! ! <hr class="toolbar" noshade="noshade" /> ! </body></html> \ No newline at end of file Index: PostOperation.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/docs/PostOperation.html,v retrieving revision 1.2 retrieving revision 1.3 diff -C2 -d -r1.2 -r1.3 *** PostOperation.html 24 Aug 2003 18:44:10 -0000 1.2 --- PostOperation.html 26 Oct 2003 19:46:17 -0000 1.3 *************** *** 1,18 **** ! <html><head><title>Post Operation</title></head><body><DIV class="wikitext"> ! <H4>POST Operation</H4> ! <P>The standard HTTP request submitted by the parser is a GET. This note describes how to use POST, which is the usual request submitted by a form.</P> ! <P>As an example, we'll submit a form to the U.S. postal service web site.<BR/><I>Note: This is suboptimal, the postal service provides tools for this type of thing: <A class="namedurl" href="http://www.uspswebtools.com"><SPAN style="white-space: nowrap">http://www.uspswebtools.com</SPAN></A></I><BR/></P> ! <P>On the USPS web site, the page <A class="namedurl" href="http://www.usps.com/zip4/citytown.htm"><SPAN style="white-space: nowrap">http://www.usps.com/zip4/citytown.htm</SPAN></A> has the following FORM that asks for a zip code and returns the cities or towns covered by the zip code (only form elements are shown removing all the formatting markup):</P> ! <PRE><form NAME="frmzip" ACTION="zip_response.jsp" METHOD="post" OnSubmit="return validate(frmzip)"> <input type="text" id="zipcode" name="zipcode" size="5" maxlength="5" TABINDEX="10"> ! <input TYPE="image" NAME="Submit" SRC="/zip4/images/submit.jpg" BORDER="0" WIDTH="50" HEIGHT="17" ALT="Submit" TABINDEX="11"></PRE> ! <P>From this we determine that the <TT>METHOD</TT> is <TT>POST</TT> and the form should be submitted to <TT>zip_response.jsp</TT>. This relative URL is relative to the page it is found on, so the form should be submitted to <TT>http://www.usps.com/zip4/zip_response.jsp</TT> when the <TT>Submit</TT> input is clicked. The only <TT>input</TT> element other than the ... [truncated message content] |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/tags Modified Files: AppletTag.java BaseHrefTag.java BodyTag.java Bullet.java BulletList.java CompositeTag.java Div.java DoctypeTag.java FormTag.java FrameSetTag.java FrameTag.java HeadTag.java Html.java ImageTag.java InputTag.java JspTag.java LabelTag.java LinkTag.java MetaTag.java OptionTag.java ScriptTag.java SelectTag.java Span.java StyleTag.java TableColumn.java TableRow.java TableTag.java Tag.java TextareaTag.java TitleTag.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: AppletTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/AppletTag.java,v retrieving revision 1.33 retrieving revision 1.34 diff -C2 -d -r1.33 -r1.34 *** AppletTag.java 26 Oct 2003 03:53:32 -0000 1.33 --- AppletTag.java 26 Oct 2003 19:46:21 -0000 1.34 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BaseHrefTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/BaseHrefTag.java,v retrieving revision 1.27 retrieving revision 1.28 diff -C2 -d -r1.27 -r1.28 *** BaseHrefTag.java 20 Oct 2003 01:28:03 -0000 1.27 --- BaseHrefTag.java 26 Oct 2003 19:46:22 -0000 1.28 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BodyTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/BodyTag.java,v retrieving revision 1.16 retrieving revision 1.17 diff -C2 -d -r1.16 -r1.17 *** BodyTag.java 20 Oct 2003 01:28:03 -0000 1.16 --- BodyTag.java 26 Oct 2003 19:46:23 -0000 1.17 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Bullet.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/Bullet.java,v retrieving revision 1.16 retrieving revision 1.17 diff -C2 -d -r1.16 -r1.17 *** Bullet.java 20 Oct 2003 01:28:03 -0000 1.16 --- Bullet.java 26 Oct 2003 19:46:23 -0000 1.17 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BulletList.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/BulletList.java,v retrieving revision 1.16 retrieving revision 1.17 diff -C2 -d -r1.16 -r1.17 *** BulletList.java 20 Oct 2003 01:28:03 -0000 1.16 --- BulletList.java 26 Oct 2003 19:46:23 -0000 1.17 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: CompositeTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/CompositeTag.java,v retrieving revision 1.60 retrieving revision 1.61 diff -C2 -d -r1.60 -r1.61 *** CompositeTag.java 25 Oct 2003 20:19:43 -0000 1.60 --- CompositeTag.java 26 Oct 2003 19:46:23 -0000 1.61 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Div.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/Div.java,v retrieving revision 1.16 retrieving revision 1.17 diff -C2 -d -r1.16 -r1.17 *** Div.java 20 Oct 2003 01:28:03 -0000 1.16 --- Div.java 26 Oct 2003 19:46:23 -0000 1.17 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: DoctypeTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/DoctypeTag.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** DoctypeTag.java 20 Oct 2003 01:28:03 -0000 1.30 --- DoctypeTag.java 26 Oct 2003 19:46:23 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FormTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/FormTag.java,v retrieving revision 1.35 retrieving revision 1.36 diff -C2 -d -r1.35 -r1.36 *** FormTag.java 20 Oct 2003 01:28:03 -0000 1.35 --- FormTag.java 26 Oct 2003 19:46:23 -0000 1.36 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FrameSetTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/FrameSetTag.java,v retrieving revision 1.28 retrieving revision 1.29 diff -C2 -d -r1.28 -r1.29 *** FrameSetTag.java 20 Oct 2003 01:28:03 -0000 1.28 --- FrameSetTag.java 26 Oct 2003 19:46:23 -0000 1.29 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FrameTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/FrameTag.java,v retrieving revision 1.27 retrieving revision 1.28 diff -C2 -d -r1.27 -r1.28 *** FrameTag.java 20 Oct 2003 01:28:03 -0000 1.27 --- FrameTag.java 26 Oct 2003 19:46:23 -0000 1.28 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: HeadTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/HeadTag.java,v retrieving revision 1.16 retrieving revision 1.17 diff -C2 -d -r1.16 -r1.17 *** HeadTag.java 20 Oct 2003 01:28:03 -0000 1.16 --- HeadTag.java 26 Oct 2003 19:46:24 -0000 1.17 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Html.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/Html.java,v retrieving revision 1.28 retrieving revision 1.29 diff -C2 -d -r1.28 -r1.29 *** Html.java 20 Oct 2003 01:28:03 -0000 1.28 --- Html.java 26 Oct 2003 19:46:24 -0000 1.29 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ImageTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/ImageTag.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** ImageTag.java 20 Oct 2003 01:28:03 -0000 1.30 --- ImageTag.java 26 Oct 2003 19:46:24 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: InputTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/InputTag.java,v retrieving revision 1.28 retrieving revision 1.29 diff -C2 -d -r1.28 -r1.29 *** InputTag.java 20 Oct 2003 01:28:03 -0000 1.28 --- InputTag.java 26 Oct 2003 19:46:24 -0000 1.29 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: JspTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/JspTag.java,v retrieving revision 1.32 retrieving revision 1.33 diff -C2 -d -r1.32 -r1.33 *** JspTag.java 20 Oct 2003 01:28:03 -0000 1.32 --- JspTag.java 26 Oct 2003 19:46:24 -0000 1.33 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: LabelTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/LabelTag.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** LabelTag.java 20 Oct 2003 01:28:03 -0000 1.29 --- LabelTag.java 26 Oct 2003 19:46:24 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: LinkTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/LinkTag.java,v retrieving revision 1.37 retrieving revision 1.38 diff -C2 -d -r1.37 -r1.38 *** LinkTag.java 20 Oct 2003 01:28:03 -0000 1.37 --- LinkTag.java 26 Oct 2003 19:46:24 -0000 1.38 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: MetaTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/MetaTag.java,v retrieving revision 1.28 retrieving revision 1.29 diff -C2 -d -r1.28 -r1.29 *** MetaTag.java 20 Oct 2003 01:28:03 -0000 1.28 --- MetaTag.java 26 Oct 2003 19:46:24 -0000 1.29 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: OptionTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/OptionTag.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** OptionTag.java 20 Oct 2003 01:28:03 -0000 1.31 --- OptionTag.java 26 Oct 2003 19:46:24 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ScriptTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/ScriptTag.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** ScriptTag.java 20 Oct 2003 01:28:03 -0000 1.29 --- ScriptTag.java 26 Oct 2003 19:46:24 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: SelectTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/SelectTag.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** SelectTag.java 20 Oct 2003 01:28:03 -0000 1.30 --- SelectTag.java 26 Oct 2003 19:46:24 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Span.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/Span.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** Span.java 20 Oct 2003 01:28:03 -0000 1.30 --- Span.java 26 Oct 2003 19:46:24 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: StyleTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/StyleTag.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** StyleTag.java 20 Oct 2003 01:28:03 -0000 1.29 --- StyleTag.java 26 Oct 2003 19:46:24 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TableColumn.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/TableColumn.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** TableColumn.java 20 Oct 2003 01:28:03 -0000 1.30 --- TableColumn.java 26 Oct 2003 19:46:24 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TableRow.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/TableRow.java,v retrieving revision 1.32 retrieving revision 1.33 diff -C2 -d -r1.32 -r1.33 *** TableRow.java 20 Oct 2003 01:28:03 -0000 1.32 --- TableRow.java 26 Oct 2003 19:46:24 -0000 1.33 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TableTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/TableTag.java,v retrieving revision 1.33 retrieving revision 1.34 diff -C2 -d -r1.33 -r1.34 *** TableTag.java 20 Oct 2003 01:28:03 -0000 1.33 --- TableTag.java 26 Oct 2003 19:46:24 -0000 1.34 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Tag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/Tag.java,v retrieving revision 1.53 retrieving revision 1.54 diff -C2 -d -r1.53 -r1.54 *** Tag.java 20 Oct 2003 01:28:03 -0000 1.53 --- Tag.java 26 Oct 2003 19:46:24 -0000 1.54 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TextareaTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/TextareaTag.java,v retrieving revision 1.27 retrieving revision 1.28 diff -C2 -d -r1.27 -r1.28 *** TextareaTag.java 20 Oct 2003 01:28:03 -0000 1.27 --- TextareaTag.java 26 Oct 2003 19:46:24 -0000 1.28 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TitleTag.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/TitleTag.java,v retrieving revision 1.28 retrieving revision 1.29 diff -C2 -d -r1.28 -r1.29 *** TitleTag.java 20 Oct 2003 01:28:03 -0000 1.28 --- TitleTag.java 26 Oct 2003 19:46:24 -0000 1.29 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tags/package.html,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** package.html 22 Sep 2003 02:40:02 -0000 1.15 --- package.html 26 Oct 2003 19:46:24 -0000 1.16 *************** *** 6,10 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 6,10 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
From: <der...@us...> - 2003-10-26 19:48:34
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/codeMetrics In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/tests/codeMetrics Modified Files: LineCounter.java Log Message: Update version headers to 1.4-20031026 and update changelog. Index: LineCounter.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/codeMetrics/LineCounter.java,v retrieving revision 1.8 retrieving revision 1.9 diff -C2 -d -r1.8 -r1.9 *** LineCounter.java 22 Sep 2003 02:40:05 -0000 1.8 --- LineCounter.java 26 Oct 2003 19:46:25 -0000 1.9 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/nodeDecorators In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/nodeDecorators Modified Files: AbstractNodeDecorator.java DecodingNode.java EscapeCharacterRemovingNode.java NonBreakingSpaceConvertingNode.java Log Message: Update version headers to 1.4-20031026 and update changelog. Index: AbstractNodeDecorator.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/nodeDecorators/AbstractNodeDecorator.java,v retrieving revision 1.12 retrieving revision 1.13 diff -C2 -d -r1.12 -r1.13 *** AbstractNodeDecorator.java 5 Oct 2003 13:49:49 -0000 1.12 --- AbstractNodeDecorator.java 26 Oct 2003 19:46:18 -0000 1.13 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: DecodingNode.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/nodeDecorators/DecodingNode.java,v retrieving revision 1.11 retrieving revision 1.12 diff -C2 -d -r1.11 -r1.12 *** DecodingNode.java 22 Sep 2003 02:39:59 -0000 1.11 --- DecodingNode.java 26 Oct 2003 19:46:19 -0000 1.12 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: EscapeCharacterRemovingNode.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/nodeDecorators/EscapeCharacterRemovingNode.java,v retrieving revision 1.9 retrieving revision 1.10 diff -C2 -d -r1.9 -r1.10 *** EscapeCharacterRemovingNode.java 22 Sep 2003 02:39:59 -0000 1.9 --- EscapeCharacterRemovingNode.java 26 Oct 2003 19:46:19 -0000 1.10 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: NonBreakingSpaceConvertingNode.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/nodeDecorators/NonBreakingSpaceConvertingNode.java,v retrieving revision 1.9 retrieving revision 1.10 diff -C2 -d -r1.9 -r1.10 *** NonBreakingSpaceConvertingNode.java 22 Sep 2003 02:39:59 -0000 1.9 --- NonBreakingSpaceConvertingNode.java 26 Oct 2003 19:46:19 -0000 1.10 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/scanners Modified Files: AppletScanner.java BaseHrefScanner.java BodyScanner.java BulletListScanner.java BulletScanner.java CompositeTagScanner.java DivScanner.java DoctypeScanner.java FormScanner.java FrameScanner.java FrameSetScanner.java HeadScanner.java HtmlScanner.java ImageScanner.java InputTagScanner.java JspScanner.java LabelScanner.java LinkScanner.java MetaTagScanner.java OptionTagScanner.java ScriptScanner.java SelectTagScanner.java SpanScanner.java StyleScanner.java TableColumnScanner.java TableRowScanner.java TableScanner.java TagScanner.java TextareaTagScanner.java TitleScanner.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: AppletScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/AppletScanner.java,v retrieving revision 1.33 retrieving revision 1.34 diff -C2 -d -r1.33 -r1.34 *** AppletScanner.java 20 Oct 2003 01:28:02 -0000 1.33 --- AppletScanner.java 26 Oct 2003 19:46:19 -0000 1.34 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BaseHrefScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/BaseHrefScanner.java,v retrieving revision 1.27 retrieving revision 1.28 diff -C2 -d -r1.27 -r1.28 *** BaseHrefScanner.java 20 Oct 2003 01:28:03 -0000 1.27 --- BaseHrefScanner.java 26 Oct 2003 19:46:19 -0000 1.28 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BodyScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/BodyScanner.java,v retrieving revision 1.19 retrieving revision 1.20 diff -C2 -d -r1.19 -r1.20 *** BodyScanner.java 20 Oct 2003 01:28:03 -0000 1.19 --- BodyScanner.java 26 Oct 2003 19:46:19 -0000 1.20 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BulletListScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/BulletListScanner.java,v retrieving revision 1.18 retrieving revision 1.19 diff -C2 -d -r1.18 -r1.19 *** BulletListScanner.java 20 Oct 2003 01:28:03 -0000 1.18 --- BulletListScanner.java 26 Oct 2003 19:46:19 -0000 1.19 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BulletScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/BulletScanner.java,v retrieving revision 1.23 retrieving revision 1.24 diff -C2 -d -r1.23 -r1.24 *** BulletScanner.java 20 Oct 2003 01:28:03 -0000 1.23 --- BulletScanner.java 26 Oct 2003 19:46:19 -0000 1.24 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: CompositeTagScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/CompositeTagScanner.java,v retrieving revision 1.72 retrieving revision 1.73 diff -C2 -d -r1.72 -r1.73 *** CompositeTagScanner.java 26 Oct 2003 16:04:26 -0000 1.72 --- CompositeTagScanner.java 26 Oct 2003 19:46:19 -0000 1.73 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: DivScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/DivScanner.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** DivScanner.java 20 Oct 2003 01:28:03 -0000 1.31 --- DivScanner.java 26 Oct 2003 19:46:19 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: DoctypeScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/DoctypeScanner.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** DoctypeScanner.java 20 Oct 2003 01:28:03 -0000 1.29 --- DoctypeScanner.java 26 Oct 2003 19:46:19 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FormScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/FormScanner.java,v retrieving revision 1.46 retrieving revision 1.47 diff -C2 -d -r1.46 -r1.47 *** FormScanner.java 20 Oct 2003 01:28:03 -0000 1.46 --- FormScanner.java 26 Oct 2003 19:46:19 -0000 1.47 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FrameScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/FrameScanner.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** FrameScanner.java 20 Oct 2003 01:28:03 -0000 1.30 --- FrameScanner.java 26 Oct 2003 19:46:19 -0000 1.31 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FrameSetScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/FrameSetScanner.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** FrameSetScanner.java 20 Oct 2003 01:28:03 -0000 1.29 --- FrameSetScanner.java 26 Oct 2003 19:46:19 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: HeadScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/HeadScanner.java,v retrieving revision 1.16 retrieving revision 1.17 diff -C2 -d -r1.16 -r1.17 *** HeadScanner.java 20 Oct 2003 01:28:03 -0000 1.16 --- HeadScanner.java 26 Oct 2003 19:46:19 -0000 1.17 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: HtmlScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/HtmlScanner.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** HtmlScanner.java 20 Oct 2003 01:28:03 -0000 1.31 --- HtmlScanner.java 26 Oct 2003 19:46:19 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ImageScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/ImageScanner.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** ImageScanner.java 20 Oct 2003 01:28:03 -0000 1.31 --- ImageScanner.java 26 Oct 2003 19:46:20 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: InputTagScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/InputTagScanner.java,v retrieving revision 1.27 retrieving revision 1.28 diff -C2 -d -r1.27 -r1.28 *** InputTagScanner.java 20 Oct 2003 01:28:03 -0000 1.27 --- InputTagScanner.java 26 Oct 2003 19:46:20 -0000 1.28 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: JspScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/JspScanner.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** JspScanner.java 20 Oct 2003 01:28:03 -0000 1.29 --- JspScanner.java 26 Oct 2003 19:46:20 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: LabelScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/LabelScanner.java,v retrieving revision 1.34 retrieving revision 1.35 diff -C2 -d -r1.34 -r1.35 *** LabelScanner.java 20 Oct 2003 01:28:03 -0000 1.34 --- LabelScanner.java 26 Oct 2003 19:46:20 -0000 1.35 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: LinkScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/LinkScanner.java,v retrieving revision 1.55 retrieving revision 1.56 diff -C2 -d -r1.55 -r1.56 *** LinkScanner.java 20 Oct 2003 01:28:03 -0000 1.55 --- LinkScanner.java 26 Oct 2003 19:46:20 -0000 1.56 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: MetaTagScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/MetaTagScanner.java,v retrieving revision 1.27 retrieving revision 1.28 diff -C2 -d -r1.27 -r1.28 *** MetaTagScanner.java 20 Oct 2003 01:28:03 -0000 1.27 --- MetaTagScanner.java 26 Oct 2003 19:46:21 -0000 1.28 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: OptionTagScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/OptionTagScanner.java,v retrieving revision 1.34 retrieving revision 1.35 diff -C2 -d -r1.34 -r1.35 *** OptionTagScanner.java 20 Oct 2003 01:28:03 -0000 1.34 --- OptionTagScanner.java 26 Oct 2003 19:46:21 -0000 1.35 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ScriptScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/ScriptScanner.java,v retrieving revision 1.43 retrieving revision 1.44 diff -C2 -d -r1.43 -r1.44 *** ScriptScanner.java 20 Oct 2003 01:28:03 -0000 1.43 --- ScriptScanner.java 26 Oct 2003 19:46:21 -0000 1.44 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: SelectTagScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/SelectTagScanner.java,v retrieving revision 1.32 retrieving revision 1.33 diff -C2 -d -r1.32 -r1.33 *** SelectTagScanner.java 20 Oct 2003 01:28:03 -0000 1.32 --- SelectTagScanner.java 26 Oct 2003 19:46:21 -0000 1.33 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: SpanScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/SpanScanner.java,v retrieving revision 1.33 retrieving revision 1.34 diff -C2 -d -r1.33 -r1.34 *** SpanScanner.java 20 Oct 2003 01:28:03 -0000 1.33 --- SpanScanner.java 26 Oct 2003 19:46:21 -0000 1.34 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: StyleScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/StyleScanner.java,v retrieving revision 1.28 retrieving revision 1.29 diff -C2 -d -r1.28 -r1.29 *** StyleScanner.java 20 Oct 2003 01:28:03 -0000 1.28 --- StyleScanner.java 26 Oct 2003 19:46:21 -0000 1.29 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TableColumnScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/TableColumnScanner.java,v retrieving revision 1.36 retrieving revision 1.37 diff -C2 -d -r1.36 -r1.37 *** TableColumnScanner.java 20 Oct 2003 01:28:03 -0000 1.36 --- TableColumnScanner.java 26 Oct 2003 19:46:21 -0000 1.37 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TableRowScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/TableRowScanner.java,v retrieving revision 1.39 retrieving revision 1.40 diff -C2 -d -r1.39 -r1.40 *** TableRowScanner.java 20 Oct 2003 01:28:03 -0000 1.39 --- TableRowScanner.java 26 Oct 2003 19:46:21 -0000 1.40 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TableScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/TableScanner.java,v retrieving revision 1.38 retrieving revision 1.39 diff -C2 -d -r1.38 -r1.39 *** TableScanner.java 20 Oct 2003 01:28:03 -0000 1.38 --- TableScanner.java 26 Oct 2003 19:46:21 -0000 1.39 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TagScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/TagScanner.java,v retrieving revision 1.43 retrieving revision 1.44 diff -C2 -d -r1.43 -r1.44 *** TagScanner.java 20 Oct 2003 01:28:03 -0000 1.43 --- TagScanner.java 26 Oct 2003 19:46:21 -0000 1.44 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TextareaTagScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/TextareaTagScanner.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** TextareaTagScanner.java 20 Oct 2003 01:28:03 -0000 1.29 --- TextareaTagScanner.java 26 Oct 2003 19:46:21 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TitleScanner.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/TitleScanner.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** TitleScanner.java 25 Oct 2003 15:46:02 -0000 1.31 --- TitleScanner.java 26 Oct 2003 19:46:21 -0000 1.32 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/package.html,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** package.html 22 Sep 2003 02:40:00 -0000 1.15 --- package.html 26 Oct 2003 19:46:21 -0000 1.16 *************** *** 6,10 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 6,10 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/nodes In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/lexer/nodes Modified Files: Attribute.java PageAttribute.java RemarkNode.java StringNode.java TagNode.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: Attribute.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/nodes/Attribute.java,v retrieving revision 1.12 retrieving revision 1.13 diff -C2 -d -r1.12 -r1.13 *** Attribute.java 18 Oct 2003 20:50:37 -0000 1.12 --- Attribute.java 26 Oct 2003 19:46:18 -0000 1.13 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: PageAttribute.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/nodes/PageAttribute.java,v retrieving revision 1.2 retrieving revision 1.3 diff -C2 -d -r1.2 -r1.3 *** PageAttribute.java 26 Oct 2003 17:58:25 -0000 1.2 --- PageAttribute.java 26 Oct 2003 19:46:18 -0000 1.3 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: RemarkNode.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/nodes/RemarkNode.java,v retrieving revision 1.9 retrieving revision 1.10 diff -C2 -d -r1.9 -r1.10 *** RemarkNode.java 25 Oct 2003 15:46:02 -0000 1.9 --- RemarkNode.java 26 Oct 2003 19:46:18 -0000 1.10 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: StringNode.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/nodes/StringNode.java,v retrieving revision 1.10 retrieving revision 1.11 diff -C2 -d -r1.10 -r1.11 *** StringNode.java 20 Oct 2003 01:28:02 -0000 1.10 --- StringNode.java 26 Oct 2003 19:46:18 -0000 1.11 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TagNode.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/nodes/TagNode.java,v retrieving revision 1.18 retrieving revision 1.19 diff -C2 -d -r1.18 -r1.19 *** TagNode.java 20 Oct 2003 01:28:02 -0000 1.18 --- TagNode.java 26 Oct 2003 19:46:18 -0000 1.19 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/nodes/package.html,v retrieving revision 1.6 retrieving revision 1.7 diff -C2 -d -r1.6 -r1.7 *** package.html 26 Oct 2003 17:58:25 -0000 1.6 --- package.html 26 Oct 2003 19:46:18 -0000 1.7 *************** *** 7,11 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 7,11 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
From: <der...@us...> - 2003-10-26 19:48:33
|
Update of /cvsroot/htmlparser/htmlparser/docs In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/docs Modified Files: changes.txt release.txt Log Message: Update version headers to 1.4-20031026 and update changelog. Index: changes.txt =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/changes.txt,v retrieving revision 1.190 retrieving revision 1.191 diff -C2 -d -r1.190 -r1.191 *** changes.txt 22 Sep 2003 02:39:58 -0000 1.190 --- changes.txt 26 Oct 2003 19:46:16 -0000 1.191 *************** *** 13,16 **** --- 13,449 ---- ******************************************************************************* + Integration Build 1.4 - 20031026 + -------------------------------- + 2003-10-26 12:58 derrickoswald + + * src/org/htmlparser/lexer/: PageIndex.java, package.html, + nodes/PageAttribute.java, nodes/package.html: + + Doco update. Move the lexer from future tense to current. + + 2003-10-26 11:44 derrickoswald + + * src/org/htmlparser/lexerapplications/thumbelina/Thumbelina.java: + + Get thumbelina working again. The tag.getName() method doesn't include the / of end tags. + + 2003-10-26 11:04 derrickoswald + + * src/org/htmlparser/: scanners/CompositeTagScanner.java, + tests/parserHelperTests/CompositeTagScannerHelperTest.java: + + Oops, remove references to CompositeTagScannerHelper. + + 2003-10-26 10:50 derrickoswald + + * src/org/htmlparser/scanners/CompositeTagScanner.java: + + Removed the need for CompositeTagScannerHelper, finally getting rid of the parserHelper package. + + 2003-10-25 23:53 derrickoswald + + * src/org/htmlparser/: lexer/Page.java, tags/AppletTag.java, + tests/ParserTest.java, tests/ParserTestCase.java, + tests/lexerTests/StreamTests.java, + tests/scannersTests/BulletScannerTest.java, + tests/tagTests/OptionTagTest.java, + visitors/LinkFindingVisitor.java: + + Quiet down the test output. + + 2003-10-25 16:19 derrickoswald + + * src/org/htmlparser/: tags/CompositeTag.java, + tests/parserHelperTests/CompositeTagScannerHelperTest.java, + tests/parserHelperTests/StringParserTest.java, + tests/scannersTests/CompositeTagScannerTest.java, + tests/scannersTests/LinkScannerTest.java, + tests/tagTests/JspTagTest.java, tests/tagTests/OptionTagTest.java, + tests/tagTests/SelectTagTest.java, tests/tagTests/TagTest.java: + + Clean up the last few failing tests. + *** The bar is green again *** + + 2003-10-25 11:46 derrickoswald + + * src/org/htmlparser/: lexer/Lexer.java, + lexer/nodes/RemarkNode.java, scanners/TitleScanner.java, + tests/scannersTests/TitleScannerTest.java: + + Handle some broken end tags. + Handle some pathological remark nodes. + + 2003-10-25 08:03 derrickoswald + + * build.xml, bin/parser: + + Fix htmllexer.jar, add parser linux/unix script. + + 2003-10-20 22:24 derrickoswald + + * src/org/htmlparser/tests/: AllTests.java, + AssertXmlEqualsTest.java, FunctionalTests.java, + LineNumberAssignedByNodeReaderTest.java, ParserTest.java, + ParserTestCase.java, lexerTests/AllTests.java, + lexerTests/AttributeTests.java, lexerTests/LexerTests.java, + lexerTests/PageIndexTests.java, lexerTests/PageTests.java, + lexerTests/SourceTests.java, lexerTests/StreamTests.java, + lexerTests/TagTests.java, nodeDecoratorTests/AllTests.java, + nodeDecoratorTests/DecodingNodeTest.java, + nodeDecoratorTests/EscapeCharacterRemovingNodeTest.java, + nodeDecoratorTests/NonBreakingSpaceConvertingNodeTest.java, + parserHelperTests/AllTests.java, + parserHelperTests/CompositeTagScannerHelperTest.java, + parserHelperTests/RemarkNodeParserTest.java, + parserHelperTests/StringParserTest.java, + scannersTests/AllTests.java, scannersTests/AppletScannerTest.java, + scannersTests/BaseHREFScannerTest.java, + scannersTests/BodyScannerTest.java, + scannersTests/BulletListScannerTest.java, + scannersTests/BulletScannerTest.java, + scannersTests/CompositeTagScannerTest.java, + scannersTests/DivScannerTest.java, + scannersTests/FormScannerTest.java, + scannersTests/FrameScannerTest.java, + scannersTests/FrameSetScannerTest.java, + scannersTests/HeadScannerTest.java, scannersTests/HtmlTest.java, + scannersTests/ImageScannerTest.java, + scannersTests/InputTagScannerTest.java, + scannersTests/JspScannerTest.java, + scannersTests/LabelScannerTest.java, + scannersTests/LinkScannerTest.java, + scannersTests/MetaTagScannerTest.java, + scannersTests/OptionTagScannerTest.java, + scannersTests/ScriptScannerTest.java, + scannersTests/SelectTagScannerTest.java, + scannersTests/SpanScannerTest.java, + scannersTests/StyleScannerTest.java, + scannersTests/TableScannerTest.java, + scannersTests/TagScannerTest.java, + scannersTests/TextareaTagScannerTest.java, + scannersTests/TitleScannerTest.java, + scannersTests/XmlEndTagScanningTest.java, tagTests/AllTests.java, + tagTests/AppletTagTest.java, tagTests/BaseHrefTagTest.java, + tagTests/BodyTagTest.java, tagTests/CompositeTagTest.java, + tagTests/DoctypeTagTest.java, tagTests/EndTagTest.java, + tagTests/FormTagTest.java, tagTests/FrameSetTagTest.java, + tagTests/FrameTagTest.java, tagTests/ImageTagTest.java, + tagTests/InputTagTest.java, tagTests/JspTagTest.java, + tagTests/LinkTagTest.java, tagTests/MetaTagTest.java, + tagTests/ObjectCollectionTest.java, tagTests/OptionTagTest.java, + tagTests/ScriptTagTest.java, tagTests/SelectTagTest.java, + tagTests/StyleTagTest.java, tagTests/TagTest.java, + tagTests/TextareaTagTest.java, tagTests/TitleTagTest.java, + utilTests/AllTests.java, utilTests/BeanTest.java, + utilTests/CharacterTranslationTest.java, + utilTests/HTMLLinkProcessorTest.java, + utilTests/HTMLParserUtilsTest.java, utilTests/NodeListTest.java, + utilTests/SortTest.java, visitorsTests/AllTests.java, + visitorsTests/CompositeTagFindingVisitorTest.java, + visitorsTests/HtmlPageTest.java, + visitorsTests/LinkFindingVisitorTest.java, + visitorsTests/NodeVisitorTest.java, + visitorsTests/StringFindingVisitorTest.java, + visitorsTests/TagFindingVisitorTest.java, + visitorsTests/TextExtractingVisitorTest.java, + visitorsTests/UrlModifyingVisitorTest.java: + + Consolidated the various testing main() methods into ParserTestCase. + All unit test classes in the org.htmlparser.tests heirarchy should now be executable. + + 2003-10-19 21:28 derrickoswald + + * src/org/htmlparser/: AbstractNode.java, Parser.java, + lexer/Lexer.java, lexer/nodes/AbstractNode.java, + lexer/nodes/NodeFactory.java, lexer/nodes/RemarkNode.java, + lexer/nodes/StringNode.java, lexer/nodes/TagNode.java, + scanners/AppletScanner.java, scanners/BaseHrefScanner.java, + scanners/BodyScanner.java, scanners/BulletListScanner.java, + scanners/BulletScanner.java, scanners/CompositeTagScanner.java, + scanners/DivScanner.java, scanners/DoctypeScanner.java, + scanners/FormScanner.java, scanners/FrameScanner.java, + scanners/FrameSetScanner.java, scanners/HeadScanner.java, + scanners/HtmlScanner.java, scanners/ImageScanner.java, + scanners/InputTagScanner.java, scanners/JspScanner.java, + scanners/LabelScanner.java, scanners/LinkScanner.java, + scanners/MetaTagScanner.java, scanners/OptionTagScanner.java, + scanners/ScriptScanner.java, scanners/SelectTagScanner.java, + scanners/SpanScanner.java, scanners/StyleScanner.java, + scanners/TableColumnScanner.java, scanners/TableRowScanner.java, + scanners/TableScanner.java, scanners/TagScanner.java, + scanners/TextareaTagScanner.java, scanners/TitleScanner.java, + tags/AppletTag.java, tags/BaseHrefTag.java, tags/BodyTag.java, + tags/Bullet.java, tags/BulletList.java, tags/CompositeTag.java, + tags/Div.java, tags/DoctypeTag.java, tags/FormTag.java, + tags/FrameSetTag.java, tags/FrameTag.java, tags/HeadTag.java, + tags/Html.java, tags/ImageTag.java, tags/InputTag.java, + tags/JspTag.java, tags/LabelTag.java, tags/LinkTag.java, + tags/MetaTag.java, tags/OptionTag.java, tags/ScriptTag.java, + tags/SelectTag.java, tags/Span.java, tags/StyleTag.java, + tags/TableColumn.java, tags/TableRow.java, tags/TableTag.java, + tags/Tag.java, tags/TextareaTag.java, tags/TitleTag.java, + tests/FunctionalTests.java, + tests/LineNumberAssignedByNodeReaderTest.java, + tests/ParserTestCase.java, tests/lexerTests/AttributeTests.java, + tests/lexerTests/KitTest.java, + tests/parserHelperTests/CompositeTagScannerHelperTest.java, + tests/scannersTests/CompositeTagScannerTest.java, + tests/scannersTests/ImageScannerTest.java, + tests/scannersTests/LinkScannerTest.java, + tests/scannersTests/TableScannerTest.java, + tests/scannersTests/TagScannerTest.java, + tests/tagTests/BaseHrefTagTest.java, + tests/tagTests/LinkTagTest.java, tests/tagTests/ScriptTagTest.java, + tests/utilTests/NodeListTest.java, + tests/visitorsTests/UrlModifyingVisitorTest.java, + util/LinkProcessor.java, util/NodeList.java: + + Removed lexer level AbstractNode. + Removed data package from parser level tags. + Separated tag creation from recursion in NodeFactory interface. + + 2003-10-18 16:50 derrickoswald + + * src/org/htmlparser/: lexer/Lexer.java, + lexer/nodes/Attribute.java, lexer/nodes/PageAttribute.java, + lexer/nodes/TagNode.java, tags/AppletTag.java, + tests/lexerTests/AttributeTests.java, + tests/scannersTests/FormScannerTest.java, + tests/scannersTests/LinkScannerTest.java, + tests/tagTests/AppletTagTest.java, tests/tagTests/FormTagTest.java, + tests/tagTests/JspTagTest.java, tests/tagTests/ScriptTagTest.java, + tests/tagTests/TagTest.java, tests/utilTests/AllTests.java, + tests/utilTests/HTMLTagParserTest.java, util/NodeList.java: + + Partition Attribute into a base class and PageAttribute class for the Lexer. + Fixed the AppletTag.setAppletParams in a cheesy manner. + Clear out the released NodeList entry on remove(). + Dropped the HTMLTagParserTest tests, because they really weren't relevant any more. + + 2003-10-13 17:48 derrickoswald + + * src/org/htmlparser/: Parser.java, lexer/Cursor.java, + lexer/Lexer.java, lexer/Page.java, lexer/nodes/Attribute.java, + lexer/nodes/TagNode.java, scanners/ScriptScanner.java, + tests/AllTests.java, tests/lexerTests/AllTests.java, + tests/lexerTests/AttributeTests.java, + tests/lexerTests/TagTests.java, + tests/scannersTests/JspScannerTest.java, + tests/scannersTests/MetaTagScannerTest.java, + tests/scannersTests/ScriptScannerTest.java, + tests/tagTests/FormTagTest.java, tests/tagTests/InputTagTest.java, + tests/tagTests/JspTagTest.java, tests/tagTests/MetaTagTest.java, + tests/tagTests/TagTest.java, tests/tagTests/TextareaTagTest.java: + + Eliminated ParserHelper static class. + Add fixAttributes() to handle bad tags. + Provide for more than just an equals sign between the attribute name and the value. + Unquote the values in getAttributes() hashtable. + Fixed a bug regarding factory creation in script scanner. + Returned temporaryFailures classes to servicability. + Skip JSP testing, fix tests broken because of unquoted attribute values. + Some JavaDoc cleanup. + + 2003-10-05 21:43 derrickoswald + + * src/org/htmlparser/: tags/JspTag.java, + tests/parserHelperTests/RemarkNodeParserTest.java, + tests/parserHelperTests/StringParserTest.java, + tests/scannersTests/BodyScannerTest.java, + tests/scannersTests/BulletListScannerTest.java, + tests/scannersTests/CompositeTagScannerTest.java, + tests/scannersTests/FormScannerTest.java, + tests/scannersTests/LabelScannerTest.java, + tests/scannersTests/LinkScannerTest.java, + tests/scannersTests/MetaTagScannerTest.java, + tests/scannersTests/StyleScannerTest.java, + tests/scannersTests/TableScannerTest.java, + tests/scannersTests/TitleScannerTest.java, + tests/tagTests/AppletTagTest.java, + tests/tagTests/BaseHrefTagTest.java, + tests/tagTests/EndTagTest.java, tests/tagTests/FormTagTest.java, + tests/tagTests/FrameSetTagTest.java, + tests/tagTests/FrameTagTest.java, tests/tagTests/ImageTagTest.java, + tests/tagTests/InputTagTest.java, tests/tagTests/LinkTagTest.java, + tests/tagTests/MetaTagTest.java, tests/tagTests/OptionTagTest.java, + tests/tagTests/ScriptTagTest.java, + tests/tagTests/SelectTagTest.java, + tests/tagTests/StyleTagTest.java, tests/tagTests/TagTest.java, + tests/tagTests/TextareaTagTest.java: + + Updated tests to correspond to new behaviour. + Mostly due to changes in order and case of tag contents. + Of the forty odd remaining failing tests, the majority comprise altered functionality that needs to be resolved. + + 2003-10-05 09:49 derrickoswald + + * src/org/htmlparser/: AbstractNode.java, Node.java, + lexer/Cursor.java, lexer/Lexer.java, lexer/nodes/Attribute.java, + lexer/nodes/TagNode.java, + nodeDecorators/AbstractNodeDecorator.java, + scanners/CompositeTagScanner.java, scanners/ImageScanner.java, + scanners/LinkScanner.java, scanners/ScriptScanner.java, + scanners/TagScanner.java, tests/ParserTest.java, + tests/ParserTestCase.java, + tests/scannersTests/AppletScannerTest.java, + tests/scannersTests/FormScannerTest.java, + tests/scannersTests/FrameScannerTest.java, + tests/scannersTests/ImageScannerTest.java, + tests/scannersTests/JspScannerTest.java, + tests/scannersTests/LabelScannerTest.java, + tests/scannersTests/LinkScannerTest.java, + tests/scannersTests/MetaTagScannerTest.java, + tests/scannersTests/OptionTagScannerTest.java, + tests/scannersTests/ScriptScannerTest.java, + tests/scannersTests/TagScannerTest.java, + tests/scannersTests/TitleScannerTest.java, + tests/tagTests/AppletTagTest.java, + tests/tagTests/DoctypeTagTest.java, tests/tagTests/JspTagTest.java, + tests/tagTests/LinkTagTest.java, tests/tagTests/MetaTagTest.java, + tests/tagTests/ScriptTagTest.java, util/IteratorImpl.java, + util/NodeList.java: + + Add bean like accessors for positions on Node, AbstractNode and AbstractNodeDecorator. + Handle null page in Cursor. + Add smartquotes mode in Lexer and CompositeTagScannerHelper. + Add simple name constructor in Attribute. + Remove emptyxmltag member, replace with computing accessors in TagNode. + Removed ScriptScannerHelper and moved scanning logic to ScriptScanner. + Reworked extractImageLocn in ImageScanner + Implement extractXMLData in TagScanner. + Made virtual tags zero length in TagData. + Added push() to IteratorImpl. + Added single node constructor to NodeList. + Numerous and various test adjustments. Still 133 failures. + + 2003-10-02 22:15 derrickoswald + + * src/org/htmlparser/: lexer/nodes/StringNode.java, + tags/CompositeTag.java, tags/FrameSetTag.java, tags/SelectTag.java, + tests/AllTests.java, tests/ParserTestCase.java: + + Fix all testcases generating exceptions. Still 160 failures. + + 2003-10-02 20:20 derrickoswald + + * src/org/htmlparser/: Parser.java, lexer/nodes/TagNode.java, + tests/LineNumberAssignedByNodeReaderTest.java: + + Updated tag line numbers test. + ***** Line numbers reported by tags are now zero based, not one based. ***** + Strip off possible ending slash in tag name. + + 2003-10-02 19:48 derrickoswald + + * src/org/htmlparser/: lexer/nodes/Attribute.java, + lexer/nodes/TagNode.java, tags/Tag.java, tests/ParserTestCase.java, + tests/tagTests/TagTest.java, util/ParserUtils.java, + util/SpecialHashtable.java: + + Moved SpecialHashTable to util. + Fixed some attribute bugs and some test cases. + + 2003-09-29 22:12 derrickoswald + + * src/org/htmlparser/: lexer/Page.java, tags/Tag.java: + + Doco update. Privatize tag fields leading up to removal. + + 2003-09-28 20:00 derrickoswald + + * src/org/htmlparser/: Parser.java, lexer/Cursor.java, + lexer/Lexer.java, lexer/Page.java, lexer/PageIndex.java, + lexer/Source.java, tests/utilTests/BeanTest.java: + + Fix broken serializability. + + 2003-09-28 15:30 derrickoswald + + * src/org/htmlparser/: Parser.java, RemarkNode.java, + StringNode.java, beans/StringBean.java, tags/CompositeTag.java, + tags/ImageTag.java, tags/LinkTag.java, tags/Tag.java, + tags/TitleTag.java, + tests/visitorsTests/UrlModifyingVisitorTest.java, + util/LinkProcessor.java, visitors/HtmlPage.java, + visitors/NodeVisitor.java, visitors/TagFindingVisitor.java, + visitors/TextExtractingVisitor.java, + visitors/UrlModifyingVisitor.java: + + Fixed up the broken visitor logic. + Added some docos on NodeVisitor. + + 2003-09-28 11:33 derrickoswald + + * src/org/htmlparser/: AbstractNode.java, NodeReader.java, + Parser.java, RemarkNode.java, RemarkNodeParser.java, + StringNode.java, beans/StringBean.java, lexer/Cursor.java, + lexer/Lexer.java, lexer/Page.java, lexer/Source.java, + lexer/nodes/StringNode.java, lexer/nodes/TagNode.java, + lexer/nodes/NodeFactory.java, scanners/CompositeTagScanner.java, + scanners/DoctypeScanner.java, scanners/ImageScanner.java, + scanners/JspScanner.java, scanners/ScriptScanner.java, + scanners/TagScanner.java, tags/AppletTag.java, + tags/CompositeTag.java, tags/DoctypeTag.java, tags/EndTag.java, + tags/ImageTag.java, tags/JspTag.java, tags/StyleTag.java, + tags/Tag.java, tests/ParserTest.java, tests/ParserTestCase.java, + tests/lexerTests/LexerTests.java, + tests/parserHelperTests/CompositeTagScannerHelperTest.java, + tests/scannersTests/CompositeTagScannerTest.java, + tests/scannersTests/ImageScannerTest.java, + tests/scannersTests/LinkScannerTest.java, + tests/scannersTests/MetaTagScannerTest.java, + tests/scannersTests/TagScannerTest.java, + tests/tagTests/BaseHrefTagTest.java, + tests/tagTests/EndTagTest.java, tests/tagTests/LinkTagTest.java, + tests/tagTests/ScriptTagTest.java, tests/tagTests/TagTest.java, + tests/utilTests/HTMLTagParserTest.java, util/Generate.java, + util/IteratorImpl.java, util/ParserUtils.java, + visitors/HtmlPage.java, visitors/NodeVisitor.java, + visitors/TagFindingVisitor.java, + visitors/TextExtractingVisitor.java, + visitors/UrlModifyingVisitor.java: + + Lexer Integration + Removed old Parser classes. + Removed EndTag, this class was replaced by a call to the new isEndTag() method on the Tag class + The StringNode, RemarkNode and tags.Tag class now derive from their lexeme counterparts in lexer.nodes instead of the other way around. + The beginnings of a node factory interface are included. This was added so the lexer could return 'visitable' nodes to the parser. The parser acts as it's own node factory, as does the Lexer. + The node count for parsing goes up in most cases because every whitespace (i.e. newline) now counts as a StringNode. This has whacked out a lot of the tests that were expecting fewer nodes or a certain type of node at a particular index. + Attributes now maintain their order and case. The count of attributes also went up because whitespace is maintained within tags too. The storage in a Vector means the element 0 Attribute is actually the name of the tag, rather than having the $TAGNAME entry in a HashTable. + + 2003-09-22 23:41 derrickoswald + + * build.xml, cvs2cl.pl, htmlparser_checks.xml, java.header, + src/org/htmlparser/lexer/nodes/TagNode.java, + src/org/htmlparser/tags/AppletTag.java, bin/crawler.bat, bin/lexer, + bin/lexer.bat, bin/parser.bat, bin/ripper.bat, bin/thumbelina, + bin/thumbelina.bat, lib/fit.jar, resources/cvs2cl.pl, + resources/fit.jar, resources/htmlparser_checks.xml, + resources/java.header, resources/lexer, resources/runCrawler.bat, + resources/runLexer.bat, resources/runParser.bat, + resources/runRipper.bat, resources/runThumbelina.bat, + resources/thumbelina: + + Distribution cleanup. + + - Removed duplicate documentation files from src.zip. + - Jars are now built in lib, and stay there, rather than being deleting in the clean task. + *** NOTE *** No more release directory. + - Added checkstyle-all-3.1.jar to the lib directory, so others can run it too. + - Moved executable scripts from resources to a new bin directory + so they can be executed in a development environment. + - Moved fit.jar from resources to the lib directory. + This left the resources directory empty, but... + - Moved cvs2cl and checkstyle files into the resources directory. + - Eliminated staging of source files and release files just to construct a + zip. These are now aggregated by their respective zip tasks. + - Changed name of changeLog task to changelog. + - Fixed a few javadoc warnings. + - Removed the spurious 'run' from the front of all the names of the DOS batch files. + + The only files that aren't shipped now are the results, specs and .ssh directory, + (whatever they are), and the development environment is identical to the unpacked + zips except for maybe the built directories (distribution, javadocs). + Integration Build 1.4 - 20030921 -------------------------------- Index: release.txt =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/release.txt,v retrieving revision 1.49 retrieving revision 1.50 diff -C2 -d -r1.49 -r1.50 *** release.txt 22 Sep 2003 02:39:58 -0000 1.49 --- release.txt 26 Oct 2003 19:46:17 -0000 1.50 *************** *** 1,3 **** ! HTMLParser Version 1.4 (Integration Build Sep 21, 2003) ********************************************* --- 1,3 ---- ! HTMLParser Version 1.4 (Integration Build Oct 26, 2003) ********************************************* |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/beans In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/beans Modified Files: BeanyBaby.java HTMLLinkBean.java HTMLTextBean.java LinkBean.java StringBean.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: BeanyBaby.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/beans/BeanyBaby.java,v retrieving revision 1.16 retrieving revision 1.17 diff -C2 -d -r1.16 -r1.17 *** BeanyBaby.java 22 Sep 2003 02:39:58 -0000 1.16 --- BeanyBaby.java 26 Oct 2003 19:46:17 -0000 1.17 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: HTMLLinkBean.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/beans/HTMLLinkBean.java,v retrieving revision 1.17 retrieving revision 1.18 diff -C2 -d -r1.17 -r1.18 *** HTMLLinkBean.java 22 Sep 2003 02:39:58 -0000 1.17 --- HTMLLinkBean.java 26 Oct 2003 19:46:17 -0000 1.18 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: HTMLTextBean.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/beans/HTMLTextBean.java,v retrieving revision 1.18 retrieving revision 1.19 diff -C2 -d -r1.18 -r1.19 *** HTMLTextBean.java 22 Sep 2003 02:39:58 -0000 1.18 --- HTMLTextBean.java 26 Oct 2003 19:46:17 -0000 1.19 *************** *** 1,3 **** ! /// HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! /// HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: LinkBean.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/beans/LinkBean.java,v retrieving revision 1.21 retrieving revision 1.22 diff -C2 -d -r1.21 -r1.22 *** LinkBean.java 22 Sep 2003 02:39:58 -0000 1.21 --- LinkBean.java 26 Oct 2003 19:46:17 -0000 1.22 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: StringBean.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/beans/StringBean.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** StringBean.java 28 Sep 2003 19:30:03 -0000 1.29 --- StringBean.java 26 Oct 2003 19:46:17 -0000 1.30 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/beans/package.html,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** package.html 22 Sep 2003 02:39:58 -0000 1.15 --- package.html 26 Oct 2003 19:46:17 -0000 1.16 *************** *** 6,10 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 6,10 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/lexer Modified Files: Cursor.java Lexer.java Page.java PageIndex.java Source.java Stream.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: Cursor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/Cursor.java,v retrieving revision 1.12 retrieving revision 1.13 diff -C2 -d -r1.12 -r1.13 *** Cursor.java 13 Oct 2003 21:48:12 -0000 1.12 --- Cursor.java 26 Oct 2003 19:46:18 -0000 1.13 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Lexer.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/Lexer.java,v retrieving revision 1.16 retrieving revision 1.17 diff -C2 -d -r1.16 -r1.17 *** Lexer.java 25 Oct 2003 15:46:02 -0000 1.16 --- Lexer.java 26 Oct 2003 19:46:18 -0000 1.17 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Page.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/Page.java,v retrieving revision 1.20 retrieving revision 1.21 diff -C2 -d -r1.20 -r1.21 *** Page.java 26 Oct 2003 03:53:32 -0000 1.20 --- Page.java 26 Oct 2003 19:46:18 -0000 1.21 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: PageIndex.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/PageIndex.java,v retrieving revision 1.11 retrieving revision 1.12 diff -C2 -d -r1.11 -r1.12 *** PageIndex.java 26 Oct 2003 17:58:25 -0000 1.11 --- PageIndex.java 26 Oct 2003 19:46:18 -0000 1.12 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Source.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/Source.java,v retrieving revision 1.11 retrieving revision 1.12 diff -C2 -d -r1.11 -r1.12 *** Source.java 29 Sep 2003 00:00:39 -0000 1.11 --- Source.java 26 Oct 2003 19:46:18 -0000 1.12 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Stream.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/Stream.java,v retrieving revision 1.7 retrieving revision 1.8 diff -C2 -d -r1.7 -r1.8 *** Stream.java 22 Sep 2003 02:39:59 -0000 1.7 --- Stream.java 26 Oct 2003 19:46:18 -0000 1.8 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/package.html,v retrieving revision 1.8 retrieving revision 1.9 diff -C2 -d -r1.8 -r1.9 *** package.html 26 Oct 2003 17:58:25 -0000 1.8 --- package.html 26 Oct 2003 19:46:18 -0000 1.9 *************** *** 7,11 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 7,11 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
From: <der...@us...> - 2003-10-26 19:48:32
|
Update of /cvsroot/htmlparser/WikiCapturer/src/org/htmlparser/wikicapturer In directory sc8-pr-cvs1:/tmp/cvs-serv24811/WikiCapturer/src/org/htmlparser/wikicapturer Modified Files: PhpWikiVisitor.java Log Message: Update version headers to 1.4-20031026 and update changelog. Index: PhpWikiVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/WikiCapturer/src/org/htmlparser/wikicapturer/PhpWikiVisitor.java,v retrieving revision 1.4 retrieving revision 1.5 diff -C2 -d -r1.4 -r1.5 *** PhpWikiVisitor.java 12 May 2003 00:59:25 -0000 1.4 --- PhpWikiVisitor.java 26 Oct 2003 19:46:16 -0000 1.5 *************** *** 11,15 **** import org.htmlparser.scanners.LinkScanner; import org.htmlparser.scanners.TitleScanner; - import org.htmlparser.tags.EndTag; import org.htmlparser.tags.ImageTag; import org.htmlparser.tags.LinkTag; --- 11,14 ---- *************** *** 41,51 **** } - public void visitEndTag(EndTag endTag) { - if (captureBegin) { - capturedHtml.append(endTag.toHtml()); - // System.out.println(endTag.toHtml()); - } - } - public void visitRemarkNode(RemarkNode remarkNode) { if (remarkNode.getText().indexOf("Begin actionbar")!=-1) { --- 40,43 ---- *************** *** 72,85 **** if (tag instanceof LinkTag) return; if (tag instanceof ImageTag) return; ! ! if (captureBegin) { ! if (tag.breaksFlow ()) ! capturedHtml.append (newline); ! if (!tag.getTagName().equals("A")) { ! capturedHtml.append(tag.toHtml()); ! // System.out.println("Tag captured: "+tag.toHtml()); ! } ! } } --- 64,84 ---- if (tag instanceof LinkTag) return; if (tag instanceof ImageTag) return; ! if (tag.isEndTag ()) { ! if (captureBegin) ! capturedHtml.append(tag.toHtml()); ! } ! else ! { ! if (captureBegin) ! { ! if (tag.breaksFlow ()) ! capturedHtml.append (newline); ! if (!tag.getTagName().equals("A")) { ! capturedHtml.append(tag.toHtml()); ! // System.out.println("Tag captured: "+tag.toHtml()); ! } ! } ! } } |
From: <der...@us...> - 2003-10-26 19:48:25
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/tests Modified Files: AllTests.java AssertXmlEqualsTest.java BadTagIdentifier.java FunctionalTests.java InstanceofPerformanceTest.java LineNumberAssignedByNodeReaderTest.java ParserTest.java ParserTestCase.java PerformanceTest.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: AllTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/AllTests.java,v retrieving revision 1.54 retrieving revision 1.55 diff -C2 -d -r1.54 -r1.55 *** AllTests.java 21 Oct 2003 02:24:00 -0000 1.54 --- AllTests.java 26 Oct 2003 19:46:24 -0000 1.55 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: AssertXmlEqualsTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/AssertXmlEqualsTest.java,v retrieving revision 1.14 retrieving revision 1.15 diff -C2 -d -r1.14 -r1.15 *** AssertXmlEqualsTest.java 21 Oct 2003 02:24:00 -0000 1.14 --- AssertXmlEqualsTest.java 26 Oct 2003 19:46:24 -0000 1.15 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: BadTagIdentifier.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/BadTagIdentifier.java,v retrieving revision 1.13 retrieving revision 1.14 diff -C2 -d -r1.13 -r1.14 *** BadTagIdentifier.java 22 Sep 2003 02:40:03 -0000 1.13 --- BadTagIdentifier.java 26 Oct 2003 19:46:25 -0000 1.14 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: FunctionalTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/FunctionalTests.java,v retrieving revision 1.46 retrieving revision 1.47 diff -C2 -d -r1.46 -r1.47 *** FunctionalTests.java 21 Oct 2003 02:24:00 -0000 1.46 --- FunctionalTests.java 26 Oct 2003 19:46:25 -0000 1.47 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: InstanceofPerformanceTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/InstanceofPerformanceTest.java,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** InstanceofPerformanceTest.java 22 Sep 2003 02:40:03 -0000 1.15 --- InstanceofPerformanceTest.java 26 Oct 2003 19:46:25 -0000 1.16 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: LineNumberAssignedByNodeReaderTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/LineNumberAssignedByNodeReaderTest.java,v retrieving revision 1.25 retrieving revision 1.26 diff -C2 -d -r1.25 -r1.26 *** LineNumberAssignedByNodeReaderTest.java 21 Oct 2003 02:24:00 -0000 1.25 --- LineNumberAssignedByNodeReaderTest.java 26 Oct 2003 19:46:25 -0000 1.26 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ParserTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/ParserTest.java,v retrieving revision 1.46 retrieving revision 1.47 diff -C2 -d -r1.46 -r1.47 *** ParserTest.java 26 Oct 2003 03:53:33 -0000 1.46 --- ParserTest.java 26 Oct 2003 19:46:25 -0000 1.47 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: ParserTestCase.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/ParserTestCase.java,v retrieving revision 1.35 retrieving revision 1.36 diff -C2 -d -r1.35 -r1.36 *** ParserTestCase.java 26 Oct 2003 03:53:33 -0000 1.35 --- ParserTestCase.java 26 Oct 2003 19:46:25 -0000 1.36 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: PerformanceTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/PerformanceTest.java,v retrieving revision 1.42 retrieving revision 1.43 diff -C2 -d -r1.42 -r1.43 *** PerformanceTest.java 22 Sep 2003 02:40:04 -0000 1.42 --- PerformanceTest.java 26 Oct 2003 19:46:25 -0000 1.43 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/package.html,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** package.html 22 Sep 2003 02:40:05 -0000 1.15 --- package.html 26 Oct 2003 19:46:25 -0000 1.16 *************** *** 6,10 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 6,10 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
From: <der...@us...> - 2003-10-26 19:48:25
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/lexerTests In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/tests/lexerTests Modified Files: AllTests.java AttributeTests.java LexerTests.java PageIndexTests.java PageTests.java SourceTests.java StreamTests.java TagTests.java Log Message: Update version headers to 1.4-20031026 and update changelog. Index: AllTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/lexerTests/AllTests.java,v retrieving revision 1.13 retrieving revision 1.14 diff -C2 -d -r1.13 -r1.14 *** AllTests.java 21 Oct 2003 02:24:00 -0000 1.13 --- AllTests.java 26 Oct 2003 19:46:25 -0000 1.14 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: AttributeTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/lexerTests/AttributeTests.java,v retrieving revision 1.4 retrieving revision 1.5 diff -C2 -d -r1.4 -r1.5 *** AttributeTests.java 21 Oct 2003 02:24:00 -0000 1.4 --- AttributeTests.java 26 Oct 2003 19:46:25 -0000 1.5 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: LexerTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/lexerTests/LexerTests.java,v retrieving revision 1.9 retrieving revision 1.10 diff -C2 -d -r1.9 -r1.10 *** LexerTests.java 21 Oct 2003 02:24:00 -0000 1.9 --- LexerTests.java 26 Oct 2003 19:46:25 -0000 1.10 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: PageIndexTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/lexerTests/PageIndexTests.java,v retrieving revision 1.9 retrieving revision 1.10 diff -C2 -d -r1.9 -r1.10 *** PageIndexTests.java 21 Oct 2003 02:24:00 -0000 1.9 --- PageIndexTests.java 26 Oct 2003 19:46:25 -0000 1.10 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: PageTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/lexerTests/PageTests.java,v retrieving revision 1.11 retrieving revision 1.12 diff -C2 -d -r1.11 -r1.12 *** PageTests.java 21 Oct 2003 02:24:00 -0000 1.11 --- PageTests.java 26 Oct 2003 19:46:25 -0000 1.12 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: SourceTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/lexerTests/SourceTests.java,v retrieving revision 1.10 retrieving revision 1.11 diff -C2 -d -r1.10 -r1.11 *** SourceTests.java 21 Oct 2003 02:24:00 -0000 1.10 --- SourceTests.java 26 Oct 2003 19:46:25 -0000 1.11 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: StreamTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/lexerTests/StreamTests.java,v retrieving revision 1.10 retrieving revision 1.11 diff -C2 -d -r1.10 -r1.11 *** StreamTests.java 26 Oct 2003 03:53:33 -0000 1.10 --- StreamTests.java 26 Oct 2003 19:46:25 -0000 1.11 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: TagTests.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/lexerTests/TagTests.java,v retrieving revision 1.2 retrieving revision 1.3 diff -C2 -d -r1.2 -r1.3 *** TagTests.java 21 Oct 2003 02:24:00 -0000 1.2 --- TagTests.java 26 Oct 2003 19:46:25 -0000 1.3 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser/parserapplications Modified Files: LinkExtractor.java MailRipper.java Robot.java StringExtractor.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: LinkExtractor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications/LinkExtractor.java,v retrieving revision 1.45 retrieving revision 1.46 diff -C2 -d -r1.45 -r1.46 *** LinkExtractor.java 22 Sep 2003 02:40:00 -0000 1.45 --- LinkExtractor.java 26 Oct 2003 19:46:19 -0000 1.46 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: MailRipper.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications/MailRipper.java,v retrieving revision 1.46 retrieving revision 1.47 diff -C2 -d -r1.46 -r1.47 *** MailRipper.java 22 Sep 2003 02:40:00 -0000 1.46 --- MailRipper.java 26 Oct 2003 19:46:19 -0000 1.47 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Robot.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications/Robot.java,v retrieving revision 1.48 retrieving revision 1.49 diff -C2 -d -r1.48 -r1.49 *** Robot.java 22 Sep 2003 02:40:00 -0000 1.48 --- Robot.java 26 Oct 2003 19:46:19 -0000 1.49 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: StringExtractor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications/StringExtractor.java,v retrieving revision 1.42 retrieving revision 1.43 diff -C2 -d -r1.42 -r1.43 *** StringExtractor.java 22 Sep 2003 02:40:00 -0000 1.42 --- StringExtractor.java 26 Oct 2003 19:46:19 -0000 1.43 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications/package.html,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** package.html 22 Sep 2003 02:40:00 -0000 1.15 --- package.html 26 Oct 2003 19:46:19 -0000 1.16 *************** *** 5,9 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 5,9 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser/src/org/htmlparser Modified Files: AbstractNode.java Node.java Parser.java RemarkNode.java StringNode.java StringNodeFactory.java package.html Log Message: Update version headers to 1.4-20031026 and update changelog. Index: AbstractNode.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/AbstractNode.java,v retrieving revision 1.17 retrieving revision 1.18 diff -C2 -d -r1.17 -r1.18 *** AbstractNode.java 20 Oct 2003 01:28:02 -0000 1.17 --- AbstractNode.java 26 Oct 2003 19:46:17 -0000 1.18 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Node.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/Node.java,v retrieving revision 1.41 retrieving revision 1.42 diff -C2 -d -r1.41 -r1.42 *** Node.java 5 Oct 2003 13:49:40 -0000 1.41 --- Node.java 26 Oct 2003 19:46:17 -0000 1.42 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: Parser.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/Parser.java,v retrieving revision 1.67 retrieving revision 1.68 diff -C2 -d -r1.67 -r1.68 *** Parser.java 20 Oct 2003 01:28:02 -0000 1.67 --- Parser.java 26 Oct 2003 19:46:17 -0000 1.68 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // *************** *** 159,163 **** */ public final static String ! VERSION_DATE = "Sep 21, 2003" ; --- 159,163 ---- */ public final static String ! VERSION_DATE = "Oct 26, 2003" ; Index: RemarkNode.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/RemarkNode.java,v retrieving revision 1.33 retrieving revision 1.34 diff -C2 -d -r1.33 -r1.34 *** RemarkNode.java 28 Sep 2003 19:30:03 -0000 1.33 --- RemarkNode.java 26 Oct 2003 19:46:17 -0000 1.34 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: StringNode.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/StringNode.java,v retrieving revision 1.41 retrieving revision 1.42 diff -C2 -d -r1.41 -r1.42 *** StringNode.java 28 Sep 2003 19:30:03 -0000 1.41 --- StringNode.java 26 Oct 2003 19:46:17 -0000 1.42 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: StringNodeFactory.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/StringNodeFactory.java,v retrieving revision 1.5 retrieving revision 1.6 diff -C2 -d -r1.5 -r1.6 *** StringNodeFactory.java 22 Sep 2003 02:39:58 -0000 1.5 --- StringNodeFactory.java 26 Oct 2003 19:46:17 -0000 1.6 *************** *** 1,3 **** ! // HTMLParser Library v1_4_20030921 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // --- 1,3 ---- ! // HTMLParser Library v1_4_20031026 - A java-based parser for HTML // Copyright (C) Dec 31, 2000 Somik Raha // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/package.html,v retrieving revision 1.16 retrieving revision 1.17 diff -C2 -d -r1.16 -r1.17 *** package.html 22 Sep 2003 02:39:58 -0000 1.16 --- package.html 26 Oct 2003 19:46:17 -0000 1.17 *************** *** 6,10 **** @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20030921 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha --- 6,10 ---- @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031026 - A java-based parser for HTML Copyright (C) Dec 31, 2000 Somik Raha |
From: <der...@us...> - 2003-10-26 19:48:16
|
Update of /cvsroot/htmlparser/htmlparser In directory sc8-pr-cvs1:/tmp/cvs-serv24811/htmlparser Modified Files: build.xml Log Message: Update version headers to 1.4-20031026 and update changelog. Index: build.xml =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/build.xml,v retrieving revision 1.50 retrieving revision 1.51 diff -C2 -d -r1.50 -r1.51 *** build.xml 25 Oct 2003 12:03:52 -0000 1.50 --- build.xml 26 Oct 2003 19:46:16 -0000 1.51 *************** *** 20,25 **** deletes local Wiki pages, of course any one else would have to adjust this and also the hard-coded path in WikiCapturer ! - 'javac -classpath release/htmlparser1_4/lib/htmlparser.jar ../WikiCapturer/src/org/htmlparser/wikicapturer/CaptureWiki.java ../WikiCapturer/src/org/htmlparser/wikicapturer/PhpWikiVisitor.java' ! and 'java -classpath release/htmlparser1_4/lib/htmlparser.jar:../WikiCapturer/src org.htmlparser.wikicapturer.CaptureWiki' fetches current Wiki pages - perform a CVS update on htmlparser/docs/docs to identify new and changed files --- 20,25 ---- deletes local Wiki pages, of course any one else would have to adjust this and also the hard-coded path in WikiCapturer ! - 'javac -classpath lib/htmlparser.jar ../WikiCapturer/src/org/htmlparser/wikicapturer/CaptureWiki.java ../WikiCapturer/src/org/htmlparser/wikicapturer/PhpWikiVisitor.java' ! and 'java -classpath lib/htmlparser.jar:../WikiCapturer/src org.htmlparser.wikicapturer.CaptureWiki' fetches current Wiki pages - perform a CVS update on htmlparser/docs/docs to identify new and changed files |
From: <der...@us...> - 2003-10-26 18:00:56
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/nodes In directory sc8-pr-cvs1:/tmp/cvs-serv7966/nodes Modified Files: PageAttribute.java package.html Log Message: Doco update. Move the lexer from future tense to current. Index: PageAttribute.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/nodes/PageAttribute.java,v retrieving revision 1.1 retrieving revision 1.2 diff -C2 -d -r1.1 -r1.2 *** PageAttribute.java 18 Oct 2003 20:50:37 -0000 1.1 --- PageAttribute.java 26 Oct 2003 17:58:25 -0000 1.2 *************** *** 40,44 **** * <code>Page</code> by providing the page and cursor offsets * into the page for the name and value. This is done for speed, since ! * if the name and value are not been needed we can avoid the cost and memory * overhead of creating the strings. * <p> --- 40,44 ---- * <code>Page</code> by providing the page and cursor offsets * into the page for the name and value. This is done for speed, since ! * if the name and value are not needed we can avoid the cost and memory * overhead of creating the strings. * <p> Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/nodes/package.html,v retrieving revision 1.5 retrieving revision 1.6 diff -C2 -d -r1.5 -r1.6 *** package.html 22 Sep 2003 02:39:59 -0000 1.5 --- package.html 26 Oct 2003 17:58:25 -0000 1.6 *************** *** 39,44 **** </HEAD> <BODY> ! The nodes package will eventually be the lexemes returned by the base level I/O subsystem. ! <EM>It is currently under development.</EM> There are three types of lexems so far, <code>RemarkNode</code>, <code>StringNode</code> and <code>TagNode</code>. Within the <code>TagNode</code> objects is a list of --- 39,43 ---- </HEAD> <BODY> ! The nodes package are the lexemes returned by the base level I/O subsystem. There are three types of lexems so far, <code>RemarkNode</code>, <code>StringNode</code> and <code>TagNode</code>. Within the <code>TagNode</code> objects is a list of |