htmlparser-cvs Mailing List for HTML Parser (Page 24)
Brought to you by:
derrickoswald
From: <der...@us...> - 2004-01-10 15:23:36
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer
In directory sc8-pr-cvs1:/tmp/cvs-serv3574/lexer

Modified Files:
    Page.java
Log Message:
Fix bug #874175 StringBean doesn't handle charset change well
Add EncodingChangeException to distinguish a recoverable character set
change occurring after the lexer has already coughed up some characters
using the wrong encoding.
Added testEncodingChange in LexerTests to exercise it.
Changed IteratorImpl to not wrap a ParserException with another ParserException.
Changed StringBean to retry the URL when an encoding change exception is caught.

Index: Page.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/Page.java,v
retrieving revision 1.31
retrieving revision 1.32
diff -C2 -d -r1.31 -r1.32
*** Page.java 2 Jan 2004 16:24:53 -0000 1.31
--- Page.java 10 Jan 2004 15:23:33 -0000 1.32
***************
*** 27,38 ****
  package org.htmlparser.lexer;

! import java.io.*;
  import java.io.IOException;
  import java.io.Serializable;
! import java.lang.reflect.*;
! import java.net.*;

! import org.htmlparser.util.*;
  import org.htmlparser.util.LinkProcessor;

  /**
--- 27,46 ----
  package org.htmlparser.lexer;

! import java.io.ByteArrayInputStream;
! import java.io.InputStream;
  import java.io.IOException;
+ import java.io.ObjectInputStream;
+ import java.io.ObjectOutputStream;
  import java.io.Serializable;
! import java.io.UnsupportedEncodingException;
! import java.lang.reflect.InvocationTargetException;
! import java.lang.reflect.Method;
! import java.net.URL;
! import java.net.URLConnection;
! import java.net.UnknownHostException;

! import org.htmlparser.util.EncodingChangeException;
  import org.htmlparser.util.LinkProcessor;
+ import org.htmlparser.util.ParserException;

  /**
***************
*** 692,696 ****
              for (int i = 0; i < offset; i++)
                  if (new_chars[i] != buffer[i])
!                     throw new ParserException ("character mismatch (new: "
                      + new_chars[i]
                      + " != old: "
--- 700,704 ----
              for (int i = 0; i < offset; i++)
                  if (new_chars[i] != buffer[i])
!                     throw new EncodingChangeException ("character mismatch (new: "
                      + new_chars[i]
                      + " != old: "
***************
*** 700,704 ****
                      + " to "
                      + character_set
!                     + " at offset " + offset);
          }

--- 708,712 ----
                      + " to "
                      + character_set
!                     + " at character offset " + offset);
          }

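The check changed in Page.java above can be illustrated with a small standalone sketch. It is not the library's code: the class EncodingChangeSketch, the method redecode and the nested EncodingChangeException are hypothetical stand-ins, and the sketch assumes the raw bytes of the page are still available for decoding a second time. It decodes the same bytes under the old and the new character set and accepts the switch only if the characters already handed out (the first `offset` of them) are identical in both decodings.

```java
import java.nio.charset.Charset;

public class EncodingChangeSketch {
    /** Hypothetical stand-in for org.htmlparser.util.EncodingChangeException. */
    public static class EncodingChangeException extends Exception {
        public EncodingChangeException(String message) {
            super(message);
        }
    }

    /**
     * Decode raw with newSet and verify that the first offset characters,
     * which were already delivered using oldSet, come out the same.
     * Returns the newly decoded characters when the switch is harmless.
     */
    public static char[] redecode(byte[] raw, Charset oldSet, Charset newSet, int offset)
            throws EncodingChangeException {
        char[] buffer = new String(raw, oldSet).toCharArray();
        char[] newChars = new String(raw, newSet).toCharArray();
        for (int i = 0; i < offset; i++) {
            // A shorter re-decoded buffer also counts as a mismatch.
            if (i >= newChars.length || newChars[i] != buffer[i]) {
                throw new EncodingChangeException(
                    "character mismatch changing from " + oldSet + " to " + newSet
                    + " at character offset " + i);
            }
        }
        return newChars;
    }
}
```

A caller that catches the exception can discard what it has read so far and start over with the new character set, which is the recovery the log message describes for StringBean.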
From: <der...@us...> - 2004-01-10 15:23:36
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/lexerTests
In directory sc8-pr-cvs1:/tmp/cvs-serv3574/tests/lexerTests

Modified Files:
    LexerTests.java
Log Message:
Fix bug #874175 StringBean doesn't handle charset change well
Add EncodingChangeException to distinguish a recoverable character set
change occurring after the lexer has already coughed up some characters
using the wrong encoding.
Added testEncodingChange in LexerTests to exercise it.
Changed IteratorImpl to not wrap a ParserException with another ParserException.
Changed StringBean to retry the URL when an encoding change exception is caught.

Index: LexerTests.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/lexerTests/LexerTests.java,v
retrieving revision 1.15
retrieving revision 1.16
diff -C2 -d -r1.15 -r1.16
*** LexerTests.java 2 Jan 2004 16:24:55 -0000 1.15
--- LexerTests.java 10 Jan 2004 15:23:33 -0000 1.16
***************
*** 52,55 ****
--- 52,56 ----
  import org.htmlparser.util.NodeIterator;
  import org.htmlparser.util.NodeList;
+ import org.htmlparser.util.EncodingChangeException;
  import org.htmlparser.util.ParserException;

***************
*** 620,628 ****
       * causes spurious tags.
       * The root cause is characters bracketed by [esc]$B and [esc](J (contrary
!      * to what is indicated in then j_s_nightingale analysis of the problem) that
       * sometimes have an angle bracket (< or 0x3c) embedded in them. These
       * are taken to be tags by the parser, instead of being considered strings.
       * <p>
!      * The URL refrenced has an ISO-8859-1 encoding (the default), but
       * Japanese characters intermixed on the page with English, using the JIS
       * encoding. We detect failure by looking for weird tag names which were
--- 621,629 ----
       * causes spurious tags.
       * The root cause is characters bracketed by [esc]$B and [esc](J (contrary
!      * to what is indicated in the j_s_nightingale analysis of the problem) that
       * sometimes have an angle bracket (< or 0x3c) embedded in them. These
       * are taken to be tags by the parser, instead of being considered strings.
       * <p>
!      * The URL http://www.009.com/ has an ISO-8859-1 encoding (the default), but
       * Japanese characters intermixed on the page with English, using the JIS
       * encoding. We detect failure by looking for weird tag names which were
***************
*** 666,670 ****
          NodeIterator iterator;

!         parser = new Parser ("http://www.009.com/");
          iterator = parser.elements ();
          while (iterator.hasMoreNodes ())
--- 667,671 ----
          NodeIterator iterator;

!         parser = new Parser ("http://htmlparser.sourceforge.net/test/www_009_com.html");
          iterator = parser.elements ();
          while (iterator.hasMoreNodes ())
***************
*** 745,748 ****
--- 746,784 ----
      }

+     /**
+      * See bug #874175 StringBean doesn't handle charset change well
+      * Force an encoding change exception, reset and re-read.
+      */
+     public void testEncodingChange ()
+         throws
+             ParserException
+     {
+         NodeIterator iterator;
+         Node node;
+         boolean success;
+
+         parser = new Parser ("http://htmlparser.sourceforge.net/test/www_china-pub_com.html");
+         success = false;
+         try
+         {
+             for (iterator = parser.elements (); iterator.hasMoreNodes (); )
+                 node = iterator.nextNode ();
+         }
+         catch (EncodingChangeException ece)
+         {
+             success = true;
+             try
+             {
+                 parser.reset ();
+                 for (iterator = parser.elements (); iterator.hasMoreNodes (); )
+                     node = iterator.nextNode ();
+             }
+             catch (ParserException pe)
+             {
+                 success = false;
+             }
+         }
+         assertTrue ("encoding change failed", success);
+     }
  }

From: <der...@us...> - 2004-01-10 00:06:06
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util
In directory sc8-pr-cvs1:/tmp/cvs-serv24025/util

Modified Files:
    NodeList.java
Log Message:
First pass at the wiki capturer.
Added useful extensions to the HasAttributeFilter, SiteCapturer and NodeList

Index: NodeList.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/NodeList.java,v
retrieving revision 1.53
retrieving revision 1.54
diff -C2 -d -r1.53 -r1.54
*** NodeList.java 2 Jan 2004 16:24:58 -0000 1.53
--- NodeList.java 10 Jan 2004 00:06:03 -0000 1.54
***************
*** 185,188 ****
--- 185,191 ----
       * Filter the list with the given filter non-recursively.
       * @param filter The filter to use.
+      * @return A new node array containing the nodes accepted by the filter.
+      * This is a linear list and preserves the nested structure of the returned
+      * nodes only.
       */
      public NodeList extractAllNodesThatMatch (NodeFilter filter)
***************
*** 195,198 ****
--- 198,204 ----
       * @param filter The filter to use.
       * @param recursive If <code>true<code> digs into the children recursively.
+      * @return A new node array containing the nodes accepted by the filter.
+      * This is a linear list and preserves the nested structure of the returned
+      * nodes only.
       */
      public NodeList extractAllNodesThatMatch (NodeFilter filter, boolean recursive)
***************
*** 218,221 ****
--- 224,266 ----

          return (ret);
+     }
+
+     /**
+      * Remove nodes not matching the given filter non-recursively.
+      * @param filter The filter to use.
+      */
+     public void keepAllNodesThatMatch (NodeFilter filter)
+     {
+         keepAllNodesThatMatch (filter, false);
+     }
+
+     /**
+      * Remove nodes not matching the given filter.
+      * @param filter The filter to use.
+      * @param recursive If <code>true<code> digs into the children recursively.
+      */
+     public void keepAllNodesThatMatch (NodeFilter filter, boolean recursive)
+     {
+         String name;
+         Node node;
+         NodeList children;
+         NodeList ret;
+
+         for (int i = 0; i < size; )
+         {
+             node = nodeData[i];
+             if (!filter.accept (node))
+                 remove (i);
+             else
+             {
+                 if (recursive)
+                 {
+                     children = node.getChildren ();
+                     if (null != children)
+                         children.keepAllNodesThatMatch (filter, recursive);
+                 }
+                 i++;
+             }
+         }
      }

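The in-place pruning added to NodeList above can be sketched without the library. The class KeepMatchingSketch and its TreeNode type below are hypothetical illustrations, not htmlparser classes, and java.util.function.Predicate stands in for NodeFilter. As in the diff, the index advances only when a node is kept, because removing an element shifts its successor into the same slot, and children are visited only for nodes that survive the filter.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

public class KeepMatchingSketch {
    /** Hypothetical minimal tree node; not the library's Node interface. */
    public static class TreeNode {
        public final String name;
        public final List<TreeNode> children = new ArrayList<>();

        public TreeNode(String name) {
            this.name = name;
        }

        public TreeNode add(TreeNode child) {
            children.add(child);
            return this;
        }
    }

    /** Remove, in place, every node the filter rejects. */
    public static void keepAllThatMatch(
            List<TreeNode> nodes, Predicate<TreeNode> filter, boolean recursive) {
        for (int i = 0; i < nodes.size(); ) {
            TreeNode node = nodes.get(i);
            if (!filter.test(node)) {
                // Do not advance: the next element has moved into slot i.
                nodes.remove(i);
            } else {
                if (recursive) {
                    keepAllThatMatch(node.children, filter, true);
                }
                i++;
            }
        }
    }
}
```

One consequence of this shape is that a rejected node takes its whole subtree with it, even if some descendants would have passed the filter on their own.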
From: <der...@us...> - 2004-01-10 00:06:06
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications In directory sc8-pr-cvs1:/tmp/cvs-serv24025/parserapplications Modified Files: SiteCapturer.java Added Files: WikiCapturer.java Log Message: First pass at the wiki capturer. Added useful extensions to the HasAttributeFilter, SiteCapturer and NodeList --- NEW FILE: WikiCapturer.java --- // HTMLParser Library $Name: $ - A java-based parser for HTML // http://sourceforge.org/projects/htmlparser // Copyright (C) 2003 Derrick Oswald // // Revision Control Information // // $Source: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications/WikiCapturer.java,v $ // $Author: derrickoswald $ // $Date: 2004/01/10 00:06:03 $ // $Revision: 1.1 $ // // This library is free software; you can redistribute it and/or // modify it under the terms of the GNU Lesser General Public // License as published by the Free Software Foundation; either // version 2.1 of the License, or (at your option) any later version. // // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.parserapplications; import java.io.File; import java.io.IOException; import java.net.MalformedURLException; import java.net.URL; import javax.swing.JFileChooser; import javax.swing.JOptionPane; import org.htmlparser.filters.AndFilter; import org.htmlparser.filters.HasAttributeFilter; import org.htmlparser.filters.NotFilter; import org.htmlparser.filters.OrFilter; import org.htmlparser.filters.TagNameFilter; /** * Save a wikiwikiweb locally. * Illustrative program to save a wiki locally. 
*/ public class WikiCapturer extends SiteCapturer { /** * Create a wikicapturer. */ public WikiCapturer () { } /** * Mainline to capture a web site locally. * @param args The command line arguments. * There are three arguments the web site to capture, the local directory * to save it to, and a flag (true or false) to indicate whether resources * such as images and video are to be captured as well. * These are requested via dialog boxes if not supplied. */ public static void main (String[] args) throws MalformedURLException, IOException { WikiCapturer worker; String url; JFileChooser chooser; URL source; String path; File target; Boolean capture; int ret; worker = new WikiCapturer (); if (0 >= args.length) { url = (String)JOptionPane.showInputDialog ( null, "Enter the URL to capture:", "Web Site", JOptionPane.PLAIN_MESSAGE, null, null, "http://htmlparser.sourceforge.net/wiki"); if (null != url) worker.setSource (url); else System.exit (1); } else worker.setSource (args[0]); if (1 >= args.length) { url = worker.getSource (); source = new URL (url); path = new File (new File ("." 
+ File.separator), source.getHost () + File.separator).getCanonicalPath (); target = new File (path); chooser = new JFileChooser (target); chooser.setDialogType (JFileChooser.SAVE_DIALOG); chooser.setFileSelectionMode (JFileChooser.DIRECTORIES_ONLY); chooser.setSelectedFile (target); // this doesn't frickin' work chooser.setMultiSelectionEnabled (false); chooser.setDialogTitle ("Target Directory"); ret = chooser.showSaveDialog (null); if (ret == JFileChooser.APPROVE_OPTION) worker.setTarget (chooser.getSelectedFile ().getAbsolutePath ()); else System.exit (1); } else worker.setTarget (args[1]); if (2 >= args.length) { capture = (Boolean)JOptionPane.showInputDialog ( null, "Should resources be captured:", "Capture Resources", JOptionPane.PLAIN_MESSAGE, null, new Object[] { Boolean.TRUE, Boolean.FALSE}, Boolean.TRUE); if (null != capture) worker.setCaptureResources (capture.booleanValue ()); else System.exit (1); } else worker.setCaptureResources ((Boolean.valueOf (args[2]).booleanValue ())); worker.setFilter ( new NotFilter ( new OrFilter ( new AndFilter ( new TagNameFilter ("DIV"), new HasAttributeFilter ("id", "navbar")), new OrFilter ( new AndFilter ( new TagNameFilter ("DIV"), new HasAttributeFilter ("id", "actionbar")), new AndFilter ( new TagNameFilter ("DIV"), new HasAttributeFilter ("id", "xhtml-validator")))))); worker.capture (); System.exit (0); } } Index: SiteCapturer.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications/SiteCapturer.java,v retrieving revision 1.2 retrieving revision 1.3 diff -C2 -d -r1.2 -r1.3 *** SiteCapturer.java 4 Jan 2004 03:23:09 -0000 1.2 --- SiteCapturer.java 10 Jan 2004 00:06:03 -0000 1.3 *************** *** 42,51 **** --- 42,57 ---- import org.htmlparser.Node; + import org.htmlparser.NodeFilter; import org.htmlparser.Parser; import org.htmlparser.PrototypicalNodeFactory; + import org.htmlparser.filters.AndFilter; + import 
org.htmlparser.filters.HasAttributeFilter; + import org.htmlparser.filters.NodeClassFilter; + import org.htmlparser.lexer.nodes.Attribute; import org.htmlparser.tags.BaseHrefTag; import org.htmlparser.tags.FrameTag; import org.htmlparser.tags.ImageTag; import org.htmlparser.tags.LinkTag; + import org.htmlparser.tags.MetaTag; import org.htmlparser.util.NodeIterator; import org.htmlparser.util.NodeList; *************** *** 54,58 **** /** * Save a web site locally. ! * Illustrative prgram to save a web site contents locally. * It was created to demonstrate URL rewriting in it's simplest form. * It uses customized tags in the NodeFactory to alter the URLs. --- 60,64 ---- /** * Save a web site locally. ! * Illustrative program to save a web site contents locally. * It was created to demonstrate URL rewriting in it's simplest form. * It uses customized tags in the NodeFactory to alter the URLs. *************** *** 125,128 **** --- 131,139 ---- /** + * The filter to apply to the nodes retrieved. + */ + protected NodeFilter mFilter; + + /** * Copy buffer size. * Resources are moved to disk in chunks this size or less. *************** *** 137,140 **** --- 148,153 ---- PrototypicalNodeFactory factory; + mSource = null; + mTarget = null; mPages = new ArrayList (); mFinished = new HashSet (); *************** *** 148,151 **** --- 161,166 ---- factory.registerTag (new LocalImageTag ()); mParser.setNodeFactory (factory); + mCaptureResources = true; + mFilter = null; } *************** *** 214,217 **** --- 229,251 ---- } + + /** Getter for property filter. + * @return Value of property filter. + * + */ + public NodeFilter getFilter () + { + return (mFilter); + } + + /** Setter for property filter. + * @param filter New value of property filter. + * + */ + public void setFilter (NodeFilter filter) + { + mFilter = filter; + } + /** * Returns <code>true</code> if the link is one we are interested in. *************** *** 282,286 **** String ret; ! 
if (link.equals (getSource ())) ret = "index.html"; // handle the root page specially else if (link.startsWith (getSource ()) --- 316,320 ---- String ret; ! if (link.equals (getSource ()) || (!getSource ().endsWith ("/") && link.equals (getSource () + "/"))) ret = "index.html"; // handle the root page specially else if (link.startsWith (getSource ()) *************** *** 383,392 **** * Process a single page. */ ! protected void process () throws ParserException { String url; NodeList list; File file; File dir; --- 417,430 ---- * Process a single page. */ ! protected void process (NodeFilter filter) throws ParserException { String url; + int bookmark; NodeList list; + NodeList robots; + MetaTag robot; + String content; File file; File dir; *************** *** 399,403 **** try ! { // fetch the page and gather the list of nodes mParser.setURL (url); list = new NodeList (); --- 437,443 ---- try ! { ! bookmark = mPages.size (); ! // fetch the page and gather the list of nodes mParser.setURL (url); list = new NodeList (); *************** *** 405,408 **** --- 445,470 ---- list.add (e.nextNode ()); // URL conversion occurs in the tags + // handle robots meta tag according to http://www.robotstxt.org/wc/meta-user.html + // <meta name="robots" content="index,follow" /> + // <meta name="robots" content="noindex,nofollow" /> + robots = list.extractAllNodesThatMatch ( + new AndFilter ( + new NodeClassFilter (MetaTag.class), + new HasAttributeFilter ("name", "robots")), true); + if (0 != robots.size ()) + { + robot = (MetaTag)robots.elementAt (0); + content = robot.getAttribute ("content").toLowerCase (); + if ((-1 != content.indexOf ("none")) || (-1 != content.indexOf ("nofollow"))) + // reset mPages + for (int i = bookmark; i < mPages.size (); i++) + mPages.remove (i); + if ((-1 != content.indexOf ("none")) || (-1 != content.indexOf ("noindex"))) + return; + } + + if (null != filter) + list.keepAllNodesThatMatch (filter, true); + // save the page locally file = new File 
(getTarget (), makeLocalLink (url, "")); *************** *** 410,413 **** --- 472,483 ---- if (!dir.exists ()) dir.mkdirs (); + else if (!dir.isDirectory ()) + { + dir = new File (dir.getParentFile (), dir.getName () + ".content"); + if (!dir.exists ()) + dir.mkdirs (); + file = new File (dir, file.getName ()); + } + try { *************** *** 582,586 **** try { ! process (); while (0 != mImages.size ()) copy (); --- 652,656 ---- try { ! process (getFilter ()); while (0 != mImages.size ()) copy (); |
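The robots meta handling added to SiteCapturer.process above can be sketched in isolation. The class RobotsMetaSketch and its method names are hypothetical; the substring tests mirror the ones in the diff ("none" or "nofollow" forbids following links, "none" or "noindex" forbids saving the page). For discarding the pages queued since the bookmark, this sketch clears the tail with subList rather than calling remove(i) with an ascending index, since the latter skips every other entry as the list shrinks.

```java
import java.util.List;
import java.util.Locale;

public class RobotsMetaSketch {
    /** True unless the robots content forbids following links on the page. */
    public static boolean mayFollow(String content) {
        String c = content.toLowerCase(Locale.ROOT);
        return !(c.contains("none") || c.contains("nofollow"));
    }

    /** True unless the robots content forbids indexing (here: saving) the page. */
    public static boolean mayIndex(String content) {
        String c = content.toLowerCase(Locale.ROOT);
        return !(c.contains("none") || c.contains("noindex"));
    }

    /** Drop every entry queued at or after bookmark, leaving earlier ones intact. */
    public static void truncateTo(List<String> pages, int bookmark) {
        if (bookmark < pages.size()) {
            pages.subList(bookmark, pages.size()).clear();
        }
    }
}
```

A caller would record `pages.size()` before parsing a page, then call `truncateTo(pages, bookmark)` when `mayFollow` returns false and skip writing the file when `mayIndex` returns false.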
From: <der...@us...> - 2004-01-10 00:06:06
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/filters
In directory sc8-pr-cvs1:/tmp/cvs-serv24025/filters

Modified Files:
    HasAttributeFilter.java
Log Message:
First pass at the wiki capturer.
Added useful extensions to the HasAttributeFilter, SiteCapturer and NodeList

Index: HasAttributeFilter.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/filters/HasAttributeFilter.java,v
retrieving revision 1.1
retrieving revision 1.2
diff -C2 -d -r1.1 -r1.2
*** HasAttributeFilter.java 8 Nov 2003 21:30:58 -0000 1.1
--- HasAttributeFilter.java 10 Jan 2004 00:06:03 -0000 1.2
***************
*** 29,36 ****
  import org.htmlparser.Node;
  import org.htmlparser.NodeFilter;
  import org.htmlparser.lexer.nodes.TagNode;

  /**
!  * This class accepts all tags that have a child acceptable to the filter.
   */
  public class HasAttributeFilter implements NodeFilter
--- 29,37 ----
  import org.htmlparser.Node;
  import org.htmlparser.NodeFilter;
+ import org.htmlparser.lexer.nodes.Attribute;
  import org.htmlparser.lexer.nodes.TagNode;

  /**
!  * This class accepts all tags that have a certain attribute, and optionally, with a certain value.
   */
  public class HasAttributeFilter implements NodeFilter
***************
*** 42,45 ****
--- 43,51 ----

      /**
+      * The value to check for.
+      */
+     protected String mValue;
+
+     /**
       * Creates a new instance of HasAttributeFilter that accepts tags with the given attribute.
       * @param attribute The attribute to search for.
***************
*** 47,51 ****
--- 53,68 ----
      public HasAttributeFilter (String attribute)
      {
+         this (attribute, null);
+     }
+
+     /**
+      * Creates a new instance of HasAttributeFilter that accepts tags with the given attribute.
+      * @param attribute The attribute to search for.
+      * @param value The value that must be matched, or null if any value will match.
+      */
+     public HasAttributeFilter (String attribute, String value)
+     {
          mAttribute = attribute.toUpperCase ();
+         mValue = value;
      }

***************
*** 57,60 ****
--- 74,78 ----
      {
          TagNode tag;
+         Attribute attribute;
          boolean ret;

***************
*** 63,67 ****
          {
              tag = (TagNode)node;
!             ret = null != tag.getAttributeEx (mAttribute);
          }

--- 81,88 ----
          {
              tag = (TagNode)node;
!             attribute = tag.getAttributeEx (mAttribute);
!             ret = null != attribute;
!             if (ret && (null != mValue))
!                 ret = mValue.equals (attribute.getValue ());
          }

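The matching rule introduced in HasAttributeFilter above (attribute present, and if a value was given, exactly equal to it) can be sketched as a plain predicate. Everything here is hypothetical illustration rather than htmlparser API: the class AttributeFilterSketch, the hasAttribute factory, and the assumption that a tag's attributes are available as a map keyed by upper-cased names. As in the diff, the attribute name is upper-cased once at construction, while the value comparison uses equals and is therefore case-sensitive.

```java
import java.util.Locale;
import java.util.Map;
import java.util.function.Predicate;

public class AttributeFilterSketch {
    /**
     * Build a predicate over attribute maps (assumed keyed by upper-case name).
     * A null value means any value is acceptable, so only presence is checked.
     */
    public static Predicate<Map<String, String>> hasAttribute(String attribute, String value) {
        final String key = attribute.toUpperCase(Locale.ROOT);
        return attrs -> {
            if (!attrs.containsKey(key)) {
                return false;
            }
            return value == null || value.equals(attrs.get(key));
        };
    }
}
```

Such predicates compose with `and`, `or` and `negate`, in the same spirit as the AndFilter, OrFilter and NotFilter combination that WikiCapturer uses to exclude the navbar, actionbar and xhtml-validator DIVs.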
From: <der...@us...> - 2004-01-04 23:12:45
Update of /cvsroot/htmlparser/htmlparser/docs In directory sc8-pr-cvs1:/tmp/cvs-serv27606/docs Modified Files: changes.txt release.txt Log Message: Update version to 1.4-20040104. Index: changes.txt =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/changes.txt,v retrieving revision 1.193 retrieving revision 1.194 diff -C2 -d -r1.193 -r1.194 *** changes.txt 8 Dec 2003 01:31:49 -0000 1.193 --- changes.txt 4 Jan 2004 19:03:35 -0000 1.194 *************** *** 13,16 **** --- 13,355 ---- ******************************************************************************* + Integration Build 1.4 - 20040104 + -------------------------------- + + 2004-01-03 22:23 derrickoswald + + * build.xml, docs/bug.html, docs/contributors.html, + docs/index.html, docs/joinus.html, docs/mailinglists.html, + docs/main.html, docs/panel.html, docs/samples.html, + docs/support.html, docs/articles/index.html, + docs/articles/quest.html, docs/samples/index.html, bin/beanybaby, + bin/beanybaby.bat, bin/stringextractor, bin/stringextractor.bat, + src/org/htmlparser/Parser.java, + src/org/htmlparser/beans/LinkBean.java, + src/org/htmlparser/parserapplications/LinkExtractor.java, + src/org/htmlparser/parserapplications/SiteCapturer.java: + + Web site revamp, phase 1. + Main and first level pages are refurbished. The wiki is still to do. + Fixed bug #865279 Documentation + The samples directory is now orphaned and no longer shipped. + + 2004-01-02 14:32 derrickoswald + + * build.xml, src/org/htmlparser/Parser.java, + src/org/htmlparser/lexer/Cursor.java, + src/org/htmlparser/scanners/package.html: + + Gey keyword substitution working. + In the future, could developers ensure source files are initially check in with keyword substitution (-kkv) turned on. 
+ + 2004-01-02 11:24 derrickoswald + + * build.xml, docs/docs/FrequentlyAskedQuestions.html, + docs/docs/ParsingXml.html, docs/docs/ReviewerInformation.html, + docs/docs/TextExtractingVisitor.html, docs/docs/WebCrawler.html, + docs/docs/index.html, src/org/htmlparser/AbstractNode.java, + src/org/htmlparser/Node.java, src/org/htmlparser/Parser.java, + src/org/htmlparser/RemarkNode.java, + src/org/htmlparser/StringNode.java, + src/org/htmlparser/StringNodeFactory.java, + src/org/htmlparser/package.html, + src/org/htmlparser/beans/BeanyBaby.java, + src/org/htmlparser/beans/HTMLLinkBean.java, + src/org/htmlparser/beans/HTMLTextBean.java, + src/org/htmlparser/beans/LinkBean.java, + src/org/htmlparser/beans/StringBean.java, + src/org/htmlparser/beans/package.html, + src/org/htmlparser/filters/package.html, + src/org/htmlparser/lexer/Cursor.java, + src/org/htmlparser/lexer/Lexer.java, + src/org/htmlparser/lexer/Page.java, + src/org/htmlparser/lexer/PageIndex.java, + src/org/htmlparser/lexer/Source.java, + src/org/htmlparser/lexer/Stream.java, + src/org/htmlparser/lexer/package.html, + src/org/htmlparser/lexer/nodes/Attribute.java, + src/org/htmlparser/lexer/nodes/PageAttribute.java, + src/org/htmlparser/lexer/nodes/RemarkNode.java, + src/org/htmlparser/lexer/nodes/StringNode.java, + src/org/htmlparser/lexer/nodes/TagNode.java, + src/org/htmlparser/lexer/nodes/package.html, + src/org/htmlparser/lexerapplications/tabby/package.html, + src/org/htmlparser/lexerapplications/thumbelina/package.html, + src/org/htmlparser/nodeDecorators/AbstractNodeDecorator.java, + src/org/htmlparser/nodeDecorators/DecodingNode.java, + src/org/htmlparser/nodeDecorators/EscapeCharacterRemovingNode.java, + src/org/htmlparser/nodeDecorators/NonBreakingSpaceConvertingNode.java, + src/org/htmlparser/parserapplications/StringExtractor.java, + src/org/htmlparser/parserapplications/package.html, + src/org/htmlparser/scanners/package.html, + src/org/htmlparser/tags/AppletTag.java, + 
src/org/htmlparser/tags/BaseHrefTag.java, + src/org/htmlparser/tags/BodyTag.java, + src/org/htmlparser/tags/Bullet.java, + src/org/htmlparser/tags/BulletList.java, + src/org/htmlparser/tags/CompositeTag.java, + src/org/htmlparser/tags/Div.java, + src/org/htmlparser/tags/DoctypeTag.java, + src/org/htmlparser/tags/FormTag.java, + src/org/htmlparser/tags/FrameSetTag.java, + src/org/htmlparser/tags/FrameTag.java, + src/org/htmlparser/tags/HeadTag.java, + src/org/htmlparser/tags/Html.java, + src/org/htmlparser/tags/ImageTag.java, + src/org/htmlparser/tags/InputTag.java, + src/org/htmlparser/tags/JspTag.java, + src/org/htmlparser/tags/LabelTag.java, + src/org/htmlparser/tags/LinkTag.java, + src/org/htmlparser/tags/MetaTag.java, + src/org/htmlparser/tags/OptionTag.java, + src/org/htmlparser/tags/ScriptTag.java, + src/org/htmlparser/tags/SelectTag.java, + src/org/htmlparser/tags/Span.java, + src/org/htmlparser/tags/StyleTag.java, + src/org/htmlparser/tags/TableColumn.java, + src/org/htmlparser/tags/TableRow.java, + src/org/htmlparser/tags/TableTag.java, + src/org/htmlparser/tags/Tag.java, + src/org/htmlparser/tags/TextareaTag.java, + src/org/htmlparser/tags/TitleTag.java, + src/org/htmlparser/tags/package.html, + src/org/htmlparser/tests/AllTests.java, + src/org/htmlparser/tests/AssertXmlEqualsTest.java, + src/org/htmlparser/tests/BadTagIdentifier.java, + src/org/htmlparser/tests/FunctionalTests.java, + src/org/htmlparser/tests/InstanceofPerformanceTest.java, + src/org/htmlparser/tests/LineNumberAssignedByNodeReaderTest.java, + src/org/htmlparser/tests/ParserTest.java, + src/org/htmlparser/tests/ParserTestCase.java, + src/org/htmlparser/tests/PerformanceTest.java, + src/org/htmlparser/tests/package.html, + src/org/htmlparser/tests/codeMetrics/LineCounter.java, + src/org/htmlparser/tests/lexerTests/AllTests.java, + src/org/htmlparser/tests/lexerTests/AttributeTests.java, + src/org/htmlparser/tests/lexerTests/LexerTests.java, + 
src/org/htmlparser/tests/lexerTests/PageIndexTests.java, + src/org/htmlparser/tests/lexerTests/PageTests.java, + src/org/htmlparser/tests/lexerTests/SourceTests.java, + src/org/htmlparser/tests/lexerTests/StreamTests.java, + src/org/htmlparser/tests/lexerTests/TagTests.java, + src/org/htmlparser/tests/nodeDecoratorTests/AllTests.java, + src/org/htmlparser/tests/nodeDecoratorTests/DecodingNodeTest.java, + src/org/htmlparser/tests/nodeDecoratorTests/EscapeCharacterRemovingNodeTest.java, + src/org/htmlparser/tests/nodeDecoratorTests/NonBreakingSpaceConvertingNodeTest.java, + src/org/htmlparser/tests/parserHelperTests/AllTests.java, + src/org/htmlparser/tests/parserHelperTests/CompositeTagScannerHelperTest.java, + src/org/htmlparser/tests/parserHelperTests/RemarkNodeParserTest.java, + src/org/htmlparser/tests/parserHelperTests/StringParserTest.java, + src/org/htmlparser/tests/scannersTests/AllTests.java, + src/org/htmlparser/tests/scannersTests/CompositeTagScannerTest.java, + src/org/htmlparser/tests/scannersTests/JspScannerTest.java, + src/org/htmlparser/tests/scannersTests/ScriptScannerTest.java, + src/org/htmlparser/tests/scannersTests/TagScannerTest.java, + src/org/htmlparser/tests/scannersTests/XmlEndTagScanningTest.java, + src/org/htmlparser/tests/scannersTests/package.html, + src/org/htmlparser/tests/tagTests/AllTests.java, + src/org/htmlparser/tests/tagTests/AppletTagTest.java, + src/org/htmlparser/tests/tagTests/BaseHrefTagTest.java, + src/org/htmlparser/tests/tagTests/BodyTagTest.java, + src/org/htmlparser/tests/tagTests/CompositeTagTest.java, + src/org/htmlparser/tests/tagTests/DoctypeTagTest.java, + src/org/htmlparser/tests/tagTests/EndTagTest.java, + src/org/htmlparser/tests/tagTests/FormTagTest.java, + src/org/htmlparser/tests/tagTests/FrameSetTagTest.java, + src/org/htmlparser/tests/tagTests/FrameTagTest.java, + src/org/htmlparser/tests/tagTests/ImageTagTest.java, + src/org/htmlparser/tests/tagTests/InputTagTest.java, + 
src/org/htmlparser/tests/tagTests/JspTagTest.java, + src/org/htmlparser/tests/tagTests/LinkTagTest.java, + src/org/htmlparser/tests/tagTests/MetaTagTest.java, + src/org/htmlparser/tests/tagTests/ObjectCollectionTest.java, + src/org/htmlparser/tests/tagTests/OptionTagTest.java, + src/org/htmlparser/tests/tagTests/ScriptTagTest.java, + src/org/htmlparser/tests/tagTests/SelectTagTest.java, + src/org/htmlparser/tests/tagTests/StyleTagTest.java, + src/org/htmlparser/tests/tagTests/TagTest.java, + src/org/htmlparser/tests/tagTests/TextareaTagTest.java, + src/org/htmlparser/tests/tagTests/TitleTagTest.java, + src/org/htmlparser/tests/tagTests/package.html, + src/org/htmlparser/tests/utilTests/AllTests.java, + src/org/htmlparser/tests/utilTests/BeanTest.java, + src/org/htmlparser/tests/utilTests/CharacterTranslationTest.java, + src/org/htmlparser/tests/utilTests/HTMLLinkProcessorTest.java, + src/org/htmlparser/tests/utilTests/HTMLParserUtilsTest.java, + src/org/htmlparser/tests/utilTests/NodeListTest.java, + src/org/htmlparser/tests/utilTests/SortTest.java, + src/org/htmlparser/tests/utilTests/package.html, + src/org/htmlparser/tests/visitorsTests/AllTests.java, + src/org/htmlparser/tests/visitorsTests/CompositeTagFindingVisitorTest.java, + src/org/htmlparser/tests/visitorsTests/HtmlPageTest.java, + src/org/htmlparser/tests/visitorsTests/LinkFindingVisitorTest.java, + src/org/htmlparser/tests/visitorsTests/NodeVisitorTest.java, + src/org/htmlparser/tests/visitorsTests/StringFindingVisitorTest.java, + src/org/htmlparser/tests/visitorsTests/TagFindingVisitorTest.java, + src/org/htmlparser/tests/visitorsTests/TextExtractingVisitorTest.java, + src/org/htmlparser/tests/visitorsTests/UrlModifyingVisitorTest.java, + src/org/htmlparser/util/ChainedException.java, + src/org/htmlparser/util/CommandLine.java, + src/org/htmlparser/util/DefaultParserFeedback.java, + src/org/htmlparser/util/FeedbackManager.java, + src/org/htmlparser/util/Generate.java, + 
src/org/htmlparser/util/IteratorImpl.java, + src/org/htmlparser/util/LinkProcessor.java, + src/org/htmlparser/util/NodeIterator.java, + src/org/htmlparser/util/NodeList.java, + src/org/htmlparser/util/ParserException.java, + src/org/htmlparser/util/ParserFeedback.java, + src/org/htmlparser/util/ParserUtils.java, + src/org/htmlparser/util/PeekingIterator.java, + src/org/htmlparser/util/SimpleNodeIterator.java, + src/org/htmlparser/util/SpecialHashtable.java, + src/org/htmlparser/util/Translate.java, + src/org/htmlparser/util/package.html, + src/org/htmlparser/util/sort/Ordered.java, + src/org/htmlparser/util/sort/Sort.java, + src/org/htmlparser/util/sort/Sortable.java, + src/org/htmlparser/util/sort/package.html, + src/org/htmlparser/visitors/HtmlPage.java, + src/org/htmlparser/visitors/LinkFindingVisitor.java, + src/org/htmlparser/visitors/NodeVisitor.java, + src/org/htmlparser/visitors/ObjectFindingVisitor.java, + src/org/htmlparser/visitors/StringFindingVisitor.java, + src/org/htmlparser/visitors/TagFindingVisitor.java, + src/org/htmlparser/visitors/TextExtractingVisitor.java, + src/org/htmlparser/visitors/UrlModifyingVisitor.java, + src/org/htmlparser/visitors/package.html: + + Update the integration procedure to use a CVS tag, + rather than changing the version in every file header. 
+ + 2004-01-02 00:01 derrickoswald + + * src/org/htmlparser/tests/: ParserTest.java, + lexerTests/AttributeTests.java, lexerTests/LexerTests.java, + parserHelperTests/RemarkNodeParserTest.java, + scannersTests/ScriptScannerTest.java, tagTests/DoctypeTagTest.java, + tagTests/FormTagTest.java, tagTests/ImageTagTest.java, + tagTests/JspTagTest.java, tagTests/LinkTagTest.java: + + Added testcases but was unable to reproduce the following bugs in the version 1.4 codebase: + 839264 toHtml() parse error in Javascripts with "form" keyword + 833592 DOCTYPE element is not parsed correctly + 826764 ParserException occurs only when using setInputHTML() instea + 825820 Words conjoined + 825645 <input> not getting parsed inside table + 813838 links not parsed correctly + and + #851882 zero length alt tag causes bug in ImageScanner + #832530 empty attribute causes parser to fail + #805598 attribute src in tag img sometimes not correctly parsed + (these 3 are all the same bug, duplicates of the following): + #753012 IMG SRC not parsed v1.3 & v1.4 + #755929 Empty string attr. value causes attr parsing to be stopped + #778781 SRC-attribute suppression in IMG-tags + Also reviewed these test cases, again, with none reproducible in 1.4: + #788746 parser crashes on comments like <!-- foobar --!> + #772700 Jsp Tags are not parsed correctly when in quoted attributes. + + 2004-01-01 12:16 derrickoswald + + * src/org/htmlparser/visitors/UrlModifyingVisitor.java: + + Fix support request #824989 UrlModifyingVisitor taking out HTML comments. + Added remark node handling to preserve comments. + + 2004-01-01 11:54 derrickoswald + + * src/org/htmlparser/tests/tagTests/FormTagTest.java: + + Added test case for investigation of support request #772998 Cannot extract input tags. + Not reproducible. 
+ + 2003-12-31 09:40 derrickoswald + + * src/org/htmlparser/: lexer/Lexer.java, + tests/lexerTests/LexerTests.java: + + Fix bug #789439 Japanese page causes OutOfMemory Exception + Modified the lexer to skip over JIS escape sequences. + + 2003-12-30 23:03 derrickoswald + + * src/org/htmlparser/tests/lexerTests/SourceTests.java: + + Fix Source 'SameChars' test. + + 2003-12-30 21:50 derrickoswald + + * docs/banner.html, docs/bar.gif, docs/docsindex.html, + docs/panel.html, docs/sample.html, src/doc-files/todo.html, + bin/crawler.bat, bin/linkextractor, bin/linkextractor.bat, + bin/ripper.bat, src/org/htmlparser/lexer/Stream.java, + src/org/htmlparser/parserapplications/LinkExtractor.java, + src/org/htmlparser/parserapplications/MailRipper.java, + src/org/htmlparser/parserapplications/Robot.java, + src/org/htmlparser/tests/lexerTests/StreamTests.java, + src/org/htmlparser/util/NodeList.java: + + Add filter support to NodeList. + Rework LinkExtractor and remove MailRipper and Robot example programs. + Clean out docs directory. + + 2003-12-29 09:18 derrickoswald + + * src/org/htmlparser/: lexer/Page.java, tags/BaseHrefTag.java, + parserapplications/SiteCapturer.java, tags/FrameTag.java, + tests/tagTests/BaseHrefTagTest.java: + + Add simplistic web site capture example application. + Demonstration of using custom tags in the NodeFactory. + Fixed various issues with URL rewriting. 
+ + 2003-12-20 18:47 derrickoswald + + * src/: doc-files/todo.html, + org/htmlparser/scanners/CompositeTagScanner.java, + org/htmlparser/scanners/JspScanner.java, + org/htmlparser/scanners/Scanner.java, + org/htmlparser/scanners/ScriptScanner.java, + org/htmlparser/scanners/TagScanner.java, + org/htmlparser/scanners/package.html, + org/htmlparser/tags/CompositeTag.java, + org/htmlparser/tests/scannersTests/CompositeTagScannerTest.java, + org/htmlparser/util/IteratorImpl.java, + org/htmlparser/util/NodeList.java, + org/htmlparser/util/PeekingIteratorImpl.java: + + Reduce recursion on the JVM stack in CompositeTagScanner. + Pass a stack of open tags to the scanner. + Add smarter tag closing by walking up the stack on encountering an unopened end tag. + Avoids a problem with bad HTML such as that found at + http://scores.nba.com/games/20031029/scoreboard.html by Shaun Roach. + Added testInvalidNesting to CompositeTagScanner Test based on the above. + + 2003-12-15 21:29 derrickoswald + + * build.xml, src/org/htmlparser/lexer/Page.java, + resources/HtmlTaglet.java, resources/stylesheet.css, + src/doc-files/building.html, src/doc-files/overview.html, + src/doc-files/todo.html, + src/org/htmlparser/lexerapplications/thumbelina/Picture.java, + src/org/htmlparser/lexerapplications/thumbelina/ThumbelinaFrame.java, + src/org/htmlparser/tags/CompositeTag.java: + + Javadoc changes and additions. Stylesheet, overview, build instructions and todo list. + Added HTMLTaglet, an inline Javadoc taglet for embedding HTML into javadocs. + + 2003-12-08 08:13 derrickoswald + + * src/org/htmlparser/: AbstractNode.java, Node.java, Parser.java, + PrototypicalNodeFactory.java, StringNodeFactory.java, + lexer/nodes/PageAttribute.java, scanners/CompositeTagScanner.java, + scanners/TagScanner.java, tags/DoctypeTag.java, tags/FormTag.java, + tags/FrameSetTag.java, tags/ImageTag.java, + tests/scannersTests/CompositeTagScannerTest.java: + + Eliminate deprecation warnings. 
+ Clean up javadoc warnings. + Integration Build 1.4 - 20031207 -------------------------------- Index: release.txt =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/release.txt,v retrieving revision 1.52 retrieving revision 1.53 diff -C2 -d -r1.52 -r1.53 *** release.txt 8 Dec 2003 01:31:50 -0000 1.52 --- release.txt 4 Jan 2004 19:03:35 -0000 1.53 *************** *** 1,68 **** ! HTMLParser Version 1.4 (Integration Build Dec 07, 2003) ********************************************* ! Q1. What are the contents of the distribution ? ! Q2. Which part of the source code should I analyze to be able to use it in my project ? ! Q3. How do I extend the capability of this library ? ! ! ********************************************************* ! Q1. What are the contents of the distribution ? ! ! A1. The distribution contains : ! (i) binary jar files - htmlparser.jar and lexer.jar (in lib directory) ! (ii) source code - src.zip (in distribution directory) ! Also contains necessary resources, and build file. Unzip this ! and you should be all set to build the parser from its source. ! You would need Jakarta Ant already installed to run the script. ! (iii) documentation - doc directory (includes javadoc) ! (iv) Executing scripts - There are three scripts (in the bin directory): ! (a) runParser.bat : Runs the html parser ! (b) runCrawler.bat : Runs the robot crawler ! (c) runRipper.bat : Runs the mail ripper ! (d) runLexer.bat : Runs the low lever lexer ! (e) lexer : Runs the low lever lexer on linux/unix ! All four batch files assume that java 1.2 (or upwards) is visible in your path. ! ! Issue the following command : ! java -jar htmlparser.jar myURL (or runParser myURL) ! ! The following are legal examples of usage: ! java -jar htmlparser.jar http://www.yahoo.com (or runParser http://www.yahoo.com) ! java -jar htmlparser.jar http://www.google.com -l (or runParser http://www.google.com -l) ! 
java -jar htmlparser.jar c:\html\test.html (or runParser c:\html\test.html) ! ! To see a list of options, simply type : ! java -jar htmlparser.jar ! ! and the switches allowed will be shown. ! ! ---------- *** ------------ ! Q2. Which part of the source code should I analyze to be able to use it in my project ? ! ! Go through the javadoc of Parser.java - it contains all the examples to get you started. ! ! Also go through the package org.htmlparser.parserapplications. ! The two applications here, Robot and MailRipper, have been expressly written for the purpose of demonstrating the usage of htmlparser. ! ! ---------- *** ------------ ! Q3. How do I extend the capability of this library ? ! You need to extend the HTMLTagScanner in order to take advantage of existing ! code to identify a generic tag. You could add further processing code based ! on the tag contents. For more information on how to do this, examine the code ! of the scanners in the com.kizna.html.scanners package. The scanners ! basically use a Template Method+Factory Method approach. You need to ! override the evaluate() - optional,scan() and getID() methods - compulsory, and register your ! scanner to the parser before you begin parsing. ! This process is simple, check the com.kizna.html.scanners package ! to see how the existing scanners work. ! ---------- *** ------------ ! Acknowledgements : The following people have contributed important bug reports, feature ideas : [1] Kaarle Kaaila --- 1,96 ---- ! HTMLParser Version 1.4 (Integration Build Jan 04, 2004) ********************************************* ! Contents of the distribution ! ---------------------------- ! (i) binary jar files - htmlparser.jar and lexer.jar (in lib directory) ! (ii) source code - src.zip ! Also contains necessary resources, and build file. Unzip this ! and you should be all set to build the parser from its source. ! You would need Jakarta Ant installed. ! (iii) documentation - docs directory (includes javadoc) ! 
Point your browser at index.html in the docs directory. ! (iv) executing scripts - bin directory ! Batch files assume that java 1.2 (or upwards) is visible in your path. ! (v) this file ! Changes since Version 1.3 ! ------------------------- ! Decorators ! The node decorator package has been added to provide support for the ! delegate model. ! Lexer ! A new lexer i/o subsystem has been added. This provides accurate line number ! and character position data, tag and attribute names maintain their original ! case, and attributes maintain their original order. Line numbers reported by ! tags are now zero based, not one based. The node count for parsing goes up ! in most cases because whitespace is strictly maintained, i.e. every ! whitespace (i.e. newline) now counts as a StringNode too. Storage of ! attributes is now in a Vector which means the element 0 Attribute is ! actually the name of the tag, rather than having the $TAGNAME entry in a ! HashTable. The htmllexer.jar is this new i/o subsystem broken out and made ! JDK 1.1 compliant, the htmlparser.jar, which includes everything in ! htmllexer.jar, is not necessarily intended to be used in JDK 1.1 ! environments. Some support for JIS escape sequences has been added. ! Tags ! Zero arg tag constructors have been added. Attribute maintenance ! (add/remove/edit) improved. There is no EndTag class any more. Just a ! generic tag that responds true to isEndTag(). Improvements to form tag ! handling, getting <input> and <textarea> tags nested within other tags. ! Improvements to applet tag handling regarding parameters and codebases. ! Scanners ! The concept of scanners has been completely reworked. Applications register ! tags not scanners to express interest in parsing only some tags. The default ! is now to parse all tags, which is equivalent to the old registerDOMTags(), ! so some extra nesting of tags will need to be handled. CompositeTagScanner ! logic has been improved to try and match unclosed open tags when an ! 
unexpected end tag is encountered. This change also moved recursion off the ! JDK stack, eliminating most StackOverflow exceptions. Also, a CompositeTag's ! "startTag()" is "this", and the CompositeTagScanner just adds children. ! Filters ! A new powerful filtering capability has been added, which makes extracting ! specific tags very easy. ! Applications ! New example applications Thumbelina and SiteCapturer. ! Bug Fixes ! --------- ! 865279 Documentation ! 851882 zero length alt tag causes bug in ImageScanner ! 839264 toHtml() parse error in Javascripts with "form" keyword ! 833592 DOCTYPE element is not parsed correctly ! 832530 empty attribute causes parser to fail ! 826764 ParserException occurs only when using setInputHTML() instea ! 825820 Words conjoined ! 825645 <input> not getting parsed inside table ! 813838 links not parsed correctly ! 805598 attribute src in tag img sometimes not correctly parsed ! 801118 two " characters at the end of an attribute value problem ! 798554 Applet Tag does not update codebase data ! 798553 setInputHtml does not set text ! 798552 Sample for node iterator incorrect ! 789439 Japanese page causes OutOfMemory Exception ! 788746 parser crashes on comments like <!-- foobar --!> ! 786869 LinkExtractor Sample not working ! 784767 irc://server/channel urls are HTTPLike? ! 778781 SRC-attribute suppression in IMG-tags ! 772700 Jsp Tags are not parsed correctly when in quoted attributes ! 765413 typo ! 761798 Error reading next element. ! 757337 Standalone attributes should remain standalone ! 755929 Empty string attr. value causes attr parsing to be stopped ! 753012 IMG SRC not parsed v1.3 & v1.4 ! 753003 <IMG> within <A> missed when followed by <MAP> ! 750117 StackOverFlow while Node-Iteration ! 749295 Problem Parsing Table ! 745566 StackOverflowError on select with too many unclosed options ! 744610 getLink() Erroneous for Relative Links from Files on Windows ! Acknowledgements ! 
---------------- The following people have contributed important bug reports, feature ideas : [1] Kaarle Kaaila *************** *** 91,94 **** --- 119,124 ---- [24] Derrick Oswald [25] Joshua Kerievsky + [26] Stephen Nightingale + [27] Donnla Nic Gearailt If you find any bugs, please go to *************** *** 97,102 **** And please join the HTMLParser-User mailing list ! to get help on getting started. Join HTMLParser-Developer ! to become a developer on this project. Join HTMLParser-Announce to be notified whenever a new release is out. --- 127,131 ---- And please join the HTMLParser-User mailing list ! to get help on getting started. Join HTMLParser-Announce to be notified whenever a new release is out. |
From: <der...@us...> - 2004-01-04 23:10:46
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser In directory sc8-pr-cvs1:/tmp/cvs-serv27606/src/org/htmlparser Modified Files: Parser.java Log Message: Update version to 1.4-20040104. Index: Parser.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/Parser.java,v retrieving revision 1.81 retrieving revision 1.82 diff -C2 -d -r1.81 -r1.82 *** Parser.java 4 Jan 2004 03:23:09 -0000 1.81 --- Parser.java 4 Jan 2004 19:03:36 -0000 1.82 *************** *** 94,98 **** */ public final static String ! VERSION_DATE = "Dec 07, 2003" ; --- 94,98 ---- */ public final static String ! VERSION_DATE = "Jan 04, 2004" ; |
From: <der...@us...> - 2004-01-04 03:23:12
|
Update of /cvsroot/htmlparser/htmlparser/bin In directory sc8-pr-cvs1:/tmp/cvs-serv11427/htmlparser/bin Added Files: beanybaby beanybaby.bat stringextractor stringextractor.bat Log Message: Web site revamp, phase 1. Main and first level pages are refurbished. The wiki is still to do. Fixed bug #865279 Documentation The samples directory is now orphaned and no longer shipped. --- NEW FILE: beanybaby --- #! /bin/sh if [ -z "$HTMLPARSER_HOME" ] ; then ## resolve links - $0 may be a link to the home PRG="$0" progname=`basename "$0"` saveddir=`pwd` # need this for relative symlinks dirname_prg=`dirname "$PRG"` cd "$dirname_prg" while [ -h "$PRG" ] ; do ls=`ls -ld "$PRG"` link=`expr "$ls" : '.*-> \(.*\)$'` if expr "$link" : '/.*' > /dev/null; then PRG="$link" else PRG=`dirname "$PRG"`"/$link" fi done HTMLPARSER_HOME=`dirname "$PRG"`/.. cd "$saveddir" # make it fully qualified HTMLPARSER_HOME=`cd "$HTMLPARSER_HOME" && pwd` fi if [ -z "$JAVACMD" ] ; then if [ -n "$JAVA_HOME" ] ; then if [ -x "$JAVA_HOME/jre/sh/java" ] ; then # IBM's JDK on AIX uses strange locations for the executables JAVACMD="$JAVA_HOME/jre/sh/java" else JAVACMD="$JAVA_HOME/bin/java" fi else JAVACMD=`which java 2> /dev/null ` if [ -z "$JAVACMD" ] ; then JAVACMD=java fi fi fi if [ ! -x "$JAVACMD" ] ; then echo "Error: JAVA_HOME is not defined correctly." 
echo " We cannot execute $JAVACMD" exit 1 fi if [ -n "$CLASSPATH" ] ; then LOCALCLASSPATH="$CLASSPATH" fi HTMLPARSER_LIB="${HTMLPARSER_HOME}/lib" # add in the parser .jar file if [ -z "$LOCALCLASSPATH" ] ; then LOCALCLASSPATH="${HTMLPARSER_LIB}/htmlparser.jar" else LOCALCLASSPATH="${HTMLPARSER_LIB}/htmlparser.jar":"$LOCALCLASSPATH" fi # handle 1.1x JDKs if [ -n "$JAVA_HOME" ] ; then if [ -f "$JAVA_HOME/lib/classes.zip" ] ; then LOCALCLASSPATH="$LOCALCLASSPATH:$JAVA_HOME/lib/classes.zip" fi fi "$JAVACMD" -classpath "$LOCALCLASSPATH" org.htmlparser.beans.BeanyBaby "$@" --- NEW FILE: beanybaby.bat --- java -jar ..\lib\htmlparser.jar org.htmlparser.beans.BeanyBaby %1 %2 --- NEW FILE: stringextractor --- #! /bin/sh if [ -z "$HTMLPARSER_HOME" ] ; then ## resolve links - $0 may be a link to the home PRG="$0" progname=`basename "$0"` saveddir=`pwd` # need this for relative symlinks dirname_prg=`dirname "$PRG"` cd "$dirname_prg" while [ -h "$PRG" ] ; do ls=`ls -ld "$PRG"` link=`expr "$ls" : '.*-> \(.*\)$'` if expr "$link" : '/.*' > /dev/null; then PRG="$link" else PRG=`dirname "$PRG"`"/$link" fi done HTMLPARSER_HOME=`dirname "$PRG"`/.. cd "$saveddir" # make it fully qualified HTMLPARSER_HOME=`cd "$HTMLPARSER_HOME" && pwd` fi if [ -z "$JAVACMD" ] ; then if [ -n "$JAVA_HOME" ] ; then if [ -x "$JAVA_HOME/jre/sh/java" ] ; then # IBM's JDK on AIX uses strange locations for the executables JAVACMD="$JAVA_HOME/jre/sh/java" else JAVACMD="$JAVA_HOME/bin/java" fi else JAVACMD=`which java 2> /dev/null ` if [ -z "$JAVACMD" ] ; then JAVACMD=java fi fi fi if [ ! -x "$JAVACMD" ] ; then echo "Error: JAVA_HOME is not defined correctly." 
echo " We cannot execute $JAVACMD" exit 1 fi if [ -n "$CLASSPATH" ] ; then LOCALCLASSPATH="$CLASSPATH" fi HTMLPARSER_LIB="${HTMLPARSER_HOME}/lib" # add in the parser .jar file if [ -z "$LOCALCLASSPATH" ] ; then LOCALCLASSPATH="${HTMLPARSER_LIB}/htmlparser.jar" else LOCALCLASSPATH="${HTMLPARSER_LIB}/htmlparser.jar":"$LOCALCLASSPATH" fi # handle 1.1x JDKs if [ -n "$JAVA_HOME" ] ; then if [ -f "$JAVA_HOME/lib/classes.zip" ] ; then LOCALCLASSPATH="$LOCALCLASSPATH:$JAVA_HOME/lib/classes.zip" fi fi "$JAVACMD" -classpath "$LOCALCLASSPATH" org.htmlparser.parserapplications.StringExtractor "$@" --- NEW FILE: stringextractor.bat --- java -jar ..\lib\htmlparser.jar org.htmlparser.parserapplications.StringExtractor %1 %2 |
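Note on the two launcher scripts above: both locate HTMLPARSER_HOME by following $0 through any chain of symbolic links before taking its directory. The loop can be tried in isolation with the sketch below. The function name resolve_link is hypothetical and does not appear in the committed scripts; the sketch assumes an `ls -ld` that prints symlinks as `name -> target`, as the scripts themselves do.

```shell
#!/bin/sh
# Hypothetical helper (not part of the committed scripts): follow a chain of
# symbolic links the same way the beanybaby/stringextractor launchers do.
resolve_link() {
    PRG="$1"
    while [ -h "$PRG" ] ; do
        # Long listing of a symlink ends in "name -> target".
        listing=`ls -ld "$PRG"`
        # Keep only the text after "-> ", i.e. the link target.
        link=`expr "$listing" : '.*-> \(.*\)$'`
        if expr "$link" : '/.*' > /dev/null; then
            # Absolute target: use it as-is.
            PRG="$link"
        else
            # Relative target: interpret it relative to the link's directory.
            PRG=`dirname "$PRG"`"/$link"
        fi
    done
    echo "$PRG"
}
```

In the launchers the result is then passed through `dirname` and `..` to reach the installation root, and `cd ... && pwd` makes it fully qualified.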
From: <der...@us...> - 2004-01-04 03:23:12
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications In directory sc8-pr-cvs1:/tmp/cvs-serv11427/htmlparser/src/org/htmlparser/parserapplications Modified Files: LinkExtractor.java SiteCapturer.java Log Message: Web site revamp, phase 1. Main and first level pages are refurbished. The wiki is still to do. Fixed bug #865279 Documentation The samples directory is now orphaned and no longer shipped. Index: LinkExtractor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications/LinkExtractor.java,v retrieving revision 1.50 retrieving revision 1.51 diff -C2 -d -r1.50 -r1.51 *** LinkExtractor.java 31 Dec 2003 02:50:50 -0000 1.50 --- LinkExtractor.java 4 Jan 2004 03:23:09 -0000 1.51 *************** *** 61,65 **** null, null, ! "http://htmlparser.sourceforge.net/docs/"); if (null == url) System.exit (1); --- 61,65 ---- null, null, ! "http://htmlparser.sourceforge.net/wiki/"); if (null == url) System.exit (1); Index: SiteCapturer.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/parserapplications/SiteCapturer.java,v retrieving revision 1.1 retrieving revision 1.2 diff -C2 -d -r1.1 -r1.2 *** SiteCapturer.java 29 Dec 2003 14:18:24 -0000 1.1 --- SiteCapturer.java 4 Jan 2004 03:23:09 -0000 1.2 *************** *** 637,641 **** null, null, ! "http://htmlparser.sourceforge.net/docs"); if (null != url) worker.setSource (url); --- 637,641 ---- null, null, ! "http://htmlparser.sourceforge.net/wiki"); if (null != url) worker.setSource (url); |
Update of /cvsroot/htmlparser/htmlparser/docs In directory sc8-pr-cvs1:/tmp/cvs-serv11427/htmlparser/docs Modified Files: bug.html contributors.html index.html mailinglists.html main.html panel.html support.html Added Files: joinus.html samples.html Log Message: Web site revamp, phase 1. Main and first level pages are refurbished. The wiki is still to do. Fixed bug #865279 Documentation The samples directory is now orphaned and no longer shipped. --- NEW FILE: joinus.html --- <!doctype html public "-//w3c//dtd html 4.0 transitional//en"> <html> <head> <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> <meta name="Author" content="Somik Raha"> <meta name="GENERATOR" content="Mozilla/4.61 [en] (WinNT; I) [Netscape]"> <title>Join the HTML Parser Project</title> <link REL ="stylesheet" TYPE="text/css" HREF="javadoc/stylesheet.css" TITLE="Style"> </head> <body> <h2>Join The HTML Parser Project</h2> <p>If you wish to join the htmlparser project as a developer, you have to register as a developer at <a href="http://sourceforge.net/account/register.php">SourceForge</a>. <p>Send your sourceforge login name or id and a brief resume (a write-up about yourself) to <p><a href="http://sourceforge.net/sendmessage.php?touser=605407">Derrick Oswald</a><br> <p>You also need to sign up on the <a href="http://lists.sourceforge.net/lists/listinfo/htmlparser-developer">HTMLParser-Developer</a> mailing list - this is the forum we use for collaborating on this project. You may also want to sign up on the <a href="http://lists.sourceforge.net/lists/listinfo/htmlparser-user">HTMLParser-User</a> mailing list to monitor other user activity. 
<p>What would you gain by joining us : <ul> <li>If you are a student, you'ld learn a lot about architecture, test-driven development and refactoring.</li> <li>If you are a professional, you'd have fun interacting with other professionals and making a super-fast parser even more powerful.</li> </ul> <p>We'll be happy to have you with us!</p> </body> </html> --- NEW FILE: samples.html --- <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> <html> <head> <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> <title>Sample Programs</title> <link REL ="stylesheet" TYPE="text/css" HREF="javadoc/stylesheet.css" TITLE="Style"> </head> <body> <h2>Sample Programs</h2> <p>The example programs included with the HTML Parser distribution are listed below, with some details.</p> <p><strong>Note:</strong> On unix systems if you used the Java jar command or some older unzip utility to extract the distribution zip file, the executable flag will not have been preserved on the files in the bin directory. You can fix this by issuing the following command: <pre> <code>chmod u+x bin/*</code> </pre> <p> <table width="94%" border="0"> <tr> <td valign="top"> <strong>Parser</strong><br> </td> <td> <i>Parse a web page and print the tags in a simple loop.</i><br> <a href="../javadoc/org/htmlparser/Parser.html#main(java.lang.String[])" target="_parent">org.htmlparser.Parser.main(String[] args)</a> <pre> <code>bin/parser http://website_url [tag_name]</code> where tag_name is an optional tag name to be used as a filter, i.e. 
A - Show only the link tags extracted from the document IMG - Show only the image tags extracted from the document TITLE - Extract the title from the document NOTE: this is also the default program for the htmlparser.jar, so the above could be: <code>java -jar lib/htmlparser.jar http://website_url [tag_name]</code> </pre> </td> </tr> <tr> <td valign="top"> <strong>Link Extractor</strong><br> </td> <td> <i>Extract links/mail addresses from a web page.</i><br> <a href="../javadoc/org/htmlparser/parserapplications/LinkExtractor.html" target="_parent">org.htmlparser.parserapplications.LinkExtractor</a> <pre> <code>bin/linkextractor http://website_url [-maillinks]</code> the optional -maillinks argument causes mailto: links to be printed </pre> </td> </tr> <tr> <td valign="top"> <strong>String Extractor</strong><br> </td> <td> <i>Extract text from a web page.</i><br> <a href="../javadoc/org/htmlparser/parserapplications/LinkExtractor.html" target="_parent">org.htmlparser.parserapplications.StringExtractor</a> <pre> <code>bin/stringextractor http://website_url [-links]</code> the optional -links argument causes hyperlinks to be shown within the text </pre> </td> </tr> <tr> <td valign="top"> <strong>Site Capturer</strong><br> </td> <td> <i>Save a web site locally.</i><br> <a href="../javadoc/org/htmlparser/parserapplications/SiteCapturer.html" target="_parent">org.htmlparser.parserapplications.SiteCapturer</a> <pre> <code>bin/sitecapturer http://source_website /target_directory/ [true|false]</code> the optional boolean argument determines whether resources such as images, audio and video are to be captured </pre> </td> </tr> <tr> <td valign="top"> <strong>Thumbelina</strong><br> </td> <td> <i>View images behind thumbnails.</i><br> <a href="../javadoc/org/htmlparser/lexerapplications/thumbelina/package-summary.html" target="_parent">org.htmlparser.lexerapplications.thumbelina.Thumbelina</a> <pre> <code>bin/thumbelina [http://starting_website]</code> </pre> </td> </tr> <tr> 
<td valign="top"> <strong>BeanyBaby</strong><br> </td> <td> <i>Parser Java Bean demo.</i><br> <a href="../javadoc/org/htmlparser/beans/BeanyBaby.html" target="_parent">org.htmlparser.beans.BeanyBaby</a> <pre> <code>bin/beanybaby [http://starting_website]</code> </pre> </td> </tr> </table> </body> </html> Index: bug.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/bug.html,v retrieving revision 1.1 retrieving revision 1.2 diff -C2 -d -r1.1 -r1.2 *** bug.html 15 Dec 2002 03:45:00 -0000 1.1 --- bug.html 4 Jan 2004 03:23:08 -0000 1.2 *************** *** 2,23 **** <html> <head> ! <title>Bug Reports</title> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> </head> - <body> <h2>Bug Reports </h2> ! <p>You can submit bug reports here. Before you proceed, please note that you <strong>should</strong> ! <a href="http://sourceforge.net/account/login.php">sign in at sourceforge</a>, ! for only then will our responses reach you. Putting in an anonymous bug report ! will provide us with no way of reaching you. If you do not have a sourceforge ! login, then please consider mentioning your email address in the bug report.</p> <p>Checklist <strong>BEFORE</strong> you submit your bug report :</p> <ul> <li>Have you <a href="http://sourceforge.net/tracker/?func=browse&group_id=24399&atid=381399">checked ! the list of older bug reports</a> </li> ! <li>Have you written a testcase to simulate your bug ? Why do we request this ! - check <a href="design/tests.html#communicate">Communicate with Testcases</a>. ! We do take reports without testcases, but pls note that such reports may take longer for us to respond to.</li> </ul> --- 2,25 ---- <html> <head> ! <title>Bug Reports</title> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <link REL ="stylesheet" TYPE="text/css" HREF="javadoc/stylesheet.css" TITLE="Style"> </head> <body> <h2>Bug Reports </h2> ! 
<p>You can submit bug reports here. Before you proceed, please note that you ! <strong>must</strong> ! <a href="http://sourceforge.net/account/login.php">login to sourceforge</a>. ! This is required so that bug status reports can be forwarded to you. ! If you do not have a sourceforge login, you can get one ! <a href="http://sourceforge.net/account/register.php">here</a>.</p> <p>Checklist <strong>BEFORE</strong> you submit your bug report :</p> <ul> + <li>Have you pretty much isolated the problem to the HTML Parser component.</li> <li>Have you <a href="http://sourceforge.net/tracker/?func=browse&group_id=24399&atid=381399">checked ! the list of older bug reports</a></li> ! <li>Have you written a testcase to simulate your bug? Why do we request this? ! - check <a href="wiki/TestDrivenDevelopment.html">Test Driven Development</a>. ! We do take reports without testcases, but please note that such reports may take longer for us to respond to.</li> </ul> Index: contributors.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/contributors.html,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** contributors.html 27 Apr 2003 19:08:21 -0000 1.3 --- contributors.html 4 Jan 2004 03:23:08 -0000 1.4 *************** *** 2,13 **** <html> <head> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <meta name="Author" content="Somik Raha"> ! <meta name="GENERATOR" content="Mozilla/4.61 [en] (WinNT; I) [Netscape]"> ! <title>Contributors to HTML Parser</title> </head> ! <body text="#000000" bgcolor="#FFFFFF" link="#3333FF" vlink="#FF6600" alink="#FFCC00"> ! <b><u>Contributors</u></b><b><u></u></b> ! <p>The following people have contributed to this project : <table width="94%" border="1"> <tr> --- 2,13 ---- <html> <head> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <meta name="Author" content="Somik Raha"> ! 
<meta name="GENERATOR" content="Mozilla/4.61 [en] (WinNT; I) [Netscape]"> ! <title>Contributors to HTML Parser</title> ! <link REL ="stylesheet" TYPE="text/css" HREF="javadoc/stylesheet.css" TITLE="Style"> </head> ! <body> ! <h1>Contributors</h1> <table width="94%" border="1"> <tr> *************** *** 22,27 **** K1R 7Y2<br> (613) 755-5065 ! <br> <a href="http://www.autodesk.com">http://www.autodesk.com</a><br> ! <a href="mailto:Der...@Au...">Der...@Au...</a> <br> Voice : 613.755.5065<br> --- 22,28 ---- K1R 7Y2<br> (613) 755-5065 ! <br> <a href="http://www.autodesk.com">http://www.autodesk.com</a><br> ! <!--a href="mailto:Der...@Au...">Der...@Au...</a--> ! <a href="http://sourceforge.net/sendmessage.php?touser=605407">email</a> <br> Voice : 613.755.5065<br> *************** *** 64,69 **** CA USA 94709<br> <a href="http://www.industriallogic.com">http://www.industriallogic.com</a><br> <a href="http://www.industrialxp.org">http://www.industrialxp.org</a><br> ! <a href="http://www.geocities.com/somik/">Personal Home Page</a><br> ! <a href="mailto:so...@in...">so...@in...</a> <br> Voice : 510.540.8336<br> --- 65,71 ---- CA USA 94709<br> <a href="http://www.industriallogic.com">http://www.industriallogic.com</a><br> <a href="http://www.industrialxp.org">http://www.industrialxp.org</a><br> ! <a href="http://www.geocities.com/somik/">Personal Home Page</a><br> ! <!--a href="mailto:so...@in...">so...@in...</a--> ! <a href="http://sourceforge.net/sendmessage.php?touser=187944">email</a> <br> Voice : 510.540.8336<br> *************** *** 102,107 **** CA USA 94709<br> <a href="http://www.industriallogic.com">http://www.industriallogic.com</a><br> ! <a href="http://www.industrialxp.org">http://www.industrialxp.org</a><br> ! <a href="mailto:jo...@in...">so...@in...</a> <br> Voice : 510.540.8336<br> --- 104,110 ---- CA USA 94709<br> <a href="http://www.industriallogic.com">http://www.industriallogic.com</a><br> ! 
<a href="http://www.industrialxp.org">http://www.industrialxp.org</a><br> ! <!--a href="mailto:jo...@in...">jo...@in...</a--> ! <a href="http://sourceforge.net/sendmessage.php?touser=344339">email</a> <br> Voice : 510.540.8336<br> *************** *** 130,135 **** Kalenteritie 23 B 4<br> 02200 Espoo, Finland<br> ! tel: +358-50-3725844<br> <a href="http://www.kk-software.fi">http://www.kk-software.fi</a><br> ! <a href="mailto:kaa...@ik..."> kaa...@ik...</a><br> </td> <td valign="top"> --- 133,139 ---- Kalenteritie 23 B 4<br> 02200 Espoo, Finland<br> ! tel: +358-50-3725844<br> <a href="http://www.kk-software.fi">http://www.kk-software.fi</a><br> ! <!--a href="mailto:kaa...@ik...">kaa...@ik...</a--> ! <a href="http://sourceforge.net/sendmessage.php?touser=287304">email</a> </td> <td valign="top"> *************** *** 163,168 **** <td valign="top"><img src="pics/claude.jpg" width="100" height="114"> <img src="pics/canada.gif" width="64" height="34"> <br> ! Claude Duguay<br> <a href="http://www.arcessa.com/">Arcessa, Inc.</a><br> ! <a href="mailto:CD...@ar...">CD...@ar...</a><br> </td> <td valign="top"> --- 167,178 ---- <td valign="top"><img src="pics/claude.jpg" width="100" height="114"> <img src="pics/canada.gif" width="64" height="34"> <br> ! Claude Duguay<br> ! Arcessa, Inc.<br> ! 10210 NE Points Drive<br> ! Suite 310<br> ! Kirkland, WA 98033<br> ! <a href="http://www.arcessa.com/">http://www.arcessa.com/</a><br> ! <!--a href="mailto:CD...@ar...">CD...@ar...</a--> ! <a href="http://sourceforge.net/sendmessage.php?touser=350041">email</a> </td> <td valign="top"> *************** *** 188,193 **** +91-22-28290019<br> Extn. 1457 <br> ! <a href="http://www.orbitech.co.in">http://www.orbitech.co.in</a> <br> ! <a href="mailto:dha...@or...">dha...@or...</a><br> </p> </td> --- 198,204 ---- +91-22-28290019<br> Extn. 1457 <br> ! <a href="http://www.orbitech.co.in">http://www.orbitech.co.in</a> <br> ! <!--a href="mailto:dha...@or...">dha...@or...</a--> ! 
<a href="http://sourceforge.net/sendmessage.php?touser=539715">email</a> </p> </td> *************** *** 200,204 **** - thus making the parser usable across Windows and Linux - which have different conventions for end-of-line characters.</p> ! <p>Read Dhaval's article on <a href="../articles/quest.html">The Quest for HTMLParser</a>.</p></td> <td valign="top"><p>I've been passionate about computers from very early on. Started working<br> --- 211,215 ---- - thus making the parser usable across Windows and Linux - which have different conventions for end-of-line characters.</p> ! <p>Read Dhaval's article on <a href="articles/quest.html">The Quest for HTMLParser</a>.</p></td> <td valign="top"><p>I've been passionate about computers from very early on. Started working<br> *************** *** 234,239 **** <tr> <td valign="top"><img src="pics/france.gif" width="51" height="35"><br> ! Cédric Rosa<br> <a href="mailto:ced...@fr...%20">ced...@fr... ! </a><br> </td> <td valign="top">Cédric was one of the most prolific testers of the parser, coming --- 245,251 ---- <tr> <td valign="top"><img src="pics/france.gif" width="51" height="35"><br> ! Cédric Rosa<br> ! <!--a href="mailto:ced...@fr...">ced...@fr...</a--> ! <a href="http://sourceforge.net/sendmessage.php?touser=584072">email</a> </td> <td valign="top">Cédric was one of the most prolific testers of the parser, coming *************** *** 341,345 **** </tr> </table> ! <p>Thanks to Stephen Harrington, Domenico Lordi, Kamen, John Zook, Cedric Rosa, Cheng Jun, Mazlan Mat, Rob Shields, Wolfgang Germund, Raj Sharma, Robert Kausch, Gordon Deudney, Serge Kruppa, Roger Kjensrud, Rodney S Foley and Manpreet Singh --- 353,357 ---- </tr> </table> ! 
<p>Thanks to Stephen Harrington, Domenico Lordi, Kamen, John Zook, Cheng Jun, Mazlan Mat, Rob Shields, Wolfgang Germund, Raj Sharma, Robert Kausch, Gordon Deudney, Serge Kruppa, Roger Kjensrud, Rodney S Foley and Manpreet Singh Index: index.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/index.html,v retrieving revision 1.2 retrieving revision 1.3 diff -C2 -d -r1.2 -r1.3 *** index.html 15 Dec 2002 03:45:00 -0000 1.2 --- index.html 4 Jan 2004 03:23:08 -0000 1.3 *************** *** 1,10 **** <html> <head> ! <title>HTMLParser Home Page</title> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <META name="description" content="HTML Parser- A java-based open source html parser"> ! <META name="keywords" content="html, parser, html parser, htmlparser, open source parser, java parser, java html parser"> </head> ! <frameset cols="20%,80%" frameborder="NO" border="0" framespacing="0" rows="*"> <frame name="leftFrame" scrolling="NO" src="panel.html" frameborder="NO" noresize> <frame name="mainFrame" src="main.html" frameborder="NO"> --- 1,11 ---- <html> <head> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <META name="description" content="HTML Parser- A java-based open source html parser"> ! <META name="keywords" content="html, parser, html parser, htmlparser, open source parser, java parser, java html parser"> ! <title>HTMLParser Home Page</title> ! <link REL ="stylesheet" TYPE="text/css" HREF="javadoc/stylesheet.css" TITLE="Style"> </head> ! 
<frameset cols="15%,85%" frameborder="NO" border="0" framespacing="0" rows="*"> <frame name="leftFrame" scrolling="NO" src="panel.html" frameborder="NO" noresize> <frame name="mainFrame" src="main.html" frameborder="NO"> Index: mailinglists.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/mailinglists.html,v retrieving revision 1.1 retrieving revision 1.2 diff -C2 -d -r1.1 -r1.2 *** mailinglists.html 16 Apr 2002 06:28:42 -0000 1.1 --- mailinglists.html 4 Jan 2004 03:23:08 -0000 1.2 *************** *** 2,26 **** <html> <head> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <meta name="Author" content="Somik Raha"> ! <meta name="GENERATOR" content="Mozilla/4.61 [en] (WinNT; I) [Netscape]"> ! <meta name="KeyWords" content="mailing lists,htmlparser,java,user,developer, announce"> ! <title>HTML Parser Mailing Lists</title> </head> ! <body text="#000000" bgcolor="#FFFFFF" link="#3333FF" vlink="#FF6600" alink="#FFCC00"> ! <b><u>HTML Parser Mailing Lists</u></b> ! <p><a href="http://lists.sourceforge.net/lists/listinfo/htmlparser-announce">HTMLParser ! Announcement mailing list (very low traffic)</a> ! <br>Join this list if you are interested in new releases of HTML Parser. ! Notifications of releases will be put on this list. ! <p><a href="http://lists.sourceforge.net/lists/listinfo/htmlparser-developer">HTMLParser ! Developer mailing list</a>. ! <br>Join this list ONLY if you plan to have a developer discussion about ! the htmlparser library. This list is intended for developers of HTMLParser ! only. ! <p><a href="http://lists.sourceforge.net/lists/listinfo/htmlparser-user">HTMLParser ! Users mailing list</a>. ! <br>Join this list if you want to use the HTMLParser library and need some ! help to get started. Feel free to post your questions here. </body> </html> --- 2,35 ---- <html> <head> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! 
<meta name="Author" content="Somik Raha"> ! <meta name="GENERATOR" content="Mozilla/4.61 [en] (WinNT; I) [Netscape]"> ! <meta name="KeyWords" content="mailing lists,htmlparser,java,user,developer,announce"> ! <title>HTML Parser Mailing Lists</title> ! <link REL ="stylesheet" TYPE="text/css" HREF="javadoc/stylesheet.css" TITLE="Style"> </head> ! <body> ! <h2>HTML Parser Mailing Lists</h2> ! <p><a href="http://lists.sourceforge.net/lists/listinfo/htmlparser-announce"> ! HTMLParser Announcement mailing list (very low traffic)</a><br> ! Join this list if you are interested in new releases of HTML Parser. ! Notifications of releases will be put on this list.<br> ! Monitoring of intermediate releases is also possible by clicking on the ! <img src="http://images.sourceforge.net/images/ic/mail16d.png" alt="envelope icon"> ! in the <em>Notes / Monitor</em> column of the <em>Latest File Releases</em> ! list on the <a href="http://sourceforge.net/projects/htmlparser" target="_parent">project page</a>. ! <p><a href="http://lists.sourceforge.net/lists/listinfo/htmlparser-user"> ! HTMLParser Users mailing list</a><br> ! Join this list if you are using the HTMLParser library and need some ! help to get started or solve a problem. Feel free to post your questions here. ! <p><a href="http://lists.sourceforge.net/lists/listinfo/htmlparser-developer"> ! HTMLParser Developer mailing list</a><br> ! Join this list ONLY if you wish to monitor developer discussion about ! the htmlparser library. This list is intended for developer collaboration. ! <p><a href="http://lists.sourceforge.net/lists/listinfo/htmlparser-cvs"> ! HTMLParser cvs commit mailing list</a><br> ! A syncmail script issues messages to this list whenever a CVS commit is ! performed to the /cvsroot/htmlparser repository. ! Subscribe to this list only if you want to be notified of code drops as they happen. 
</body> </html> Index: main.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/main.html,v retrieving revision 1.7 retrieving revision 1.8 diff -C2 -d -r1.7 -r1.8 *** main.html 25 May 2003 22:19:44 -0000 1.7 --- main.html 4 Jan 2004 03:23:08 -0000 1.8 *************** *** 2,24 **** <html> <head> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <meta name="Author" content="Derrick Oswald"> ! <title>HTMLParser Main</title> </head> <body> ! <h1>HTMLParser v 1.3</span></h1> ! ! <div>Welcome to the homepage of HTMLParser - a super-fast real-time ! parser for real-world HTML. What has attracted most users to HTMLParser has ! been its simplicity in design, speed and ability to handle streaming real-world ! html. ! <p>The production release of HTMLParser 1.3 is available! ! You can download it from the <a href="http://sourceforge.net/project/showfiles.php?group_id=24399&release_id=161563">download page</a>. ! <p>Before you bother downloading the parser, you would probably want to check ! our "simple design" claim. Check <a href="samples/index.html">sample ! programs</a> to see how simple it is to parse HTML using HTMLParser. ! <p> <a href="http://sourceforge.net/projects/htmlparser"> ! <img src="http://sourceforge.net/sflogo.php?group_id=24399&type=1" width="88" height="31" border="0" alt="SourceForge.net Logo"></a> ! </div> </body> </html> --- 2,119 ---- <html> <head> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <meta name="Author" content="Derrick Oswald"> ! <title>HTMLParser Main</title> ! <link REL ="stylesheet" TYPE="text/css" HREF="javadoc/stylesheet.css" TITLE="Style"> </head> <body> ! <h1>HTMLParser</h1> ! Welcome to the homepage of HTMLParser - a super-fast real-time ! parser for real-world HTML. What has attracted most developers to HTMLParser has ! been its simplicity in design, speed and ability to handle streaming real-world ! html. ! 
<p>The two fundamental use-cases that are handled by the parser are ! <a href="#extraction">extraction</a> and <a href="#transformation">transformation</a> ! (the syntheses use-case, where HTML pages are created from scratch, is better ! handled by other tools closer to the source of data). While prior versions ! concentrated on data extraction from web pages, Version 1.4 of the ! HTMLParser has substantial improvements in the area of transforming web ! pages, with simplified tag creation and editing, and verbatim toHtml() method ! output. ! <p>In general, to use the HTMLParser you will need to be able to write code in ! the Java programming language. Although some example programs are provided ! that may be useful as they stand, it's more than likely you will need (or ! want) to create your own programs or modify the ones provided to match your ! intended application. ! <p>To use the library, you will need to add either the htmllexer.jar or ! htmlparser.jar to your classpath when compiling and running. The ! htmllexer.jar provides low level access to generic string, remark and tag nodes on ! the page in a linear, flat, sequential manner. The htmlparser.jar, which ! includes the classes found in htmllexer.jar, provides access to a page as a ! sequence of nested differentiated tags containing string, remark and other ! tag nodes. So where the output from calls to the lexer ! <a href="javadoc/org/htmlparser/lexer/Lexer.html#nextNode()">nextNode()<a> ! method might be: ! <pre> ! <html> ! <head> ! <title> ! "Welcome" ! </title> ! </head> ! <body> ! etc... ! </pre> ! The output from the parser <a ! href="javadoc/org/htmlparser/util/NodeIterator.html">NodeIterator</a> would ! nest the tags as children of the <html>, <head> and other nodes ! (here represented by indentation): ! <pre> ! <html> ! <head> ! <title> ! "Welcome" ! </title> ! </head> ! <body> ! etc... ! </pre> ! The parser attempts to balance opening tags with ending tags to present the ! 
structure of the page, while the lexer simply spits out nodes. If your ! application requires only modest structural knowledge of the page, and is ! primarily concerned with individual, isolated nodes, you should consider ! using the lightweight lexer. But if your application requires knowledge of ! the nested structure of the page, for example processing tables, you will ! probably want to use the full parser. ! <h2><a name=extraction>Extraction</a></h2> ! Extraction encompasses all the information retrieval programs that are not ! meant to preserve the source page. This covers uses like: ! <ul> ! <li>text extraction, for use as input for text search engine databases for example</li> ! <li>link extraction, for crawling through web pages or harvesting email ! addresses</li> ! <li>screen scraping, for programmatic data input from web pages</li> ! <li>resource extraction, collecting images or sound</li> ! <li>a browser front end, the preliminary stage of page display</li> ! <li>link checking, ensuring links are valid</li> ! <li>site monitoring, checking for page differences beyond simplistic diffs</li> ! </ul> ! There are several facilities in the HTMLParser codebase to help with ! extraction, including ! <a href="javadoc/org/htmlparser/filters/package-summary.html">filters</a>, ! <a href="javadoc/org/htmlparser/visitors/package-summary.html">visitors</a> and ! <a href="javadoc/org/htmlparser/beans/package-summary.html">JavaBeans</a>. ! <h2><a name=transformation>Transformation</a></h2> ! Transformation includes all processing where the input <em>and</em> the output ! are HTML pages. Some examples are: ! <ul> ! <li>URL rewriting, modifying some or all links on a page</li> ! <li>site capture, moving content from the web to local disk</li> ! <li>censorship, removing offending words and phrases from pages</li> ! <li>HTML cleanup, correcting erroneous pages</li> ! <li>ad removal, excising URLs referencing advertising</li> ! 
<li>conversion to XML, moving existing web pages to XML</li> ! </ul> ! During or after reading in a page, operations on the nodes can ! accomplish many transformation tasks "in place", which can then be output ! with the <a href="javadoc/org/htmlparser/Node.html#toHtml()">toHtml()</a> method. ! Depending on the purpose of your application, you will probably want to look ! into node decorators, ! <a href="javadoc/org/htmlparser/visitors/package-summary.html">visitors</a>, or ! <a href="javadoc/org/htmlparser/tags/package-summary.html">custom tags</a> ! in conjunction with the ! <a href="javadoc/org/htmlparser/PrototypicalNodeFactory.html">PrototypicalNodeFactory</a>. ! <p>The HTML Parser is an open source library released under ! <a href="http://www.opensource.org/licenses/lgpl-license.html">GNU Lesser General Public ! License</a>, which basically says you are free to use the library "as is" in ! other (even proprietary) products, as long as due credit is given to the authors ! and the source code for the HTMLParser is included or available with the other product. ! For modified or embedded use, please consult the ! <a href="http://www.opensource.org/licenses/lgpl-license.html">LGPL license</a>. ! <div align="right"> ! <a href="http://sourceforge.net/projects/htmlparser" target="_parent"> ! <img src="http://sourceforge.net/sflogo.php?group_id=24399&type=1" width="88" height="31" border="0" alt="SourceForge.net"> ! </a> ! </div> </body> </html> Index: panel.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/panel.html,v retrieving revision 1.6 retrieving revision 1.7 diff -C2 -d -r1.6 -r1.7 *** panel.html 31 Dec 2003 02:50:49 -0000 1.6 --- panel.html 4 Jan 2004 03:23:08 -0000 1.7 *************** *** 2,42 **** <html> <head> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <meta name="Author" content="Somik Raha & Abhishek Srivastava"> ! 
<meta name="KEYWORDS" content="java,jini,calcutta, java users group,design patterns"> ! <meta name="GENERATOR" content="Mozilla/4.61 [en] (WinNT; I) [Netscape]"> ! <title>NAVIGATION PAGE</title> ! <style type=text/css>.abhi { FONT-FAMILY: "Arial Black", arial; FONT-SIZE: 8pt; FONT-WEIGHT: normal; TEXT-DECORATION: none}</style> </head> <body bgcolor="#FFFFFF" background="background.gif"> <img SRC="htmlparserlogo.jpg" BORDER=0 height=40 width=100> ! <li> ! <a href="main.html" target="mainFrame">Home Page</a></li> ! ! <li> <a href="http://sourceforge.net/project/showfiles.php?group_id=24399&release_id=129477" target="mainFrame">Download</a></li> ! <li> <a href="samples/index.html" target="mainFrame">Sample Programs</a></li> ! ! <li> <a href="docs/index.html" target="_parent">Documentation</a></li> <li> <a href="articles/index.html" target="mainFrame">Articles</a></li> ! <li> <a href="mailinglists.html" target="mainFrame">Mailing Lists</a> </li> - <li> <a href="bug.html" target="mainFrame">Report Bugs</a></li> - <li> <a href="support.html" target="mainFrame">Request Support</a></li> ! ! <li> ! <a href="http://sourceforge.net/cvs/?group_id=24399" target="mainFrame">CVS ! Repository</a></li> ! ! <li> ! <a href="http://sourceforge.net/projects/htmlparser" target="mainFrame">Project ! Page</a></li> ! ! <li> ! <a href="contributors.html" target="mainFrame">Contributors</a></li> ! ! <li> <a href="design/joinus.html" target="mainFrame">Join this Project</a></li> ! </body> </html> --- 2,41 ---- <html> <head> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <meta name="Author" content="Somik Raha & Abhishek Srivastava"> ! <meta name="KEYWORDS" content="java,html,design patterns"> ! <title>NAVIGATION PAGE</title> ! <link REL ="stylesheet" TYPE="text/css" HREF="javadoc/stylesheet.css" TITLE="Style"> ! <!--style type=text/css>.abhi { FONT-FAMILY: "Arial Black", arial; FONT-SIZE: 8pt; FONT-WEIGHT: normal; TEXT-DECORATION: ! 
none}</style--> </head> <body bgcolor="#FFFFFF" background="background.gif"> <img SRC="htmlparserlogo.jpg" BORDER=0 height=40 width=100> ! <p><strong>About HTMLParser</strong></p> ! <ul> ! <li> <a href="main.html" target="mainFrame">Welcome</a></li> ! <li> <a href="http://sourceforge.net/projects/htmlparser" target="_parent">Project Page</a></li> ! <li> <a href="contributors.html" target="mainFrame">Contributors</a></li> ! <li> <a href="joinus.html" target="mainFrame">Join this Project</a></li> ! </ul> ! <p><strong>Downloads</strong></p> ! <ul> ! <li> <a href="http://sourceforge.net/project/showfiles.php?group_id=24399&package_id=47712" target="_parent">Version 1.4</a></li> ! <li> <a href="http://sourceforge.net/project/showfiles.php?group_id=24399&package_id=17243" target="_parent">Old Releases</a></li> ! <li> <a href="http://cvs.sourceforge.net/viewcvs.py/htmlparser/htmlparser/" target="_parent">CVS Repository</a></li> ! </ul> ! <p><strong>Documentation</strong></p> ! <ul> ! <li> <a href="javadoc/index.html" target="_parent">JavaDocs</a></li> ! <li> <a href="samples.html" target="mainFrame">Sample Programs</a></li> ! <li> <a href="wiki/index.html" target="_parent">Wiki</a></li> <li> <a href="articles/index.html" target="mainFrame">Articles</a></li> ! </ul> ! <p><strong>Support</strong></p> ! <ul> <li> <a href="mailinglists.html" target="mainFrame">Mailing Lists</a> </li> <li> <a href="bug.html" target="mainFrame">Report Bugs</a></li> <li> <a href="support.html" target="mainFrame">Request Support</a></li> ! </ul> </body> </html> Index: support.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/support.html,v retrieving revision 1.1 retrieving revision 1.2 diff -C2 -d -r1.1 -r1.2 *** support.html 15 Dec 2002 03:45:00 -0000 1.1 --- support.html 4 Jan 2004 03:23:08 -0000 1.2 *************** *** 2,30 **** <html> <head> ! <title>Bug Reports</title> ! 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> </head> <body> <h2>Support Request</h2> ! <p>You can submit support requests here. Before you proceed, please note that ! you <strong>must</strong> <a href="http://sourceforge.net/account/login.php">sign ! in at sourceforge</a>, for only then will our responses reach you. Putting in ! an anonymous bug report will provide us with no way of reaching you. If you ! do not have a sourceforge login, then please consider mentioning your email ! address in the bug report. Anonymous support requests just dont make sense, ! so we will not be replying to anonymous reports.<br> ! <br> ! Please note that this is an open source project, and most of us are hard pressed ! for time. We are not obliged to help you but we do so anyway. You can help us ! by first researching your problem, and then requesting for support when you ! are really stuck. You should have gone through the <a href="sample/index.html">sample ! programs</a>, and <a href="design/index.html" target="_parent">documentation</a> ! before you submit your request. It might also be much faster to get help from ! the htmlparser user community, by signing up on the <a href="http://lists.sourceforge.net/lists/listinfo/htmlparser-user">HTMLParser ! User mailing list</a>.<br> ! <br> ! Once you are ready, <a href="http://sourceforge.net/tracker/?func=add&group_id=24399&atid=381400">click ! here to submit your report</a>.</p> </body> </html> --- 2,31 ---- <html> <head> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <title>Support Request</title> ! <link REL ="stylesheet" TYPE="text/css" HREF="javadoc/stylesheet.css" TITLE="Style"> </head> <body> <h2>Support Request</h2> ! <p>You can submit support requests here. Before you proceed, please note that ! you <strong>must</strong> <a href="http://sourceforge.net/account/login.php">sign ! in at sourceforge</a>, for only then will our responses reach you. ! 
If you do not have a sourceforge login, you can get one ! <a href="http://sourceforge.net/account/register.php">here</a>.</p> ! <p>Please note that this is an open source project, and most of us are hard pressed ! for time. We are not obliged to help you but we do so anyway. You can help us ! by first researching your problem, and then requesting for support when you ! are really stuck. You should have consulted the ! <a href="faq.html">frequently asked questions</a>, ! <a href="javadoc/index.html" target="_parent">JavaDocs</a>, and ! <a href="samples/index.html">sample programs</a> ! before you submit your request. It might also be much faster to get help from ! the htmlparser user community, by signing up on the ! <a href="http://lists.sourceforge.net/lists/listinfo/htmlparser-user"> ! HTMLParser User mailing list</a>.<br> ! <p>Once you are ready, ! <a href="http://sourceforge.net/tracker/?func=add&group_id=24399&atid=381400" target="_parent"> ! click here to submit your report</a>.</p> </body> </html> |
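Editorial sketch: the main.html text added in the commit above contrasts the lexer's flat node sequence with the parser's nested view, in which opening tags are balanced against ending tags. The following is a rough, library-independent illustration of that balancing idea only. It does not use or reproduce HTMLParser's own classes; `NestingSketch`, `TreeNode`, `nest` and `render` are hypothetical names, and the tokens are assumed to be bare tag names without attributes.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.List;

public class NestingSketch {

    /** A node in the nested view: a label plus its child nodes. */
    static final class TreeNode {
        final String label;
        final List<TreeNode> children = new ArrayList<>();

        TreeNode(String label) {
            this.label = label;
        }
    }

    /**
     * Nest a flat token sequence (as a lexer-style scan might yield it)
     * by matching each end tag to the nearest open tag of the same name.
     * End tags are consumed as close markers and not stored in the tree.
     */
    static TreeNode nest(List<String> tokens) {
        TreeNode root = new TreeNode("#root");
        Deque<TreeNode> open = new ArrayDeque<>();
        open.push(root);
        for (String token : tokens) {
            if (token.startsWith("</")) {
                // "</title>" closes the nearest open "<title>"
                String opener = "<" + token.substring(2);
                boolean isOpen = false;
                for (TreeNode candidate : open) {
                    if (candidate.label.equals(opener)) {
                        isOpen = true;
                        break;
                    }
                }
                if (isOpen) {
                    // implicitly close anything opened after the matching tag
                    while (!open.peek().label.equals(opener)) {
                        open.pop();
                    }
                    open.pop();
                }
                // a stray end tag with no matching open tag is ignored
            } else if (token.startsWith("<")) {
                TreeNode tag = new TreeNode(token);
                open.peek().children.add(tag);
                open.push(tag);
            } else {
                // string node: attach to the innermost open tag
                open.peek().children.add(new TreeNode(token));
            }
        }
        return root;
    }

    /** Append the children of node to out, indenting two spaces per level. */
    static void render(TreeNode node, int depth, StringBuilder out) {
        for (TreeNode child : node.children) {
            for (int i = 0; i < depth; i++) {
                out.append("  ");
            }
            out.append(child.label).append('\n');
            render(child, depth + 1, out);
        }
    }

    public static void main(String[] args) {
        List<String> flat = Arrays.asList(
            "<html>", "<head>", "<title>", "\"Welcome\"",
            "</title>", "</head>", "<body>");
        StringBuilder out = new StringBuilder();
        render(nest(flat), 0, out);
        System.out.print(out);
    }
}
```

Run on the token sequence from the main.html example, this prints the tags as an indented outline, with `<head>` and `<body>` as siblings under `<html>`. Real pages need more than this (attributes, tags that never take an end tag, tags that may not nest), which is the kind of work the full parser is described as doing.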
From: <der...@us...> - 2004-01-04 03:23:12
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/beans
In directory sc8-pr-cvs1:/tmp/cvs-serv11427/htmlparser/src/org/htmlparser/beans

Modified Files:
LinkBean.java

Log Message:
Web site revamp, phase 1.
Main and first level pages are refurbished.
The wiki is still to do.
Fixed bug #865279 Documentation
The samples directory is now orphaned and no longer shipped.

Index: LinkBean.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/beans/LinkBean.java,v
retrieving revision 1.26
retrieving revision 1.27
diff -C2 -d -r1.26 -r1.27
*** LinkBean.java 2 Jan 2004 16:24:53 -0000 1.26
--- LinkBean.java 4 Jan 2004 03:23:09 -0000 1.27
***************
*** 42,48 ****
  /**
! * Extract strings from a URL.
! * @author Derrick Oswald
! * Created on December 23, 2002, 5:01 PM
  */
  public class LinkBean extends Object implements Serializable
--- 42,46 ----
  /**
! * Extract links from a URL.
  */
  public class LinkBean extends Object implements Serializable
|
From: <der...@us...> - 2004-01-04 03:23:12
|
Update of /cvsroot/htmlparser/htmlparser/docs/samples In directory sc8-pr-cvs1:/tmp/cvs-serv11427/htmlparser/docs/samples Modified Files: index.html Log Message: Web site revamp, phase 1. Main and first level pages are refurbished. The wiki is still to do. Fixed bug #865279 Documentation The samples directory is now orphaned and no longer shipped. Index: index.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/samples/index.html,v retrieving revision 1.5 retrieving revision 1.6 diff -C2 -d -r1.5 -r1.6 *** index.html 2 Sep 2003 00:41:56 -0000 1.5 --- index.html 4 Jan 2004 03:23:08 -0000 1.6 *************** *** 2,28 **** <html> <head> ! <title>Sample Programs</title> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> </head> <body> ! <h3><font size="4"><strong>Sample Programs </strong></font></h3> ! <strong>WARNING: These examples are outdated. Except for the embedded links article, ! they need to be reworked to the most recent version of HTML Parser. ! <p>Please see <a ! href="http://htmlparser.sourceforge.net/docs/index.php/SamplePrograms">WikiPages ! Sample Programs</a> for more recent versions.</strong> ! <p>We provide below some commonly-used sample programs that were created using ! HTMLParser. Going through these programs will give you an idea of the design ! of the parser, and its expected usage.</p> ! <p><a href="links.html">Extracting Links / Mail addresses from a Web Page<br> ! </a><a href="linksEmbedded.html">Extracting Embedded Links</a><br> ! <a href="text.html">Extracting Text Content from a Web Page</a><br> ! <a href="imageslinks.html">Extracting Images Within Links</a><br> ! <a href="exception.html">Exception Handling in the parser</a><br> ! <a href="crawler.html">Web Crawler</a><br> ! <a href="ripper.html">Web Ripper (Modifying links and image locations)</a><br> ! <a href="feedback.html">Feedback Mechanism</a><br> ! 
<a href="custom.html">Supporting Custom Tags (extending the parser)</a></p> </body> </html> --- 2,106 ---- <html> <head> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <title>Sample Programs</title> ! <link REL ="stylesheet" TYPE="text/css" HREF="../javadoc/stylesheet.css" TITLE="Style"> </head> <body> ! <h2>Sample Programs</h2> ! <p>The example programs included with the HTML Parser distribution are listed ! below, with some details.</p> ! <p><strong>Note:</strong> On unix systems if you used the Java jar command or ! some older unzip utility to extract the distribution zip file, the ! executable flag will not have been preserved on the files in the bin ! directory. You can fix this by issuing the following command: ! <pre> ! <code>chmod u+x bin/*</code> ! </pre> ! <p> ! <table width="94%" border="0"> ! <tr> ! <td valign="top"> ! <strong>Parser</strong><br> ! </td> ! <td> ! <i>Parse a web page and print the tags in a simple loop.</i><br> ! <a href="../javadoc/org/htmlparser/Parser.html#main(java.lang.String[])" target="_parent">org.htmlparser.Parser.main(String[] args)</a> ! <pre> ! <code>bin/parser http://website_url [tag_name]</code> ! where tag_name is an optional tag name to be used as a filter, i.e. ! A - Show only the link tags extracted from the document ! IMG - Show only the image tags extracted from the document ! TITLE - Extract the title from the document ! NOTE: this is also the default program for the htmlparser.jar, so the above could be: ! <code>java -jar lib/htmlparser.jar http://website_url [tag_name]</code> ! </pre> ! </td> ! </tr> ! <tr> ! <td valign="top"> ! <strong>Link Extractor</strong><br> ! </td> ! <td> ! <i>Extract links/mail addresses from a web page.</i><br> ! <a href="../javadoc/org/htmlparser/parserapplications/LinkExtractor.html" target="_parent">org.htmlparser.parserapplications.LinkExtractor</a> ! <pre> ! <code>bin/linkextractor http://website_url [-maillinks]</code> ! 
the optional -maillinks argument causes mailto: links to be printed ! </pre> ! </td> ! </tr> ! <tr> ! <td valign="top"> ! <strong>String Extractor</strong><br> ! </td> ! <td> ! <i>Extract text from a web page.</i><br> ! <a href="../javadoc/org/htmlparser/parserapplications/LinkExtractor.html" target="_parent">org.htmlparser.parserapplications.StringExtractor</a> ! <pre> ! <code>bin/stringextractor http://website_url [-links]</code> ! the optional -links argument causes hyperlinks to be shown within the text ! </pre> ! </td> ! </tr> ! <tr> ! <td valign="top"> ! <strong>Site Capturer</strong><br> ! </td> ! <td> ! <i>Save a web site locally.</i><br> ! <a href="../javadoc/org/htmlparser/parserapplications/SiteCapturer.html" target="_parent">org.htmlparser.parserapplications.SiteCapturer</a> ! <pre> ! <code>bin/sitecapturer http://source_website /target_directory/ [true|false]</code> ! the optional boolean argument determines whether resources such as images, ! audio and video are to be captured ! </pre> ! </td> ! </tr> ! <tr> ! <td valign="top"> ! <strong>Thumbelina</strong><br> ! </td> ! <td> ! <i>View images behind thumbnails.</i><br> ! <a href="../javadoc/org/htmlparser/lexerapplications/thumbelina/package-summary.html" target="_parent">org.htmlparser.lexerapplications.thumbelina.Thumbelina</a> ! <pre> ! <code>bin/thumbelina [http://starting_website]</code> ! </pre> ! </td> ! </tr> ! <tr> ! <td valign="top"> ! <strong>BeanyBaby</strong><br> ! </td> ! <td> ! <i>Parser Java Bean demo.</i><br> ! <a href="../javadoc/org/htmlparser/beans/BeanyBaby.html" target="_parent">org.htmlparser.beans.BeanyBaby</a> ! <pre> ! <code>bin/beanybaby [http://starting_website]</code> ! </pre> ! </td> ! </tr> ! </table> </body> </html> |
From: <der...@us...> - 2004-01-04 03:23:12
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser
In directory sc8-pr-cvs1:/tmp/cvs-serv11427/htmlparser/src/org/htmlparser

Modified Files:
Parser.java

Log Message:
Web site revamp, phase 1.
Main and first level pages are refurbished.
The wiki is still to do.
Fixed bug #865279 Documentation
The samples directory is now orphaned and no longer shipped.

Index: Parser.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/Parser.java,v
retrieving revision 1.80
retrieving revision 1.81
diff -C2 -d -r1.80 -r1.81
*** Parser.java 2 Jan 2004 19:32:04 -0000 1.80
--- Parser.java 4 Jan 2004 03:23:09 -0000 1.81
***************
*** 94,98 ****
  */
  public final static String
! VERSION_DATE = "Jan 02, 2004"
  ;
--- 94,98 ----
  */
  public final static String
! VERSION_DATE = "Dec 07, 2003"
  ;
|
From: <der...@us...> - 2004-01-04 03:23:11
|
Update of /cvsroot/htmlparser/htmlparser/docs/articles In directory sc8-pr-cvs1:/tmp/cvs-serv11427/htmlparser/docs/articles Modified Files: index.html quest.html Log Message: Web site revamp, phase 1. Main and first level pages are refurbished. The wiki is still to do. Fixed bug #865279 Documentation The samples directory is now orphaned and no longer shipped. Index: index.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/articles/index.html,v retrieving revision 1.1 retrieving revision 1.2 diff -C2 -d -r1.1 -r1.2 *** index.html 15 Dec 2002 03:45:00 -0000 1.1 --- index.html 4 Jan 2004 03:23:08 -0000 1.2 *************** *** 1,13 **** <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> <html> ! <head> ! <title>HTMLParser Articles</title> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> </head> <body> <h2>Articles </h2> ! <p><a href="quest.html">The Quest for HTMLParser</a> - <a href="../design/contributors.html#dhaval"><em>Dhaval ! Udani</em></a></p> </body> </html> --- 1,13 ---- <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> <html> ! <head> ! <title>HTMLParser Articles</title> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <link REL ="stylesheet" TYPE="text/css" HREF="../javadoc/stylesheet.css" TITLE="Style"> </head> <body> <h2>Articles </h2> ! <p><a href="quest.html">The Quest for HTMLParser</a> - <a href="../contributors.html#dhaval"><em>Dhaval Udani</em></a></p> </body> </html> Index: quest.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/articles/quest.html,v retrieving revision 1.1 retrieving revision 1.2 diff -C2 -d -r1.1 -r1.2 *** quest.html 15 Dec 2002 03:45:00 -0000 1.1 --- quest.html 4 Jan 2004 03:23:08 -0000 1.2 *************** *** 2,12 **** <html> <head> ! <title>The Quest for HTMLParser</title> ! 
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> </head> - <body> ! <h2><strong>The Quest for HTMLParser</strong> </h2> ! <p>by <a href="../design/contributors.html#dhaval">Dhaval Udani</a><br> </p> <table width="75%" border="0"> --- 2,12 ---- <html> <head> ! <title>The Quest for HTMLParser</title> ! <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> ! <link REL ="stylesheet" TYPE="text/css" HREF="../javadoc/stylesheet.css" TITLE="Style"> </head> <body> ! <h2><strong>The Quest for HTMLParser</strong></h2> ! <p>by <a href="../contributors.html#dhaval">Dhaval Udani</a><br> </p> <table width="75%" border="0"> *************** *** 50,54 **** by me would be fixed and a release would be available soon.<br> <br> ! <a href="../design/contributors.html#dhaval"><em>Dhaval Udani</em></a><em> is a Senior Analyst at Orbitech Solutions Ltd. and a developer on the HTMLParser project. </em></p></td> --- 50,54 ---- by me would be fixed and a release would be available soon.<br> <br> ! <a href="../contributors.html#dhaval"><em>Dhaval Udani</em></a><em> is a Senior Analyst at Orbitech Solutions Ltd. and a developer on the HTMLParser project. </em></p></td> |
From: <der...@us...> - 2004-01-04 03:23:11
Update of /cvsroot/htmlparser/htmlparser In directory sc8-pr-cvs1:/tmp/cvs-serv11427/htmlparser Modified Files: build.xml Log Message: Web site revamp, phase 1. Main and first level pages are refurbished. The wiki is still to do. Fixed bug #865279 Documentation The samples directory is now orphaned and no longer shipped. Index: build.xml =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/build.xml,v retrieving revision 1.56 retrieving revision 1.57 diff -C2 -d -r1.56 -r1.57 *** build.xml 2 Jan 2004 19:32:03 -0000 1.56 --- build.xml 4 Jan 2004 03:23:08 -0000 1.57 *************** *** 6,11 **** Release Procedure - cd htmlparser ! - delete the local Wiki pages with 'rm /home/derrick/htmlparser_cvs/htmlparser/docs/docs/*' ! and 'rm /home/derrick/htmlparser_cvs/htmlparser/docs/docs/images/*', of course any one else would have to adjust this and also the hard-coded path in WikiCapturer --- 6,11 ---- Release Procedure - cd htmlparser ! - delete the local Wiki pages with 'rm /home/derrick/htmlparser_cvs/htmlparser/docs/wiki/*' ! and 'rm /home/derrick/htmlparser_cvs/htmlparser/docs/wiki/images/*', of course any one else would have to adjust this and also the hard-coded path in WikiCapturer *************** *** 23,28 **** - 'ant versionSource' updates the version in Parser.java and release.txt - perform a CVS update on htmlparser to identify new and changed files ! - commit changed files (i.e. Parser.java, release.txt, docs/changes, docs/docs ! and docs/docs/images) to the head revision using a reason of the form: Update version to 1.4-20040104. - use CVS to tag the current head revisions with a name like v1_4_20040104 --- 23,28 ---- - 'ant versionSource' updates the version in Parser.java and release.txt - perform a CVS update on htmlparser to identify new and changed files ! - commit changed files (i.e. Parser.java, release.txt, docs/changes, docs/wiki ! 
and docs/wiki/images) to the head revision using a reason of the form: Update version to 1.4-20040104. - use CVS to tag the current head revisions with a name like v1_4_20040104 *************** *** 354,367 **** </target> ! <target name="Release" depends="jar,thumbelina,javadoc" description="prepare the release files"> </target> <!-- Package the release --> ! <target name="Package" depends="Release,sources" description="glom the release and source files into the distribution zip file"> <mkdir dir="${distribution}"/> <zip zipfile="${distribution}/htmlparser${versionTag}.zip"> ! <zipfileset dir="${bin}" prefix="htmlparser${versionQualifier}/${bin}"/> ! <zipfileset dir="${docs}" prefix="htmlparser${versionQualifier}/${docs}"/> <zipfileset dir="${lib}" prefix="htmlparser${versionQualifier}/${lib}"/> <zipfileset dir="." prefix="htmlparser${versionQualifier}/" includes="src.zip"/> --- 354,369 ---- </target> ! <target name="release" depends="jar,thumbelina,javadoc" description="prepare the release files"> </target> <!-- Package the release --> ! <target name="package" depends="release,sources" description="glom the release and source files into the distribution zip file"> <mkdir dir="${distribution}"/> <zip zipfile="${distribution}/htmlparser${versionTag}.zip"> ! <zipfileset dir="${bin}" prefix="htmlparser${versionQualifier}/${bin}" includes="*.bat"/> ! <zipfileset dir="${bin}" prefix="htmlparser${versionQualifier}/${bin}" includes="*" excludes="*.bat" filemode="755"/> ! <zipfileset dir="${docs}" prefix="htmlparser${versionQualifier}/${docs}" excludes="docs/**,samples/**"/> ! <zipfileset dir="${docs}/docs" prefix="htmlparser${versionQualifier}/${docs}/wiki"/> <zipfileset dir="${lib}" prefix="htmlparser${versionQualifier}/${lib}"/> <zipfileset dir="." prefix="htmlparser${versionQualifier}/" includes="src.zip"/> *************** *** 391,395 **** <!-- Perform the htmlparser integration --> ! 
<target name="htmlparser" depends="Package,clean" description="same as Package plus cleanup"> </target> --- 393,397 ---- <!-- Perform the htmlparser integration --> ! <target name="htmlparser" depends="package,clean" description="same as package plus cleanup"> </target> |
From: <der...@us...> - 2004-01-04 03:23:11
Update of /cvsroot/htmlparser/WikiCapturer/src/org/htmlparser/wikicapturer In directory sc8-pr-cvs1:/tmp/cvs-serv11427/WikiCapturer/src/org/htmlparser/wikicapturer Modified Files: CaptureWiki.java Log Message: Web site revamp, phase 1. Main and first level pages are refurbished. The wiki is still to do. Fixed bug #865279 Documentation The samples directory is now orphaned and no longer shipped. Index: CaptureWiki.java =================================================================== RCS file: /cvsroot/htmlparser/WikiCapturer/src/org/htmlparser/wikicapturer/CaptureWiki.java,v retrieving revision 1.3 retrieving revision 1.4 diff -C2 -d -r1.3 -r1.4 *** CaptureWiki.java 24 Aug 2003 18:44:10 -0000 1.3 --- CaptureWiki.java 4 Jan 2004 03:23:08 -0000 1.4 *************** *** 27,34 **** public static void main(String[] args) throws Exception { CaptureWiki captureWiki = ! new CaptureWiki("http://htmlparser.sourceforge.net/docs/"); File file = new File ("./"); System.out.println (file.getAbsolutePath ()); ! captureWiki.captureTo("/home/derrick/htmlparser_cvs/htmlparser/docs/docs/"); } --- 27,34 ---- public static void main(String[] args) throws Exception { CaptureWiki captureWiki = ! new CaptureWiki("http://htmlparser.sourceforge.net/wiki/"); File file = new File ("./"); System.out.println (file.getAbsolutePath ()); ! captureWiki.captureTo("/home/derrick/htmlparser_cvs/htmlparser/docs/wiki/"); } |
From: <der...@us...> - 2004-01-02 19:32:08
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser In directory sc8-pr-cvs1:/tmp/cvs-serv4635/src/org/htmlparser Modified Files: Parser.java Log Message: Get keyword substitution working. In the future, could developers ensure source files are initially checked in with keyword substitution (-kkv) turned on? Index: Parser.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/Parser.java,v retrieving revision 1.79 retrieving revision 1.80 diff -C2 -d -r1.79 -r1.80
From: <der...@us...> - 2004-01-02 19:32:08
Update of /cvsroot/htmlparser/htmlparser In directory sc8-pr-cvs1:/tmp/cvs-serv4635 Modified Files: build.xml Log Message: Get keyword substitution working. In the future, could developers ensure source files are initially checked in with keyword substitution (-kkv) turned on? Index: build.xml =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/build.xml,v retrieving revision 1.55 retrieving revision 1.56 diff -C2 -d -r1.55 -r1.56 *** build.xml 2 Jan 2004 16:24:52 -0000 1.55 --- build.xml 2 Jan 2004 19:32:03 -0000 1.56 *************** *** 139,156 **** <echo message="previous version date = ${VERSION_DATE}"/> - <!-- set property previousVersion to current version --> - <loadfile srcFile="${src}/org/htmlparser/Parser.java" property="previousTag"> - <filterchain> - <headfilter lines="1"/> - <filterreader classname="org.apache.tools.ant.filters.StripLineBreaks"> - <param name="linebreaks" value="/Piy - A java-based parser for HTML"/> - </filterreader> - <striplinebreaks/> - </filterchain> - </loadfile> - <echo message="previous version tag = ${previousTag}"/> </target> <target name="versionSource" depends="init" description="update the version"> <echo message="Replacing version VERSION_NUMBER = ${VERSION_NUMBER} with VERSION_NUMBER = ${versionNumber} in ${src}/org/htmlparser/Parser.java"/> <replace file="${src}/org/htmlparser/Parser.java" token="VERSION_NUMBER = ${VERSION_NUMBER}" value="VERSION_NUMBER = ${versionNumber}"/> --- 139,146 ---- <echo message="previous version date = ${VERSION_DATE}"/> </target> <target name="versionSource" depends="init" description="update the version"> + <chmod file="${src}/org/htmlparser/Parser.java" perm="u+w"/> <echo message="Replacing version VERSION_NUMBER = ${VERSION_NUMBER} with VERSION_NUMBER = ${versionNumber} in ${src}/org/htmlparser/Parser.java"/> <replace file="${src}/org/htmlparser/Parser.java" token="VERSION_NUMBER = ${VERSION_NUMBER}" value="VERSION_NUMBER = 
${versionNumber}"/> *************** *** 162,165 **** --- 152,156 ---- <replace file="${src}/org/htmlparser/Parser.java" token="VERSION_DATE = &quot;${VERSION_DATE}&quot;" value="VERSION_DATE = &quot;${TODAY_STRING}&quot;"/> + <chmod file="${docs}/release.txt" perm="u+w"/> <echo message="Replacing version &quot;${VERSION_NUMBER} (${VERSION_TYPE} ${VERSION_DATE})&quot; with &quot;${versionNumber} (${versionType} ${TODAY_STRING})&quot; in ${docs}/release.txt"/> <replace dir="${docs}" value="${versionNumber} (${versionType} ${TODAY_STRING})">
From: <der...@us...> - 2004-01-02 19:32:08
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners In directory sc8-pr-cvs1:/tmp/cvs-serv4635/src/org/htmlparser/scanners Modified Files: package.html Log Message: Get keyword substitution working. In the future, could developers ensure source files are initially checked in with keyword substitution (-kkv) turned on? Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/scanners/package.html,v retrieving revision 1.20 retrieving revision 1.21 diff -C2 -d -r1.20 -r1.21 *** package.html 2 Jan 2004 16:24:54 -0000 1.20 --- package.html 2 Jan 2004 19:32:04 -0000 1.21 *************** *** 43,47 **** returned by the lexer and creates a tag containing a single string that is the script code.<br> ! You might need to create a scanner (that implements the {@link Scanner Scanner} interface) if the text you are trying to parse doesn't look like HTML, as is the case for the script scanner, or the normal processing of tags by nesting their structure is --- 43,48 ---- returned by the lexer and creates a tag containing a single string that is the script code.<br> ! You might need to create a scanner (that implements the ! {@link org.htmlparser.scanners.Scanner Scanner} interface) if the text you are trying to parse doesn't look like HTML, as is the case for the script scanner, or the normal processing of tags by nesting their structure is
From: <der...@us...> - 2004-01-02 19:32:08
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer In directory sc8-pr-cvs1:/tmp/cvs-serv4635/src/org/htmlparser/lexer Modified Files: Cursor.java Log Message: Get keyword substitution working. In the future, could developers ensure source files are initially checked in with keyword substitution (-kkv) turned on? Index: Cursor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/lexer/Cursor.java,v retrieving revision 1.16 retrieving revision 1.17 diff -C2 -d -r1.16 -r1.17 *** Cursor.java 2 Jan 2004 16:24:53 -0000 1.16 --- Cursor.java 2 Jan 2004 19:32:04 -0000 1.17 *************** *** 116,118 **** --- 116,163 ---- { return ((Cursor)clone ()); + } + catch (CloneNotSupportedException cnse) + { + return (new Cursor (getPage (), getPosition ())); + } + } + + public String toString () + { + int row; + int column; + StringBuffer ret; + + ret = new StringBuffer (9 * 3 + 3); // three ints and delimiters + ret.append (getPosition ()); + ret.append ("["); + if (null != mPage) + ret.append (mPage.row (this)); + else + ret.append ("?"); + ret.append (","); + if (null != mPage) + ret.append (mPage.column (this)); + else + ret.append ("?"); + ret.append ("]"); + + return (ret.toString ()); + } + + // + // Ordered interface + // + + /** + * Compare one reference to another. + * @see org.htmlparser.util.sort.Ordered + */ + public int compare (Object that) + { + Cursor r = (Cursor)that; + return (getPosition () - r.getPosition ()); + } + } +
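The compare method in the Cursor.java diff above orders cursors by subtracting one position from the other. What follows is a minimal sketch of that idiom only, not the library code: SketchCursor and SketchOrdered are hypothetical stand-ins, and the shape of the real org.htmlparser.util.sort.Ordered interface is an assumption, since it does not appear in this message.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;

// Hypothetical stand-in for org.htmlparser.util.sort.Ordered (assumed shape).
interface SketchOrdered
{
    int compare (Object that);
}

// Hypothetical stand-in for Cursor: only the character position is kept.
final class SketchCursor implements SketchOrdered
{
    private final int mPosition;

    SketchCursor (int position)
    {
        mPosition = position;
    }

    int getPosition ()
    {
        return (mPosition);
    }

    // Same subtraction idiom as the diff: the result is negative, zero or
    // positive. It cannot overflow as long as positions are non-negative.
    public int compare (Object that)
    {
        SketchCursor r = (SketchCursor)that;
        return (getPosition () - r.getPosition ());
    }

    // Return a copy of the list sorted by position, delegating to compare().
    static List<SketchCursor> sorted (List<SketchCursor> cursors)
    {
        List<SketchCursor> ret = new ArrayList<SketchCursor> (cursors);
        Collections.sort (ret, new Comparator<SketchCursor> ()
        {
            public int compare (SketchCursor a, SketchCursor b)
            {
                return (a.compare (b));
            }
        });
        return (ret);
    }
}
```

Subtraction is compact and is adequate for offsets into a page, which are never negative; for arbitrary int values a comparison with explicit less-than and greater-than tests would avoid overflow.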
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util In directory sc8-pr-cvs1:/tmp/cvs-serv2062/htmlparser/src/org/htmlparser/util Modified Files: ChainedException.java CommandLine.java DefaultParserFeedback.java FeedbackManager.java Generate.java IteratorImpl.java LinkProcessor.java NodeIterator.java NodeList.java ParserException.java ParserFeedback.java ParserUtils.java PeekingIterator.java SimpleNodeIterator.java SpecialHashtable.java Translate.java package.html Log Message: Update the integration procedure to use a CVS tag, rather than changing the version in every file header. Index: ChainedException.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/ChainedException.java,v retrieving revision 1.43 retrieving revision 1.44 diff -C2 -d -r1.43 -r1.44 *** ChainedException.java 8 Dec 2003 01:31:56 -0000 1.43 --- ChainedException.java 2 Jan 2004 16:24:58 -0000 1.44 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Claude Duguay ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,32 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! 
// Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... ! // ! // Postal Address : ! // Somik Raha ! // Extreme Programmer & Coach ! // Industrial Logic Corporation ! // 2583 Cedar Street, Berkeley, ! // CA 94708, USA ! // Website : http://www.industriallogic.com ! // ! // This class was contributed by ! // Claude Duguay // --- 17,26 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // Index: CommandLine.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/CommandLine.java,v retrieving revision 1.41 retrieving revision 1.42 diff -C2 -d -r1.41 -r1.42 *** CommandLine.java 8 Dec 2003 01:31:56 -0000 1.41 --- CommandLine.java 2 Jan 2004 16:24:58 -0000 1.42 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! 
// MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; Index: DefaultParserFeedback.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/DefaultParserFeedback.java,v retrieving revision 1.30 retrieving revision 1.31 diff -C2 -d -r1.30 -r1.31 *** DefaultParserFeedback.java 8 Dec 2003 01:31:56 -0000 1.30 --- DefaultParserFeedback.java 2 Jan 2004 16:24:58 -0000 1.31 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Claude Duguay ! // ! // Revision Control Information ! // ! 
// $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; *************** *** 36,43 **** * can implement their own classes to support alternate behavior. * - * @author Claude Duguay * @see ParserFeedback * @see FeedbackManager ! **/ public class DefaultParserFeedback implements --- 34,40 ---- * can implement their own classes to support alternate behavior. * * @see ParserFeedback * @see FeedbackManager ! 
*/ public class DefaultParserFeedback implements Index: FeedbackManager.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/FeedbackManager.java,v retrieving revision 1.43 retrieving revision 1.44 diff -C2 -d -r1.43 -r1.44 *** FeedbackManager.java 8 Dec 2003 01:31:56 -0000 1.43 --- FeedbackManager.java 2 Jan 2004 16:24:58 -0000 1.44 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Claude Duguay ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,30 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com - package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 
See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; *************** *** 37,41 **** * the setParserFeedback method. * - * @author Claude Duguay * @see ParserFeedback * @see DefaultParserFeedback --- 34,37 ---- Index: Generate.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/Generate.java,v retrieving revision 1.47 retrieving revision 1.48 diff -C2 -d -r1.47 -r1.48 *** Generate.java 8 Dec 2003 01:31:56 -0000 1.47 --- Generate.java 2 Jan 2004 16:24:58 -0000 1.48 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Derrick Oswald ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,32 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... ! // ! // Postal Address : ! // Somik Raha ! 
// Extreme Programmer & Coach ! // Industrial Logic Corporation ! // 2583 Cedar Street, Berkeley, ! // CA 94708, USA ! // Website : http://www.industriallogic.com ! // ! // This class was contributed by ! // Derrick Oswald // --- 17,26 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // *************** *** 443,448 **** { Generate filter = new Generate (); ! System.out.println ("// HTMLParser Library v1_4_20031207 - A java-based parser for HTML"); ! System.out.println ("// Copyright (C) Dec 31, 2000 Somik Raha"); System.out.println ("//"); System.out.println ("// This library is free software; you can redistribute it and/or"); --- 437,451 ---- { Generate filter = new Generate (); ! ! System.out.println ("// HTMLParser Library $Name$ - A java-based parser for HTML"); ! System.out.println ("// http://sourceforge.org/projects/htmlparser"); ! System.out.println ("// Copyright (C) 2004 Derrick Oswald"); ! System.out.println ("//"); ! System.out.println ("// Revision Control Information"); ! System.out.println ("//"); ! System.out.println ("// $Source$"); ! System.out.println ("// $Author$"); ! System.out.println ("// $Date$"); ! System.out.println ("// $Revision$"); System.out.println ("//"); System.out.println ("// This library is free software; you can redistribute it and/or"); *************** *** 453,476 **** System.out.println ("// This library is distributed in the hope that it will be useful,"); System.out.println ("// but WITHOUT ANY WARRANTY; without even the implied warranty of"); ! 
System.out.println ("// MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU"); System.out.println ("// Lesser General Public License for more details."); System.out.println ("//"); System.out.println ("// You should have received a copy of the GNU Lesser General Public"); System.out.println ("// License along with this library; if not, write to the Free Software"); ! System.out.println ("// Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA"); ! System.out.println ("//"); ! System.out.println ("// For any questions or suggestions, you can write to me at :"); ! System.out.println ("// Email :so...@in..."); ! System.out.println ("//"); ! System.out.println ("// Postal Address :"); ! System.out.println ("// Somik Raha"); ! System.out.println ("// Extreme Programmer & Coach"); ! System.out.println ("// Industrial Logic Corporation"); ! System.out.println ("// 2583 Cedar Street, Berkeley,"); ! System.out.println ("// CA 94708, USA"); ! System.out.println ("// Website : http://www.industriallogic.com"); ! System.out.println ("//"); ! System.out.println ("// This class was contributed by"); ! System.out.println ("// Derrick Oswald"); System.out.println ("//"); System.out.println (); --- 456,465 ---- System.out.println ("// This library is distributed in the hope that it will be useful,"); System.out.println ("// but WITHOUT ANY WARRANTY; without even the implied warranty of"); ! System.out.println ("// MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU"); System.out.println ("// Lesser General Public License for more details."); System.out.println ("//"); System.out.println ("// You should have received a copy of the GNU Lesser General Public"); System.out.println ("// License along with this library; if not, write to the Free Software"); ! 
System.out.println ("// Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA"); System.out.println ("//"); System.out.println (); Index: IteratorImpl.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/IteratorImpl.java,v retrieving revision 1.38 retrieving revision 1.39 diff -C2 -d -r1.38 -r1.39 *** IteratorImpl.java 20 Dec 2003 23:47:55 -0000 1.38 --- IteratorImpl.java 2 Jan 2004 16:24:58 -0000 1.39 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... 
// - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; Index: LinkProcessor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/LinkProcessor.java,v retrieving revision 1.32 retrieving revision 1.33 diff -C2 -d -r1.32 -r1.33 *** LinkProcessor.java 8 Dec 2003 01:31:56 -0000 1.32 --- LinkProcessor.java 2 Jan 2004 16:24:58 -0000 1.33 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. 
// // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; Index: NodeIterator.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/NodeIterator.java,v retrieving revision 1.31 retrieving revision 1.32 diff -C2 -d -r1.31 -r1.32 *** NodeIterator.java 8 Dec 2003 01:31:56 -0000 1.31 --- NodeIterator.java 2 Jan 2004 16:24:58 -0000 1.32 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! 
// $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; Index: NodeList.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/NodeList.java,v retrieving revision 1.52 retrieving revision 1.53 diff -C2 -d -r1.52 -r1.53 *** NodeList.java 31 Dec 2003 02:50:50 -0000 1.52 --- NodeList.java 2 Jan 2004 16:24:58 -0000 1.53 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! 
// Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! 
// Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; Index: ParserException.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/ParserException.java,v retrieving revision 1.28 retrieving revision 1.29 diff -C2 -d -r1.28 -r1.29 *** ParserException.java 8 Dec 2003 01:31:56 -0000 1.28 --- ParserException.java 2 Jan 2004 16:24:58 -0000 1.29 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Claude Duguay ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! 
// MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; *************** *** 33,40 **** * * @see ChainedException - * @author Claude Duguay **/ - - public class ParserException extends ChainedException --- 31,35 ---- Index: ParserFeedback.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/ParserFeedback.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** ParserFeedback.java 8 Dec 2003 01:31:56 -0000 1.29 --- ParserFeedback.java 2 Jan 2004 16:24:58 -0000 1.30 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Claude Duguay ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! 
// For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; *************** *** 36,40 **** * are also possible. * - * @author Claude Duguay * @see DefaultParserFeedback * @see FeedbackManager --- 34,37 ---- Index: ParserUtils.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/ParserUtils.java,v retrieving revision 1.37 retrieving revision 1.38 diff -C2 -d -r1.37 -r1.38 *** ParserUtils.java 8 Dec 2003 01:31:56 -0000 1.37 --- ParserUtils.java 2 Jan 2004 16:24:58 -0000 1.38 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! 
// $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; Index: PeekingIterator.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/PeekingIterator.java,v retrieving revision 1.20 retrieving revision 1.21 diff -C2 -d -r1.20 -r1.21 *** PeekingIterator.java 8 Dec 2003 01:31:56 -0000 1.20 --- PeekingIterator.java 2 Jan 2004 16:24:58 -0000 1.21 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! 
// Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Derrick Oswald ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! 
// Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; Index: SimpleNodeIterator.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/SimpleNodeIterator.java,v retrieving revision 1.33 retrieving revision 1.34 diff -C2 -d -r1.33 -r1.34 *** SimpleNodeIterator.java 8 Dec 2003 01:31:56 -0000 1.33 --- SimpleNodeIterator.java 2 Jan 2004 16:24:58 -0000 1.34 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... 
// - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; Index: SpecialHashtable.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/SpecialHashtable.java,v retrieving revision 1.4 retrieving revision 1.5 diff -C2 -d -r1.4 -r1.5 *** SpecialHashtable.java 8 Dec 2003 01:31:56 -0000 1.4 --- SpecialHashtable.java 2 Jan 2004 16:24:58 -0000 1.5 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Derrick Oswald ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. 
// // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.util; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.util; Index: Translate.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/Translate.java,v retrieving revision 1.41 retrieving revision 1.42 diff -C2 -d -r1.41 -r1.42 *** Translate.java 8 Dec 2003 01:31:56 -0000 1.41 --- Translate.java 2 Jan 2004 16:24:58 -0000 1.42 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Derrick Oswald ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! 
// $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,32 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... ! // ! // Postal Address : ! // Somik Raha ! // Extreme Programmer & Coach ! // Industrial Logic Corporation ! // 2583 Cedar Street, Berkeley, ! // CA 94708, USA ! // Website : http://www.industriallogic.com ! // ! // This class was contributed by ! // Derrick Oswald // --- 17,26 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/package.html,v retrieving revision 1.18 retrieving revision 1.19 diff -C2 -d -r1.18 -r1.19 *** package.html 8 Dec 2003 01:31:56 -0000 1.18 --- package.html 2 Jan 2004 16:24:58 -0000 1.19 *************** *** 3,11 **** <head> <!-- ! @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! 
Copyright (C) Dec 31, 2000 Somik Raha This library is free software; you can redistribute it and/or --- 3,16 ---- <head> <!-- + HTMLParser Library $Name$ - A java-based parser for HTML + http://sourceforge.org/projects/htmlparser + Copyright (C) 2004 Somik Raha ! Revision Control Information ! $Source$ ! $Author$ ! $Date$ ! $Revision$ This library is free software; you can redistribute it and/or *************** *** 16,37 **** This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of ! MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details. You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software ! Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! ! For any questions or suggestions, you can write to me at : ! Email :so...@in... ! ! Postal Address : ! Somik Raha ! Extreme Programmer & Coach ! Industrial Logic Corporation ! 2583 Cedar Street, Berkeley, ! CA 94708, USA ! Website : http://www.industriallogic.com ! --> </head> --- 21,30 ---- This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of ! MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details. You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software ! Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA --> </head> |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/visitorsTests
In directory sc8-pr-cvs1:/tmp/cvs-serv2062/htmlparser/src/org/htmlparser/tests/visitorsTests

Modified Files:
  AllTests.java CompositeTagFindingVisitorTest.java
  HtmlPageTest.java LinkFindingVisitorTest.java
  NodeVisitorTest.java StringFindingVisitorTest.java
  TagFindingVisitorTest.java TextExtractingVisitorTest.java
  UrlModifyingVisitorTest.java
Log Message:
Update the integration procedure to use a CVS tag, rather than changing the version in every file header.

Index: AllTests.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/visitorsTests/AllTests.java,v
retrieving revision 1.40
retrieving revision 1.41
diff -C2 -d -r1.40 -r1.41
*** AllTests.java 8 Dec 2003 01:31:56 -0000 1.40
--- AllTests.java 2 Jan 2004 16:24:57 -0000 1.41
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
!
// For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.visitorsTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.visitorsTests; Index: CompositeTagFindingVisitorTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/visitorsTests/CompositeTagFindingVisitorTest.java,v retrieving revision 1.13 retrieving revision 1.14 diff -C2 -d -r1.13 -r1.14 *** CompositeTagFindingVisitorTest.java 8 Dec 2003 01:31:56 -0000 1.13 --- CompositeTagFindingVisitorTest.java 2 Jan 2004 16:24:57 -0000 1.14 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! 
// $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.visitorsTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.visitorsTests; Index: HtmlPageTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/visitorsTests/HtmlPageTest.java,v retrieving revision 1.17 retrieving revision 1.18 diff -C2 -d -r1.17 -r1.18 *** HtmlPageTest.java 8 Dec 2003 01:31:56 -0000 1.17 --- HtmlPageTest.java 2 Jan 2004 16:24:57 -0000 1.18 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! 
// Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.visitorsTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! 
// Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.visitorsTests; Index: LinkFindingVisitorTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/visitorsTests/LinkFindingVisitorTest.java,v retrieving revision 1.14 retrieving revision 1.15 diff -C2 -d -r1.14 -r1.15 *** LinkFindingVisitorTest.java 8 Dec 2003 01:31:56 -0000 1.14 --- LinkFindingVisitorTest.java 2 Jan 2004 16:24:57 -0000 1.15 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... 
// - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.visitorsTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.visitorsTests; Index: NodeVisitorTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/visitorsTests/NodeVisitorTest.java,v retrieving revision 1.13 retrieving revision 1.14 diff -C2 -d -r1.13 -r1.14 *** NodeVisitorTest.java 8 Dec 2003 01:31:56 -0000 1.13 --- NodeVisitorTest.java 2 Jan 2004 16:24:57 -0000 1.14 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. 
// // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.visitorsTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.visitorsTests; Index: StringFindingVisitorTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/visitorsTests/StringFindingVisitorTest.java,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** StringFindingVisitorTest.java 8 Dec 2003 01:31:56 -0000 1.15 --- StringFindingVisitorTest.java 2 Jan 2004 16:24:57 -0000 1.16 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! 
// $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.visitorsTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.visitorsTests; Index: TagFindingVisitorTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/visitorsTests/TagFindingVisitorTest.java,v retrieving revision 1.16 retrieving revision 1.17 diff -C2 -d -r1.16 -r1.17 *** TagFindingVisitorTest.java 8 Dec 2003 01:31:56 -0000 1.16 --- TagFindingVisitorTest.java 2 Jan 2004 16:24:57 -0000 1.17 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! 
// Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.visitorsTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! 
// Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.visitorsTests; Index: TextExtractingVisitorTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/visitorsTests/TextExtractingVisitorTest.java,v retrieving revision 1.14 retrieving revision 1.15 diff -C2 -d -r1.14 -r1.15 *** TextExtractingVisitorTest.java 8 Dec 2003 01:31:56 -0000 1.14 --- TextExtractingVisitorTest.java 2 Jan 2004 16:24:57 -0000 1.15 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... 
// - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.visitorsTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.visitorsTests; Index: UrlModifyingVisitorTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/visitorsTests/UrlModifyingVisitorTest.java,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** UrlModifyingVisitorTest.java 8 Dec 2003 01:31:56 -0000 1.15 --- UrlModifyingVisitorTest.java 2 Jan 2004 16:24:57 -0000 1.16 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 
See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.visitorsTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.visitorsTests; |
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests
In directory sc8-pr-cvs1:/tmp/cvs-serv2062/htmlparser/src/org/htmlparser/tests/tagTests

Modified Files:
    AllTests.java AppletTagTest.java BaseHrefTagTest.java BodyTagTest.java
    CompositeTagTest.java DoctypeTagTest.java EndTagTest.java FormTagTest.java
    FrameSetTagTest.java FrameTagTest.java ImageTagTest.java InputTagTest.java
    JspTagTest.java LinkTagTest.java MetaTagTest.java ObjectCollectionTest.java
    OptionTagTest.java ScriptTagTest.java SelectTagTest.java StyleTagTest.java
    TagTest.java TextareaTagTest.java TitleTagTest.java package.html
Log Message:
Update the integration procedure to use a CVS tag, rather than changing the version in every file header.

Index: AllTests.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/AllTests.java,v
retrieving revision 1.49
retrieving revision 1.50
diff -C2 -d -r1.49 -r1.50
*** AllTests.java 8 Dec 2003 01:31:55 -0000 1.49
--- AllTests.java 2 Jan 2004 16:24:57 -0000 1.50
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
// // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.tagTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.tagTests; Index: AppletTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/AppletTagTest.java,v retrieving revision 1.36 retrieving revision 1.37 diff -C2 -d -r1.36 -r1.37 *** AppletTagTest.java 8 Dec 2003 01:31:55 -0000 1.36 --- AppletTagTest.java 2 Jan 2004 16:24:57 -0000 1.37 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! 
// $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.tagTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.tagTests; Index: BaseHrefTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/BaseHrefTagTest.java,v retrieving revision 1.37 retrieving revision 1.38 diff -C2 -d -r1.37 -r1.38 *** BaseHrefTagTest.java 29 Dec 2003 14:18:24 -0000 1.37 --- BaseHrefTagTest.java 2 Jan 2004 16:24:57 -0000 1.38 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! 
// Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.tagTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! 
// Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.tagTests; Index: BodyTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/BodyTagTest.java,v retrieving revision 1.19 retrieving revision 1.20 diff -C2 -d -r1.19 -r1.20 *** BodyTagTest.java 8 Dec 2003 01:31:55 -0000 1.19 --- BodyTagTest.java 2 Jan 2004 16:24:57 -0000 1.20 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... 
// - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.tagTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.tagTests; Index: CompositeTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/CompositeTagTest.java,v retrieving revision 1.15 retrieving revision 1.16 diff -C2 -d -r1.15 -r1.16 *** CompositeTagTest.java 8 Dec 2003 01:31:55 -0000 1.15 --- CompositeTagTest.java 2 Jan 2004 16:24:57 -0000 1.16 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. 
// // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.tagTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.tagTests; Index: DoctypeTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/DoctypeTagTest.java,v retrieving revision 1.35 retrieving revision 1.36 diff -C2 -d -r1.35 -r1.36 *** DoctypeTagTest.java 2 Jan 2004 05:01:28 -0000 1.35 --- DoctypeTagTest.java 2 Jan 2004 16:24:57 -0000 1.36 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! 
// $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.tagTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.tagTests; Index: EndTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/EndTagTest.java,v retrieving revision 1.37 retrieving revision 1.38 diff -C2 -d -r1.37 -r1.38 *** EndTagTest.java 8 Dec 2003 01:31:55 -0000 1.37 --- EndTagTest.java 2 Jan 2004 16:24:57 -0000 1.38 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! 
// Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,30 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com - package org.htmlparser.tests.tagTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! 
// Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.tagTests; Index: FormTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/FormTagTest.java,v retrieving revision 1.42 retrieving revision 1.43 diff -C2 -d -r1.42 -r1.43 *** FormTagTest.java 2 Jan 2004 05:01:28 -0000 1.42 --- FormTagTest.java 2 Jan 2004 16:24:57 -0000 1.43 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... 
// - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.tests.tagTests; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.tests.tagTests; Index: FrameSetTagTest.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/FrameSetTagTest.java,v retrieving revision 1.35 retrieving revision 1.36 diff -C2 -d -r1.35 -r1.36 *** FrameSetTagTest.java 8 Dec 2003 01:31:55 -0000 1.35 --- FrameSetTagTest.java 2 Jan 2004 16:24:57 -0000 1.36 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. 
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: FrameTagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/FrameTagTest.java,v
retrieving revision 1.35
retrieving revision 1.36
diff -C2 -d -r1.35 -r1.36
*** FrameTagTest.java 8 Dec 2003 01:31:55 -0000 1.35
--- FrameTagTest.java 2 Jan 2004 16:24:57 -0000 1.36
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: ImageTagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/ImageTagTest.java,v
retrieving revision 1.40
retrieving revision 1.41
diff -C2 -d -r1.40 -r1.41
*** ImageTagTest.java 2 Jan 2004 05:01:28 -0000 1.40
--- ImageTagTest.java 2 Jan 2004 16:24:57 -0000 1.41
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: InputTagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/InputTagTest.java,v
retrieving revision 1.38
retrieving revision 1.39
diff -C2 -d -r1.38 -r1.39
*** InputTagTest.java 8 Dec 2003 01:31:55 -0000 1.38
--- InputTagTest.java 2 Jan 2004 16:24:57 -0000 1.39
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Dhaval Udani
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,32 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
-
- // Author of this class : Dhaval Udani
- // dha...@or...
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: JspTagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/JspTagTest.java,v
retrieving revision 1.41
retrieving revision 1.42
diff -C2 -d -r1.41 -r1.42
*** JspTagTest.java 2 Jan 2004 05:01:28 -0000 1.41
--- JspTagTest.java 2 Jan 2004 16:24:57 -0000 1.42
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: LinkTagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/LinkTagTest.java,v
retrieving revision 1.44
retrieving revision 1.45
diff -C2 -d -r1.44 -r1.45
*** LinkTagTest.java 2 Jan 2004 05:01:28 -0000 1.44
--- LinkTagTest.java 2 Jan 2004 16:24:57 -0000 1.45
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: MetaTagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/MetaTagTest.java,v
retrieving revision 1.37
retrieving revision 1.38
diff -C2 -d -r1.37 -r1.38
*** MetaTagTest.java 8 Dec 2003 01:31:55 -0000 1.37
--- MetaTagTest.java 2 Jan 2004 16:24:57 -0000 1.38
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: ObjectCollectionTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/ObjectCollectionTest.java,v
retrieving revision 1.19
retrieving revision 1.20
diff -C2 -d -r1.19 -r1.20
*** ObjectCollectionTest.java 8 Dec 2003 01:31:55 -0000 1.19
--- ObjectCollectionTest.java 2 Jan 2004 16:24:57 -0000 1.20
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: OptionTagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/OptionTagTest.java,v
retrieving revision 1.40
retrieving revision 1.41
diff -C2 -d -r1.40 -r1.41
*** OptionTagTest.java 8 Dec 2003 01:31:55 -0000 1.40
--- OptionTagTest.java 2 Jan 2004 16:24:57 -0000 1.41
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Dhaval Udani
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,32 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
! //
! // Postal Address :
! // Somik Raha
! // Extreme Programmer & Coach
! // Industrial Logic Corporation
! // 2583 Cedar Street, Berkeley,
! // CA 94708, USA
! // Website : http://www.industriallogic.com
  //
- // Author of this class : Dhaval Udani
- // dha...@or...
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: ScriptTagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/ScriptTagTest.java,v
retrieving revision 1.41
retrieving revision 1.42
diff -C2 -d -r1.41 -r1.42
*** ScriptTagTest.java 8 Dec 2003 01:31:55 -0000 1.41
--- ScriptTagTest.java 2 Jan 2004 16:24:57 -0000 1.42
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: SelectTagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/SelectTagTest.java,v
retrieving revision 1.39
retrieving revision 1.40
diff -C2 -d -r1.39 -r1.40
*** SelectTagTest.java 8 Dec 2003 01:31:55 -0000 1.39
--- SelectTagTest.java 2 Jan 2004 16:24:57 -0000 1.40
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Dhaval Udani
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,32 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
! //
! // Postal Address :
! // Somik Raha
! // Extreme Programmer & Coach
! // Industrial Logic Corporation
! // 2583 Cedar Street, Berkeley,
! // CA 94708, USA
! // Website : http://www.industriallogic.com
  //
- // Author of this class : Dhaval Udani
- // dha...@or...
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: StyleTagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/StyleTagTest.java,v
retrieving revision 1.34
retrieving revision 1.35
diff -C2 -d -r1.34 -r1.35
*** StyleTagTest.java 8 Dec 2003 01:31:55 -0000 1.34
--- StyleTagTest.java 2 Jan 2004 16:24:57 -0000 1.35
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: TagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/TagTest.java,v
retrieving revision 1.55
retrieving revision 1.56
diff -C2 -d -r1.55 -r1.56
*** TagTest.java 8 Dec 2003 01:31:55 -0000 1.55
--- TagTest.java 2 Jan 2004 16:24:57 -0000 1.56
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: TextareaTagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/TextareaTagTest.java,v
retrieving revision 1.37
retrieving revision 1.38
diff -C2 -d -r1.37 -r1.38
*** TextareaTagTest.java 8 Dec 2003 01:31:55 -0000 1.37
--- TextareaTagTest.java 2 Jan 2004 16:24:57 -0000 1.38
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: TitleTagTest.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/TitleTagTest.java,v
retrieving revision 1.34
retrieving revision 1.35
diff -C2 -d -r1.34 -r1.35
*** TitleTagTest.java 8 Dec 2003 01:31:55 -0000 1.34
--- TitleTagTest.java 2 Jan 2004 16:24:57 -0000 1.35
***************
*** 1,4 ****
! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! // Copyright (C) Dec 31, 2000 Somik Raha
  //
  // This library is free software; you can redistribute it and/or
--- 1,12 ----
! // HTMLParser Library $Name$ - A java-based parser for HTML
! // http://sourceforge.org/projects/htmlparser
! // Copyright (C) 2004 Somik Raha
! //
! // Revision Control Information
! //
! // $Source$
! // $Author$
! // $Date$
! // $Revision$
  //
  // This library is free software; you can redistribute it and/or
***************
*** 9,29 ****
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
! //
! // For any questions or suggestions, you can write to me at :
! // Email :so...@in...
  //
- // Postal Address :
- // Somik Raha
- // Extreme Programmer & Coach
- // Industrial Logic Corporation
- // 2583 Cedar Street, Berkeley,
- // CA 94708, USA
- // Website : http://www.industriallogic.com
  package org.htmlparser.tests.tagTests;
--- 17,27 ----
  // This library is distributed in the hope that it will be useful,
  // but WITHOUT ANY WARRANTY; without even the implied warranty of
! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  // Lesser General Public License for more details.
  //
  // You should have received a copy of the GNU Lesser General Public
  // License along with this library; if not, write to the Free Software
! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  //
  package org.htmlparser.tests.tagTests;
Index: package.html
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/tests/tagTests/package.html,v
retrieving revision 1.18
retrieving revision 1.19
diff -C2 -d -r1.18 -r1.19
*** package.html 8 Dec 2003 01:31:55 -0000 1.18
--- package.html 2 Jan 2004 16:24:57 -0000 1.19
***************
*** 3,11 ****
  <head>
  <!--
! @(#)package.html 1.60 98/01/27
! HTMLParser Library v1_4_20031207 - A java-based parser for HTML
! Copyright (C) Dec 31, 2000 Somik Raha
  This library is free software; you can redistribute it and/or
--- 3,16 ----
  <head>
  <!--
+ HTMLParser Library $Name$ - A java-based parser for HTML
+ http://sourceforge.org/projects/htmlparser
+ Copyright (C) 2004 Somik Raha
! Revision Control Information
! $Source$
! $Author$
! $Date$
! $Revision$
  This library is free software; you can redistribute it and/or
***************
*** 16,37 ****
  This library is distributed in the hope that it will be useful,
  but WITHOUT ANY WARRANTY; without even the implied warranty of
! MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  Lesser General Public License for more details.
  You should have received a copy of the GNU Lesser General Public
  License along with this library; if not, write to the Free Software
! Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
!
! For any questions or suggestions, you can write to me at :
! Email :so...@in...
!
! Postal Address :
! Somik Raha
! Extreme Programmer & Coach
! Industrial Logic Corporation
! 2583 Cedar Street, Berkeley,
! CA 94708, USA
! Website : http://www.industriallogic.com
! -->
  </head>
--- 21,30 ----
  This library is distributed in the hope that it will be useful,
  but WITHOUT ANY WARRANTY; without even the implied warranty of
! MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
  Lesser General Public License for more details.
  You should have received a copy of the GNU Lesser General Public
  License along with this library; if not, write to the Free Software
! Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
  -->
  </head>
|
From: <der...@us...> - 2004-01-02 16:30:08
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors In directory sc8-pr-cvs1:/tmp/cvs-serv2062/htmlparser/src/org/htmlparser/visitors Modified Files: HtmlPage.java LinkFindingVisitor.java NodeVisitor.java ObjectFindingVisitor.java StringFindingVisitor.java TagFindingVisitor.java TextExtractingVisitor.java UrlModifyingVisitor.java package.html Log Message: Update the integration procedure to use a CVS tag, rather than changing the version in every file header. Index: HtmlPage.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/HtmlPage.java,v retrieving revision 1.40 retrieving revision 1.41 diff -C2 -d -r1.40 -r1.41 *** HtmlPage.java 8 Dec 2003 01:31:56 -0000 1.40 --- HtmlPage.java 2 Jan 2004 16:24:58 -0000 1.41 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... 
// - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.visitors; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.visitors; Index: LinkFindingVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/LinkFindingVisitor.java,v retrieving revision 1.33 retrieving revision 1.34 diff -C2 -d -r1.33 -r1.34 *** LinkFindingVisitor.java 8 Dec 2003 01:31:56 -0000 1.33 --- LinkFindingVisitor.java 2 Jan 2004 16:24:58 -0000 1.34 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. 
// // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.visitors; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.visitors; Index: NodeVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/NodeVisitor.java,v retrieving revision 1.35 retrieving revision 1.36 diff -C2 -d -r1.35 -r1.36 *** NodeVisitor.java 8 Dec 2003 01:31:56 -0000 1.35 --- NodeVisitor.java 2 Jan 2004 16:24:58 -0000 1.36 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! 
// $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.visitors; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.visitors; Index: ObjectFindingVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/ObjectFindingVisitor.java,v retrieving revision 1.38 retrieving revision 1.39 diff -C2 -d -r1.38 -r1.39 *** ObjectFindingVisitor.java 8 Dec 2003 01:31:56 -0000 1.38 --- ObjectFindingVisitor.java 2 Jan 2004 16:24:58 -0000 1.39 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! 
// Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Joshua Kerievsky ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,31 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com - - // contributed by Joshua Kerievsky package org.htmlparser.visitors; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! 
// Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.visitors; Index: StringFindingVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/StringFindingVisitor.java,v retrieving revision 1.37 retrieving revision 1.38 diff -C2 -d -r1.37 -r1.38 *** StringFindingVisitor.java 8 Dec 2003 01:31:56 -0000 1.37 --- StringFindingVisitor.java 2 Jan 2004 16:24:58 -0000 1.38 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Joshua Kerievsky ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,31 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... 
// - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com - - // contributed by Joshua Kerievsky package org.htmlparser.visitors; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.visitors; Index: TagFindingVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/TagFindingVisitor.java,v retrieving revision 1.40 retrieving revision 1.41 diff -C2 -d -r1.40 -r1.41 *** TagFindingVisitor.java 8 Dec 2003 01:31:56 -0000 1.40 --- TagFindingVisitor.java 2 Jan 2004 16:24:58 -0000 1.41 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Joshua Kerievsky ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,31 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 
See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http:// www.industriallogic.com - - // contributed by Joshua Kerievsky package org.htmlparser.visitors; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.visitors; Index: TextExtractingVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/TextExtractingVisitor.java,v retrieving revision 1.38 retrieving revision 1.39 diff -C2 -d -r1.38 -r1.39 *** TextExtractingVisitor.java 8 Dec 2003 01:31:56 -0000 1.38 --- TextExtractingVisitor.java 2 Jan 2004 16:24:58 -0000 1.39 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Somik Raha ! // ! // Revision Control Information ! // ! 
// $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,29 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... // - // Postal Address : - // Somik Raha - // Extreme Programmer & Coach - // Industrial Logic Corporation - // 2583 Cedar Street, Berkeley, - // CA 94708, USA - // Website : http://www.industriallogic.com package org.htmlparser.visitors; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.visitors; Index: UrlModifyingVisitor.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/UrlModifyingVisitor.java,v retrieving revision 1.42 retrieving revision 1.43 diff -C2 -d -r1.42 -r1.43 *** UrlModifyingVisitor.java 1 Jan 2004 17:16:53 -0000 1.42 --- UrlModifyingVisitor.java 2 Jan 2004 16:24:58 -0000 1.43 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! 
// Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Joshua Kerievsky ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,31 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... ! // ! // Postal Address : ! // Somik Raha ! // Extreme Programmer & Coach ! // Industrial Logic Corporation ! // 2583 Cedar Street, Berkeley, ! // CA 94708, USA ! // Website : http://www.industriallogic.com // - // This class was contributed by Joshua Kerievsky package org.htmlparser.visitors; --- 17,27 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! 
// Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // package org.htmlparser.visitors; Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/visitors/package.html,v retrieving revision 1.18 retrieving revision 1.19 diff -C2 -d -r1.18 -r1.19 *** package.html 8 Dec 2003 01:31:56 -0000 1.18 --- package.html 2 Jan 2004 16:24:58 -0000 1.19 *************** *** 3,11 **** <head> <!-- ! @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! Copyright (C) Dec 31, 2000 Somik Raha This library is free software; you can redistribute it and/or --- 3,16 ---- <head> <!-- + HTMLParser Library $Name$ - A java-based parser for HTML + http://sourceforge.org/projects/htmlparser + Copyright (C) 2004 Somik Raha ! Revision Control Information ! $Source$ ! $Author$ ! $Date$ ! $Revision$ This library is free software; you can redistribute it and/or *************** *** 16,37 **** This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of ! MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details. You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software ! Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! ! For any questions or suggestions, you can write to me at : ! Email :so...@in... ! ! Postal Address : ! Somik Raha ! Extreme Programmer & Coach ! Industrial Logic Corporation ! 2583 Cedar Street, Berkeley, ! CA 94708, USA ! Website : http://www.industriallogic.com ! --> </head> --- 21,30 ---- This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of ! MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 
See the GNU Lesser General Public License for more details. You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software ! Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA --> </head> |
From: <der...@us...> - 2004-01-02 16:30:06
|
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/sort In directory sc8-pr-cvs1:/tmp/cvs-serv2062/htmlparser/src/org/htmlparser/util/sort Modified Files: Ordered.java Sort.java Sortable.java package.html Log Message: Update the integration procedure to use a CVS tag, rather than changing the version in every file header. Index: Ordered.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/sort/Ordered.java,v retrieving revision 1.10 retrieving revision 1.11 diff -C2 -d -r1.10 -r1.11 *** Ordered.java 8 Dec 2003 01:31:56 -0000 1.10 --- Ordered.java 2 Jan 2004 16:24:58 -0000 1.11 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Derrick Oswald ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,32 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... ! // ! // Postal Address : ! // Somik Raha ! // Extreme Programmer & Coach ! // Industrial Logic Corporation ! // 2583 Cedar Street, Berkeley, ! // CA 94708, USA ! 
// Website : http://www.industriallogic.com ! // ! // This class was contributed by ! // Derrick Oswald // --- 17,26 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // Index: Sort.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/sort/Sort.java,v retrieving revision 1.10 retrieving revision 1.11 diff -C2 -d -r1.10 -r1.11 *** Sort.java 8 Dec 2003 01:31:56 -0000 1.10 --- Sort.java 2 Jan 2004 16:24:58 -0000 1.11 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Derrick Oswald ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,32 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! 
// For any questions or suggestions, you can write to me at : ! // Email :so...@in... ! // ! // Postal Address : ! // Somik Raha ! // Extreme Programmer & Coach ! // Industrial Logic Corporation ! // 2583 Cedar Street, Berkeley, ! // CA 94708, USA ! // Website : http://www.industriallogic.com ! // ! // This class was contributed by ! // Derrick Oswald // --- 17,26 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // Index: Sortable.java =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/sort/Sortable.java,v retrieving revision 1.10 retrieving revision 1.11 diff -C2 -d -r1.10 -r1.11 *** Sortable.java 8 Dec 2003 01:31:56 -0000 1.10 --- Sortable.java 2 Jan 2004 16:24:58 -0000 1.11 *************** *** 1,4 **** ! // HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! // Copyright (C) Dec 31, 2000 Somik Raha // // This library is free software; you can redistribute it and/or --- 1,12 ---- ! // HTMLParser Library $Name$ - A java-based parser for HTML ! // http://sourceforge.org/projects/htmlparser ! // Copyright (C) 2004 Derrick Oswald ! // ! // Revision Control Information ! // ! // $Source$ ! // $Author$ ! // $Date$ ! // $Revision$ // // This library is free software; you can redistribute it and/or *************** *** 9,32 **** // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 
See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! // ! // For any questions or suggestions, you can write to me at : ! // Email :so...@in... ! // ! // Postal Address : ! // Somik Raha ! // Extreme Programmer & Coach ! // Industrial Logic Corporation ! // 2583 Cedar Street, Berkeley, ! // CA 94708, USA ! // Website : http://www.industriallogic.com ! // ! // This class was contributed by ! // Derrick Oswald // --- 17,26 ---- // This library is distributed in the hope that it will be useful, // but WITHOUT ANY WARRANTY; without even the implied warranty of ! // MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU // Lesser General Public License for more details. // // You should have received a copy of the GNU Lesser General Public // License along with this library; if not, write to the Free Software ! // Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA // Index: package.html =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/util/sort/package.html,v retrieving revision 1.9 retrieving revision 1.10 diff -C2 -d -r1.9 -r1.10 *** package.html 8 Dec 2003 01:31:56 -0000 1.9 --- package.html 2 Jan 2004 16:24:58 -0000 1.10 *************** *** 4,12 **** <HEAD> <!-- ! @(#)package.html 1.60 98/01/27 ! HTMLParser Library v1_4_20031207 - A java-based parser for HTML ! Copyright (C) Dec 31, 2000 Somik Raha This library is free software; you can redistribute it and/or --- 4,17 ---- <HEAD> <!-- + HTMLParser Library $Name$ - A java-based parser for HTML + http://sourceforge.org/projects/htmlparser + Copyright (C) 2004 Somik Raha ! Revision Control Information ! $Source$ ! $Author$ ! $Date$ ! 
$Revision$ This library is free software; you can redistribute it and/or *************** *** 17,38 **** This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of ! MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details. You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software ! Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA ! ! For any questions or suggestions, you can write to me at : ! Email :so...@in... ! ! Postal Address : ! Somik Raha ! Extreme Programmer & Coach ! Industrial Logic Corporation ! 2583 Cedar Street, Berkeley, ! CA 94708, USA ! Website : http://www.industriallogic.com ! --> <TITLE>Sort Package</TITLE> --- 22,31 ---- This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of ! MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details. You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software ! Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA --> <TITLE>Sort Package</TITLE> |
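Note on the $Name$ keyword used in the new headers: CVS expands it to the form "$Name: tag $" when a file is checked out or exported with an explicit tag, and leaves the tag empty otherwise. As a minimal sketch of how code could read the version out of such a header string, the Java below extracts the tag with a regular expression. The class name NameKeyword, the method tagOf, the fallback value "unknown", and the tag v1_5_20040102 are all hypothetical and are not part of the HTML Parser sources.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of org.htmlparser: reads the tag out of an
// expanded CVS $Name$ keyword string.
class NameKeyword {
    // Matches the expanded form "$Name: <tag> $". The unexpanded form
    // "$Name$" has no colon and therefore does not match.
    private static final Pattern EXPANDED =
        Pattern.compile("\\$Name:\\s*(\\S*)\\s*\\$");

    // Returns the tag, or "unknown" (an assumed fallback) when the keyword
    // is unexpanded or was expanded without a tag.
    static String tagOf(String keyword) {
        Matcher m = EXPANDED.matcher(keyword);
        if (m.find() && !m.group(1).isEmpty()) {
            return m.group(1);
        }
        return "unknown";
    }

    public static void main(String[] args) {
        // v1_5_20040102 is an illustrative tag name only.
        System.out.println(tagOf("$Name: v1_5_20040102 $"));
        System.out.println(tagOf("$Name$"));
    }
}
```

With this approach the version appears only in the tag applied at integration time, so a release no longer has to touch every file header.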