[Htmlparser-cvs] htmlparser/docs release.txt,1.69,1.70 changes.txt,1.205,1.206
Brought to you by:
derrickoswald
From: Derrick O. <der...@us...> - 2005-06-14 10:37:50
|
Update of /cvsroot/htmlparser/htmlparser/docs In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv3209/docs Modified Files: release.txt changes.txt Log Message: Update version to 1.5-20050614 Index: release.txt =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/release.txt,v retrieving revision 1.69 retrieving revision 1.70 diff -C2 -d -r1.69 -r1.70 *** release.txt 6 Apr 2005 10:28:01 -0000 1.69 --- release.txt 14 Jun 2005 10:37:33 -0000 1.70 *************** *** 1,3 **** ! HTMLParser Version 1.5 (Integration Build Mar 13, 2005) ********************************************* --- 1,3 ---- ! HTMLParser Version 1.5 (Release Build Jun 14, 2005) ********************************************* *************** *** 29,35 **** New APIs Implement rudimentary sax parser. Currently exposes DOM parser via sax project ! A new http package is added, the primary class being Connectionmanager which ! handles proxies, passwords and cookies. Some testing still needed. ! Also removed some line separator cruft. Added parseCDATA to the Lexer, used in script and style scanners. Note that this is significantly new behaviour that now adheres to appendix --- 29,35 ---- New APIs Implement rudimentary sax parser. Currently exposes DOM parser via sax project ! A new http package is added, the primary class being Connectionmanager which ! handles proxies, passwords and cookies. Some testing still needed. ! Also removed some line separator cruft. Added parseCDATA to the Lexer, used in script and style scanners. Note that this is significantly new behaviour that now adheres to appendix *************** *** 41,51 **** Updated the logo and included the LGPL license. Fixed the Windows batch files. ! Added optional "classes" property to build.xml. This directory is where class files are put. It defaults to src. To use: ant -Dclasses=classdir <target> where classdir is/will-be a peer directory to src. Refactoring ! Added static STRICT flag to ScriptScanner to revert to legacy handling of broken ETAGO (</). If STRICT is true, scan according to HTML specification, else if false, scan with quote smart state machine which heuristically --- 41,52 ---- Updated the logo and included the LGPL license. Fixed the Windows batch files. ! Added optional "classes" property to build.xml. This directory is where class files are put. It defaults to src. To use: ant -Dclasses=classdir <target> where classdir is/will-be a peer directory to src. + Fixed various end user experience issues. Refactoring ! Added static STRICT flag to ScriptScanner to revert to legacy handling of broken ETAGO (</). If STRICT is true, scan according to HTML specification, else if false, scan with quote smart state machine which heuristically *************** *** 66,72 **** --- 67,75 ---- Incorporate patch #1004985 Page.java, by making getCharset() and findCharset() static. Incorporated some speed optimizations based on profiling. + Deprecated node decorators. Filters Added CssSelectorNodeFilter and RegExFilter. Added the filter builder tool. + Added link pattern filters LinkRegexFilter and LinkStringFilter. Enhancement Requests Index: changes.txt =================================================================== RCS file: /cvsroot/htmlparser/htmlparser/docs/changes.txt,v retrieving revision 1.205 retrieving revision 1.206 diff -C2 -d -r1.205 -r1.206 *** changes.txt 13 Mar 2005 15:36:08 -0000 1.205 --- changes.txt 14 Jun 2005 10:37:33 -0000 1.206 *************** *** 16,19 **** --- 16,241 ---- ******************************************************************************* + Release Build 1.5 - 20050614 + -------------------------------- + + 2005-05-15 07:49 derrickoswald + + * resources/htmlparser_checks.xml, + src/org/htmlparser/Attribute.java, + src/org/htmlparser/NodeFactory.java, + src/org/htmlparser/NodeFilter.java, src/org/htmlparser/Remark.java, + src/org/htmlparser/Tag.java, src/org/htmlparser/Text.java, + src/org/htmlparser/beans/BeanyBaby.java, + src/org/htmlparser/beans/FilterBean.java, + src/org/htmlparser/beans/HTMLLinkBean.java, + src/org/htmlparser/beans/HTMLTextBean.java, + src/org/htmlparser/beans/LinkBean.java, + src/org/htmlparser/beans/StringBean.java, + src/org/htmlparser/filters/CssSelectorNodeFilter.java, + src/org/htmlparser/filters/HasAttributeFilter.java, + src/org/htmlparser/filters/HasChildFilter.java, + src/org/htmlparser/filters/HasParentFilter.java, + src/org/htmlparser/filters/HasSiblingFilter.java, + src/org/htmlparser/filters/IsEqualFilter.java, + src/org/htmlparser/filters/LinkRegexFilter.java, + src/org/htmlparser/filters/LinkStringFilter.java, + src/org/htmlparser/filters/NodeClassFilter.java, + src/org/htmlparser/filters/NotFilter.java, + src/org/htmlparser/filters/OrFilter.java, + src/org/htmlparser/filters/RegexFilter.java, + src/org/htmlparser/filters/StringFilter.java, + src/org/htmlparser/filters/TagNameFilter.java, + src/org/htmlparser/http/ConnectionManager.java, + src/org/htmlparser/http/Cookie.java, + src/org/htmlparser/lexer/Cursor.java, + src/org/htmlparser/lexer/InputStreamSource.java, + src/org/htmlparser/lexer/Lexer.java, + src/org/htmlparser/lexer/Page.java, + src/org/htmlparser/lexer/PageAttribute.java, + src/org/htmlparser/lexer/PageIndex.java, + src/org/htmlparser/lexer/Source.java, + src/org/htmlparser/lexer/Stream.java, + src/org/htmlparser/lexer/StringSource.java, + src/org/htmlparser/scanners/ScriptDecoder.java, + src/org/htmlparser/tests/lexerTests/KitTest.java, + src/org/htmlparser/tests/lexerTests/LexerTests.java, + src/org/htmlparser/tests/lexerTests/PageTests.java, + src/org/htmlparser/tests/lexerTests/TagTests.java, + src/org/htmlparser/tests/tagTests/InputTagTest.java, + src/org/htmlparser/tests/utilTests/SortTest.java, + src/org/htmlparser/util/ParserUtils.java: + + Documentation revamp part four. + Remove some checkstyle warnings. + + 2005-05-13 06:44 derrickoswald + + * docs/contributors.html, src/org/htmlparser/sax/XMLReader.java: + + Add parse(InputSource) suggested by Jamie McCrindle. + + 2005-05-10 18:11 derrickoswald + + * src/org/htmlparser/tests/tagTests/SelectTagTest.java: + + Remove Shamil's email address. + + 2005-04-24 13:48 derrickoswald + + * build.xml, docs/main.html, lib/checkstyle-all-3.1.jar, + lib/fit.jar, resources/htmlparser_checks.xml, + src/doc-files/building.html, src/doc-files/overview.html, + src/doc-files/using.html, src/org/htmlparser/Node.java, + src/org/htmlparser/Parser.java, + src/org/htmlparser/PrototypicalNodeFactory.java, + src/org/htmlparser/tags/package.html, + src/org/htmlparser/tests/ParserTest.java, + src/org/htmlparser/visitors/NodeVisitor.java: + + Documentation revamp part three. + Reworked some JavaDoc descriptions. + Added "HTML Parser for dummies" introductory text. + Removed checkstyle.jar and fit.jar (and it's cruft). + + 2005-04-12 07:27 derrickoswald + + * src/org/htmlparser/: Attribute.java, beans/package.html, + lexer/Cursor.java, lexer/InputStreamSource.java, lexer/Lexer.java, + lexer/Page.java, lexer/PageAttribute.java, lexer/Source.java, + lexer/Stream.java, lexer/StringSource.java, lexer/package.html, + lexerapplications/thumbelina/PicturePanel.java, + parserapplications/LinkExtractor.java, + parserapplications/SiteCapturer.java, + parserapplications/StringExtractor.java, + parserapplications/WikiCapturer.java, + parserapplications/package.html, + parserapplications/filterbuilder/Filter.java, + parserapplications/filterbuilder/FilterBuilder.java, + parserapplications/filterbuilder/HtmlTreeCellRenderer.java, + parserapplications/filterbuilder/HtmlTreeModel.java, + parserapplications/filterbuilder/SubFilterList.java, + parserapplications/filterbuilder/layouts/NullLayoutManager.java, + parserapplications/filterbuilder/layouts/VerticalLayoutManager.java, + parserapplications/filterbuilder/wrappers/AndFilterWrapper.java, + parserapplications/filterbuilder/wrappers/HasAttributeFilterWrapper.java, + parserapplications/filterbuilder/wrappers/HasChildFilterWrapper.java, + parserapplications/filterbuilder/wrappers/HasParentFilterWrapper.java, + parserapplications/filterbuilder/wrappers/HasSiblingFilterWrapper.java, + parserapplications/filterbuilder/wrappers/NodeClassFilterWrapper.java, + parserapplications/filterbuilder/wrappers/NotFilterWrapper.java, + parserapplications/filterbuilder/wrappers/OrFilterWrapper.java, + parserapplications/filterbuilder/wrappers/RegexFilterWrapper.java, + parserapplications/filterbuilder/wrappers/StringFilterWrapper.java, + parserapplications/filterbuilder/wrappers/TagNameFilterWrapper.java, + sax/Feedback.java, sax/XMLReader.java: + + Documentation revamp part two. + + 2005-04-10 19:20 derrickoswald + + * bin/beanybaby.bat, bin/beanybaby.cmd, bin/filterbuilder.bat, + bin/filterbuilder.cmd, bin/lexer.bat, bin/lexer.cmd, + bin/linkextractor.bat, bin/linkextractor.cmd, bin/parser.bat, + bin/parser.cmd, bin/sitecapturer, bin/sitecapturer.cmd, + bin/stringextractor.bat, bin/stringextractor.cmd, + bin/thumbelina.bat, bin/thumbelina.cmd, bin/translate.bat, + bin/translate.cmd, src/org/htmlparser/Attribute.java, + src/org/htmlparser/Node.java, src/org/htmlparser/NodeFactory.java, + src/org/htmlparser/PrototypicalNodeFactory.java, + src/org/htmlparser/Remark.java, + src/org/htmlparser/StringNodeFactory.java, + src/org/htmlparser/Tag.java, src/org/htmlparser/Text.java, + src/org/htmlparser/beans/BeanyBaby.java, + src/org/htmlparser/beans/FilterBean.java, + src/org/htmlparser/beans/HTMLLinkBean.java, + src/org/htmlparser/beans/HTMLTextBean.java, + src/org/htmlparser/beans/LinkBean.java, + src/org/htmlparser/beans/StringBean.java, + src/org/htmlparser/beans/package.html, + src/org/htmlparser/filters/AndFilter.java, + src/org/htmlparser/filters/CssSelectorNodeFilter.java, + src/org/htmlparser/filters/HasAttributeFilter.java, + src/org/htmlparser/filters/HasChildFilter.java, + src/org/htmlparser/filters/HasParentFilter.java, + src/org/htmlparser/filters/HasSiblingFilter.java, + src/org/htmlparser/filters/LinkRegexFilter.java, + src/org/htmlparser/filters/LinkStringFilter.java, + src/org/htmlparser/filters/NodeClassFilter.java, + src/org/htmlparser/filters/NotFilter.java, + src/org/htmlparser/filters/OrFilter.java, + src/org/htmlparser/filters/RegexFilter.java, + src/org/htmlparser/filters/TagNameFilter.java, + src/org/htmlparser/http/ConnectionManager.java, + src/org/htmlparser/http/ConnectionMonitor.java, + src/org/htmlparser/http/Cookie.java, + src/org/htmlparser/http/package.html, + src/org/htmlparser/nodeDecorators/AbstractNodeDecorator.java, + src/org/htmlparser/nodeDecorators/DecodingNode.java, + src/org/htmlparser/nodeDecorators/EscapeCharacterRemovingNode.java, + src/org/htmlparser/nodeDecorators/NonBreakingSpaceConvertingNode.java, + src/org/htmlparser/nodeDecorators/package.html, + src/org/htmlparser/nodes/AbstractNode.java, + src/org/htmlparser/nodes/RemarkNode.java, + src/org/htmlparser/nodes/TagNode.java, + src/org/htmlparser/nodes/TextNode.java, + src/org/htmlparser/nodes/package.html, + src/org/htmlparser/parserapplications/filterbuilder/FilterBuilder.java, + src/org/htmlparser/scanners/CompositeTagScanner.java, + src/org/htmlparser/tags/BaseHrefTag.java, + src/org/htmlparser/tags/BodyTag.java, + src/org/htmlparser/tags/CompositeTag.java, + src/org/htmlparser/tags/DoctypeTag.java, + src/org/htmlparser/tags/FormTag.java, + src/org/htmlparser/tags/FrameSetTag.java, + src/org/htmlparser/tags/FrameTag.java, + src/org/htmlparser/tags/HeadTag.java, + src/org/htmlparser/tags/ImageTag.java, + src/org/htmlparser/tags/JspTag.java, + src/org/htmlparser/tags/LabelTag.java, + src/org/htmlparser/tags/LinkTag.java, + src/org/htmlparser/tags/MetaTag.java, + src/org/htmlparser/tags/OptionTag.java, + src/org/htmlparser/tags/ScriptTag.java, + src/org/htmlparser/tags/SelectTag.java, + src/org/htmlparser/tags/TableRow.java, + src/org/htmlparser/tags/TableTag.java, + src/org/htmlparser/tags/TextareaTag.java, + src/org/htmlparser/tags/TitleTag.java, + src/org/htmlparser/tags/package.html, + src/org/htmlparser/tests/lexerTests/KitTest.java, + src/org/htmlparser/tests/lexerTests/LexerTests.java: + + Documentation revamp part one. + Deprecated node decorators. + Added doSemanticAction for Text and Comment nodes. + Added missing sitecapturer scripts. + Fixed DOS batch files to work when called from any location. + + 2005-04-06 06:27 derrickoswald + + * build.xml, docs/release.txt, docs/samples.html: + + End user experience issues: + remove multiple wiki files in zip + fix sample application links + change readme.txt to use Windows line endings + change copyright date + + 2005-04-06 06:20 derrickoswald + + * docs/contributors.html, + src/org/htmlparser/filters/LinkRegexFilter.java, + src/org/htmlparser/filters/LinkStringFilter.java: + + Add link pattern filters submitted by John Derrick. + + 2005-04-04 20:48 derrickoswald + + * src/org/htmlparser/: NodeFilter.java, Parser.java, package.html, + parserapplications/SiteCapturer.java: + + Update javadocs. + Enable SiteCapturer to handle resource names containing spaces. + Integration Build 1.5 - 20050313 -------------------------------- |