htmlparser-announce Mailing List for HTML Parser (Page 2)
Brought to you by:
derrickoswald
You can subscribe to this list here.
2002 |
Jan
(6) |
Feb
|
Mar
(2) |
Apr
(1) |
May
|
Jun
(4) |
Jul
(3) |
Aug
(3) |
Sep
(1) |
Oct
(3) |
Nov
(2) |
Dec
(5) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2003 |
Jan
(2) |
Feb
(4) |
Mar
(4) |
Apr
(3) |
May
(2) |
Jun
(1) |
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2004 |
Jan
(1) |
Feb
(1) |
Mar
(1) |
Apr
|
May
|
Jun
|
Jul
(1) |
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2005 |
Jan
|
Feb
|
Mar
|
Apr
(1) |
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2006 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
(1) |
Oct
|
Nov
(1) |
Dec
(6) |
2007 |
Jan
|
Feb
(6) |
Mar
(6) |
Apr
(6) |
May
(1) |
Jun
(1) |
Jul
(1) |
Aug
(27) |
Sep
(7) |
Oct
(4) |
Nov
|
Dec
|
2008 |
Jan
|
Feb
(1) |
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
(2) |
2009 |
Jan
|
Feb
|
Mar
(1) |
Apr
(15) |
May
(83) |
Jun
(72) |
Jul
(39) |
Aug
(14) |
Sep
(16) |
Oct
(30) |
Nov
(5) |
Dec
(4) |
2010 |
Jan
|
Feb
(1) |
Mar
(37) |
Apr
(57) |
May
(74) |
Jun
(66) |
Jul
(44) |
Aug
(54) |
Sep
(19) |
Oct
|
Nov
|
Dec
|
2011 |
Jan
|
Feb
|
Mar
|
Apr
(1) |
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
From: Htmlparser-a. <bla...@ho...> - 2007-08-14 07:01:14
|
Need money but not that eager to come looking for hard work? We never cheat the visitors of our online casino – just visit us and see it! All first-time visitors are given a free bonus of up to $1,000 – sweet, ain’t it? casinorfhwuu.cn |
From: Jessi s. <Je...@ma...> - 2007-08-11 12:30:51
|
H.u,g'e N*e,w*s To I.mpact C,Y'T'V Chi_na YouT.V C,o_r p-. Symbo-l: C Y_T_V We h.a.v-e alrea+dy s,e.e n CY,TV's marke+t imp.act b efore clim-+bing to o-v*e.r $2.0+0 w'i,t,h n.e,w.s-. Pre*ss Releas*e: Ch+ina YouTV'+s C'nBoo W-e+b S'i't e Rank-s N*o-.-1 on Mic*roso+ft L-i.v.e Sear,ch Engi*ne Cn_Boo Tra.ffic Increas-e+s 4 9*% O-v,e'r T+w,o Mo nths R,e.a'd t_h e news+, t+hink ab.out t.h,e impac,t, and j*u-m_p on t_h i's fir'st th,ing Tomo.r-row mor*n.ing! $+0.42 is a g.i f-t at t+h+i-s pr*ic,e..... Do y-o.u*r homewo'r,k a+n_d wat'ch t'h*i_s t_rade Mo_nday mor_ning. |
From: deven r. <dev...@po...> - 2007-08-11 09:18:41
|
H-u g'e N e_w-s To Imp-act C'Y-T-V C hina Y_ouTV C o_r-p . Symbol*: C*Y T'V We h+a.v'e a.lready s'e*e-n CY.TV's ma rket impac.t b+efore c *limbing to o.v+e_r $2. 00 w,i*t*h n,e-w_s . Pres,s Release : Chin_a Y-ouTV's C.nBoo W e'b S*i t_e Rank+s N+o .+1 on Mi crosof+t L_i v e Searc*h Eng*ine CnBo+o Tr-affic In._creases 4 9_% O-v e'r T-w-o Mont-hs R-e,a d t'h e new-s, th'ink a bout t-h-e impa ct, and j*u*m_p on t+h-i-s fi.rst th*ing T omo rrow mo_rnin'g! $0.4'2 is a g.i*f t at t-h'i's price.,..*.. Do y o+u*r ho+mewor k a'n.d wa tch t'h,i*s t_rade M onday mornin+g. |
From: Marko L. <Mar...@hz...> - 2007-08-07 13:46:04
|
1c41eA8f |
From: Nav <Ch...@wi...> - 2007-08-01 16:31:21
|
From: Shannon <Sch...@pr...> - 2007-08-01 05:45:05
|
From: mechelle n. <mec...@af...> - 2007-05-12 17:44:27
|
Sehr geehrter Herr, wir haben heute folgende Nachricht erfolgreich für Sie veröffentlicht: boerse invest wachst sehr stark durch zukaufe BJ5N.F Nachrichtenart: Corporate News Datum: 10.05.2007 Eingabezeit: 10.05.2007 10:00:05 Veröffentlichungszeit: 10.05.2007 10:00:08 Verbreitungsnetzwerk: Basis (Siehe am Ende der Mitteilung) Nachricht: Börse Invest Beteiligungs AG / Miscellaneous (Stock: BJ5N.F) 10.05.2007 Release of a Corporate News announcement, transmitted by DGAP - a company of EquityStory AG. The issuer / publisher is solely responsible for the content of this announcement. --------------------------------------------------------------------------- DÜSSELDORF Goldfish Holdings Inc. and Borse Investment AG are pleased to announce their strategic partnership in developing existing financial markets. Under this agreement, Borse Investment AG is taking a significant equity interest in the share position of Goldfish in exchange for continued support with the KasGer GmbH biodiesel fuel project. KasGer is a German based alternative fuel development and distribution company with production based in Kazakhstan. Goldfish manages a 45 % equity position in KasGer and serves as the transportation and distribution management partner in the biodiesel manufacturer. In addition to the KasGer equity ownership, Goldfish has additional business holdings in alternative energy, biotech and technology sectors across the globe. Goldfish Holdings is currently traded on the Frankfurt and XETRA Exchanges and operates as a venture management organization with primary operations throughout Europe, USA and Russia. Tobias Janssen, CEO stated, 'This opportunity to work hand in hand with the Borse Invest continues to allow both companies the opportunity to leverage existing financial markets as well as opening new markets to the collective strenghts of both organizations'. Borse Investment AG is a Swiss based investment management company. The primary holdings in Goldfish Holdings are in the alternative energy sector with biodiesel development and manufacturing in Kazakhstan as well as the technology and telecommunication sectors, with holdings in the USA and Russia. Symbol:BJ5N.F DGAP 10.05.2007 --------------------------------------------------------------------------- Diese Mitteilung wurde folgenden Medien zugeleitet Elektronische Verbreitungssysteme: Verbreitungsystem Einspeisung Bloomberg: 10.05.2007 10:00:08 Reuters: 10.05.2007 10:00:08 vwd: 10.05.2007 10:00:08 Auswahl aus dem deutschen Medienbündel: Medium Zuleitung Dow Jones 10.05.2007 10:00:08 dpa-afx 10.05.2007 10:00:08 dgap.de 10.05.2007 10:00:08 FTD 10.05.2007 10:00:08 |
From: Leisa P. <Le...@7N...> - 2007-04-30 17:14:42
|
Specifies the Windows culture format, which Windows Forms does not directly use. |
From: Slawomir s. <scr...@1a...> - 2007-04-29 09:35:28
|
ANLEGER UHR BJ5N.F!!! DIE RALLYE IST GESTARTET Firma: BOERSE INVEST BETEI WKN : 797639 ISIN : CH0012802093 Markt: Frankfurt Kürzel : BJ5N.F Preis: 1.90 5-Tag Prognose: 3.00 KAUFEN KAUFEN KAUFEN! BJ5N.F ESGESCHAFT FIN UNTER PARI! |
From: kendra m. <ken...@ag...> - 2007-04-28 22:30:26
|
GERMAN EXCHANGES ARE HOT! http://img233.imageshack.us/img233/3130/u08im5.gif Knowing how to manage the processes running on your Linux system is a critical aspect of system administration. |
From: lucatelli k. <ka...@LE...> - 2007-04-24 23:16:35
|
booth |
From: Derrick O. <der...@ro...> - 2007-04-01 02:46:24
|
The HTMLParser project will be 6 years old on April 4, 2007. Congratulations! Over the past few years it has been growing at a compound annual rate of 25% or more. In the month of March, 2007 it reached the 5000 downloads per month milestone. If you haven't yet, you are encouraged to visit the new Maven generated site at http://htmlparser.org/. The HTMLParser project has been offered an opportunity to generate revenue on the SourceForge site by adding Google AdSense advertising. Unless there are strong objections this will be implemented in the near future. Previous attempts to garner revenue through the voluntary donation mechanism of SourceForge have yielded nothing. |
From: Derrick O. <der...@ro...> - 2006-09-17 20:48:20
|
The very popular HTML Parser project (http://sourceforge.net/projects/htmlparser) on Sourceforge has been updated with a new license, new build environment, new repository and a new web site. To identify this radical change, the version has been revved to 2.0. In response to requests from the Apache community, the htmlparser license has changed from GNU Library or Lesser General Public License, to the more Apache friendly Common Public License 1.0 (http://opensource.org/licenses/cpl1.0.txt). As most projects are doing, the htmlparser repository has been changed from CVS to Subversion (http://subversion.tigris.org/). To support automatic integration in other projects, the build environment has changed from ant to Maven 2 (http://maven.apache.org/). This has provided an opportunity to update the web site (http://htmlparser.org). Project SNAPSHOTS and releases should be available soon, bear with us as we work out the kinks. HTML Parser is a Java library used to parse HTML in either a linear or nested fashion. |
From: muneer a. <mu...@ya...> - 2005-04-05 07:30:17
|
confirm 605457 __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com |
From: Derrick O. <Der...@Ro...> - 2004-07-31 23:39:21
|
Patch release Version 1.42 (Release Build Jul 27, 2004) of the most popular HTML parser on SourceForge is now available: http://sourceforge.net/project/showfiles.php?group_id=24399&package_id=17243&release_id=256305 This is the same as Version 1.4 with four bug fixes: #998195 SiteCatpurer just crashed #995744 Translate.decode(String) #995703 Parser Crash #988846 Linkbean getLinks() segmentation fault (duplicate of above) #919738 Text has not been extracted correctly using StringBean #936392 ScriptTag visitor fails for comments with ' (duplicate of above) One bug involved the decoding of URLs with the Translate.decode() method, which was incorrect. Another bug involved the SiteCapturer program failing in the face of an EncodingChangeException. This exception is raised when the <META> tag indicates a different character set that the one assumed at the start of parsing, and retracing the stream yields different characters than those the client has already consumed. The SiteCapturer now handles this exception by resetting the parser and trying again. Another bug involved an overzealous test for "text/XXX" content. This was erroneously assuming that the content was binary and throwing an exception. Experience indicates numerous web servers are returning parseable streams with content types that do not indicate text. The test is removed. The last bug would return wrong nodes when presented with a quote in a comment in a <SCRIPT> tag. |
From: Derrick O. <Der...@Ro...> - 2004-03-16 11:40:15
|
** Version 1.4 of the most popular HTML parser on sourceforge is now available. Ten months of development have culminated in a very robust, extensible product that has been tested, and is already being used, by thousands of developers. While prior versions concentrated on data extraction from web pages, Version 1.4 of the HTMLParser has substantial improvements in the area of transforming web pages, with simplified tag creation and editing, and verbatim toHtml() method output. Significant improvements have also been made in character set handling, providing support for developers worldwide. Changes since Version 1.3 ------------------------- Translation Character entity encoding and decoding has been revamped, leading to higher throughput and less memory churn. Beans The StringBean can now be used as a visitor for parsers external to the bean. Decorators The node decorator package has been added to provide support for the delegate model. Lexer A new lexer i/o subsystem has been added. This provides accurate line number and character position data, tag and attribute names maintain their original case, and attributes maintain their original order. Line numbers reported by tags are now zero based, not one based. The node count for parsing goes up in most cases because whitespace is strictly maintained, i.e. every whitespace (i.e. newline) now counts as a StringNode too. Storage of attributes is now in a Vector which means the element 0 Attribute is actually the name of the tag, rather than having the $TAGNAME entry in a HashTable. The htmllexer.jar is this new i/o subsystem broken out and made JDK 1.1 compliant, the htmlparser.jar, which includes everything in htmllexer.jar, is not necessarily intended to be used in JDK 1.1 environments. Some support for JIS escape sequences has been added. Tags Zero arg tag constructors have been added. Attribute maintenance (add/remove/edit) improved. There is no EndTag class any more. Just a generic tag that responds true to isEndTag(). Improvements to form tag handling, getting <input> and <textarea> tags nested within other tags. Improvements to applet tag handling regarding parameters and codebases. Scanners The concept of scanners has been completely reworked. Applications register tags not scanners to express interest in parsing only some tags. The default is now to parse all tags, which is equivalent to the old registerDOMTags(), so some extra nesting of tags will need to be handled. CompositeTagScanner logic has been improved to try and match unclosed open tags when an unexpected end tag is encountered. This change also moved recursion off the JDK stack, eliminating most StackOverflow exceptions. Also, a CompositeTag's "startTag()" is "this", and the CompositeTagScanner just adds children. The ScriptScanner will now decrypt Microsoft Script Encoder encrypted script tags. The plaintext is available via ScriptTag.getScriptCode(). Filters A new powerful filtering capability has been added, which makes extracting specific tags very easy. Applications New example applications Thumbelina and SiteCapturer. A mainline has been added to the Translate class to encode/decode stdin to stdout. The developers of the HTML Parser hope you enjoy it. http://sourceforge.net/projects/htmlparser Please post any requests for enhancements in version 1.5 to http://sourceforge.net/tracker/?group_id=24399&atid=381402 <http://sourceforge.net/projects/htmlparser> |
From: Derrick O. <Der...@Ro...> - 2004-02-16 23:51:08
|
The most popular HTML Parser on Sourceforge has released the version 1.4 beta as promised, an appropriate 9 months after the version 1.3 final release. It is accessible as Integration Build 1.4-20040216, see: http://sourceforge.net/project/showfiles.php?group_id=24399&package_id=47712 <http://sourceforge.net/project/showfiles.php?group_id=24399&package_id=47712> There were no bugs reported from the last 1000 downloads (although it's hard to tell because Sourceforge has stopped reporting download and pageview stats), and there are only two outstanding tasks left to do. If bug churn remains as quiescent as it is now, the final 1.4 release should be in mid March. Here are the release notes: *Notes:* Integration build. Beta candidate for 1.4 final release. Failing Unit Tests: none Open Bugs: 874000 LinkScanner and FormScanner cannot be used together Pending Bugs: none Changes since Version 1.3 ------------------------- Translation Character entity encoding and decoding has been revamped, leading to higher throughput and less memory churn. Beans The StringBean can now be used as a visitor for parsers external to the bean. Decorators The node decorator package has been added to provide support for the delegate model. Lexer A new lexer i/o subsystem has been added. This provides accurate line number and character position data, tag and attribute names maintain their original case, and attributes maintain their original order. Line numbers reported by tags are now zero based, not one based. The node count for parsing goes up in most cases because whitespace is strictly maintained, i.e. every whitespace (i.e. newline) now counts as a StringNode too. Storage of attributes is now in a Vector which means the element 0 Attribute is actually the name of the tag, rather than having the $TAGNAME entry in a HashTable. The htmllexer.jar is this new i/o subsystem broken out and made JDK 1.1 compliant, the htmlparser.jar, which includes everything in htmllexer.jar, is not necessarily intended to be used in JDK 1.1 environments. Some support for JIS escape sequences has been added. Tags Zero arg tag constructors have been added. Attribute maintenance (add/remove/edit) improved. There is no EndTag class any more. Just a generic tag that responds true to isEndTag(). Improvements to form tag handling, getting <input> and <textarea> tags nested within other tags. Improvements to applet tag handling regarding parameters and codebases. Scanners The concept of scanners has been completely reworked. Applications register tags not scanners to express interest in parsing only some tags. The default is now to parse all tags, which is equivalent to the old registerDOMTags(), so some extra nesting of tags will need to be handled. CompositeTagScanner logic has been improved to try and match unclosed open tags when an unexpected end tag is encountered. This change also moved recursion off the JDK stack, eliminating most StackOverflow exceptions. Also, a CompositeTag's "startTag()" is "this", and the CompositeTagScanner just adds children. Filters A new powerful filtering capability has been added, which makes extracting specific tags very easy. Applications New example applications Thumbelina and SiteCapturer. A mainline has been added to the Translate class to encode/decode stdin to stdout. Bug Fixes --------- 891058 Bug in lexer 865279 Documentation 851882 zero length alt tag causes bug in ImageScanner 839264 toHtml() parse error in Javascripts with "form" keyword 833592 DOCTYPE element is not parsed correctly 832530 empty attribute causes parser to fail 826764 ParserException occurs only when using setInputHTML() instea 825820 Words conjoined 825645 <input> not getting parsed inside table 813838 links not parsed correctly 805598 attribute src in tag img sometimes not correctly parsed 801118 two " characters at the end of an attribute value problem 798554 Applet Tag does not update codebase data 798553 setInputHtml does not set text 798552 Sample for node iterator incorrect 789439 Japanese page causes OutOfMemory Exception 788746 parser crashes on comments like <!-- foobar --!> 786869 LinkExtractor Sample not working 784767 irc://server/channel urls are HTTPLike? 778781 SRC-attribute suppression in IMG-tags 772700 Jsp Tags are not parsed correctly when in quoted attributes 765413 typo 761798 Error reading next element. 757337 Standalone attributes should remain standalone 755929 Empty string attr. value causes attr parsing to be stopped 753012 IMG SRC not parsed v1.3 & v1.4 753003 <IMG> within <A> missed when followed by <MAP> 750117 StackOverFlow while Node-Iteration 749295 Problem Parsing Table 745566 StackOverflowError on select with too many unclosed options 744610 getLink() Erroneous for Relative Links from Files on Windows |
From: Derrick O. <Der...@Ro...> - 2004-01-04 23:11:08
|
The latest integration build of most popular HTML parser on sourceforge, HTML Parser version 1.4, is now available: http://sourceforge.net/project/showfiles.php?group_id=24399&package_id=47712 This can be considered an alpha candidate of the final 1.4 release, and has much improved stability, speed, and HTML page transformation capabilities. We can only go so far by running the 532 unit tests we have, so we are widening the audience in the hope you'll pick it up and put it through it's paces with your own applications. Changes since Version 1.3 ------------------------- Decorators The node decorator package has been added to provide support for the delegate model. Lexer A new lexer i/o subsystem has been added. This provides accurate line number and character position data, tag and attribute names maintain their original case, and attributes maintain their original order. Line numbers reported by tags are now zero based, not one based. The node count for parsing goes up in most cases because whitespace is strictly maintained, i.e. every whitespace (i.e. newline) now counts as a StringNode too. Storage of attributes is now in a Vector which means the element 0 Attribute is actually the name of the tag, rather than having the $TAGNAME entry in a HashTable. The htmllexer.jar is this new i/o subsystem broken out and made JDK 1.1 compliant, the htmlparser.jar, which includes everything in htmllexer.jar, is not necessarily intended to be used in JDK 1.1 environments. Some support for JIS escape sequences has been added. Tags Zero arg tag constructors have been added. Attribute maintenance (add/remove/edit) improved. There is no EndTag class any more. Just a generic tag that responds true to isEndTag(). Improvements to form tag handling, getting <input> and <textarea> tags nested within other tags. Improvements to applet tag handling regarding parameters and codebases. Scanners The concept of scanners has been completely reworked. Applications register tags not scanners to express interest in parsing only some tags. The default is now to parse all tags, which is equivalent to the old registerDOMTags(), so some extra nesting of tags will need to be handled. CompositeTagScanner logic has been improved to try and match unclosed open tags when an unexpected end tag is encountered. This change also moved recursion off the JDK stack, eliminating most StackOverflow exceptions. Also, a CompositeTag's "startTag()" is "this", and the CompositeTagScanner just adds children. Filters A new powerful filtering capability has been added, which makes extracting specific tags very easy. Applications New example applications Thumbelina and SiteCapturer. Derrick Oswald |
From: Andy N. <and...@ut...> - 2003-06-04 15:52:13
|
Hi, Please help me, I've been struggling to extract links for web page with frame I use this code : Parser parser = new Parser(conn,null); parser.registerScanners(); Node[] frames = parser.extractAllNodesThatAre(FrameSetTag.class); for (int i = 0; i<frames.length; i++) { FrameSetTag fst = (FrameSetTag) frames[i]; NodeList list = fst.getFrames(); } } But it doesn't seem to return any frame Andy |
From: Derrick O. <Der...@ro...> - 2003-05-25 23:46:01
|
Version 1.3 of the most popular HTML parser on sourceforge is now available. Four weeks of candidate testing have culminated in a very stable, production level product, with many new user requested features. Features added since 1.2 include: constructor(URLConnection) for POST and exotic GET improved character set handling hierarchically nested tags, i.e. tables scanners for each type of tag java beans for easy integration of text and link fetching 'visitor' patterns Wiki page documentation improved script scanning improved whitespace handling The developers of the HTML Parser hope you enjoy it. |
From: Derrick O. <Der...@ro...> - 2003-05-09 03:47:45
|
I have set up the syncmail script to automatically send email about CVS commit operations to a newly created htmlparser-cvs mailing list. Thus, you can monitor the code repository for bug fixes and enhancements. A new list was chosen over sending to the htmlparser-developer list directly to provide an opt-out mechanism and to keep the traffic light on the htmlparser-developer list. If you want to subscribe to the CVS notification list, go to the lists area: http://sourceforge.net/mail/?group_id=24399 and choose Subscribe/Unsubscribe/Preferences for the htmlparser-cvs list. Derrick |
From: Somik R. <so...@ya...> - 2003-04-20 03:04:37
|
Hi Folks, This week's release is out. From the change log: Integration Build 1.3 - 20030420 -------------------------------- [1] Fixed bug #722046 StringExtractor.extractStrings misses most of the text, change to use a StringBean to dig into tables. [2] add checking in Translate to eliminate bug #722835 StringIndexOutOfBoundsException exception [3] added line-break condition in assertXmlEquals [4] added fit testing framework [5] added parent association for each node [6] added digupStringNode() and findPositionOf(Node) to CompositeTag [7] Fixed bug 723835 in LinkExtractor We have some powerful searching capability with this release. From any node, you can find the parent composite tag, and navigate thru the entire html structure. This is useful in scenarios like : Search for data that lies close to a certain piece of text. e.g. ... <table> <tr> <td> <b>Name:</b><i>John Doe</i> </td> </tr> </table> We can extract John Doe, by using our knowledge of its expected position. If we assume that the contents are inside a table tag, here's what a program could look like: parser.registerScanners(); Node nodes [] = parser.extractAllNodesThatAre(TableTag.class); // Lets assume our data is in the second table TableTag table = (TableTag)nodes[1]; // Find the position of Name. StringNode [] stringNodes = table.digupStringNode("Name"); // We assume that the first node that matched is the one we want. We navigate to its parent Node parentOfName = stringNodes[0].getParent(); // From the parent, we shall find out the position of "Name" int posOfName = parentOfName.findPositionOf(stringNodes[0]); // Its easy now to navigate to John Doe, as we know it is 3 positions away Node expectedName = parentOfName.childAt(posOfName + 3); This can be useful for writing tests for your pages or extracting position based info - new possibilities open up for semantic searches. Regards, Somik |
From: Somik R. <so...@ya...> - 2003-04-14 00:06:48
|
Hi Folks, This week's release contains : Integration Build 1.3 - 20030413 -------------------------------- [1] reimplement StringBean as NodeVisitor, testStringBeanListener now = succeeds [2] Implemented feature request 702541 (Tags created by = CompositeTagScanner now have startLine and endLine information in their TagData) [3] Modified ScriptScanner to allow for subclassing and fixed minor bug [4] Re-architected CompositeTagScanner [5] Fixed Tag scanning bugs, OOM exceptions connected to #4 Thanks to Derrick Oswald for his work on the StringBean and Marc = Novakowski for his work on adding line number support. In this release, the CompositeTagScanner has been totally redesigned - = using principles of Evolutionary Design. The new code is dramatically = simpler and easier to understand. I tried ED on assertXmlEquals last = week and had the same results. The Out of Memory bugs have been fixed - = it will be good to have some freedback. For those who might be interested in ED, before taking each step, I have = taken a snapshot and put it in CVS. Many thanks to Josh Kerievsky for = teaching me how to do ED. Cheers, Somik |
From: Somik R. <so...@ya...> - 2003-04-05 20:02:22
|
Hi Folks, This week's integration release is out. From the change log: Integration Build 1.3 - 20030405 -------------------------------- [1] Fixed bug 712888 (scanning nested custom tags) [2] Redesigned assertXmlEquals() [3] Fixed bug in Parser.removeScanner() [4] Fixed unnecessary addition of ACTION attribute in Form tag [5] Fixed Bullet scanner out of memory exception [6] Replaced scanner HashTable with Map Regards, Somik |
From: Somik R. <so...@ya...> - 2003-03-31 04:43:54
|
Hi Folks, This week's integration release is packed with goodies! From the change log: Integration Build 1.3 - 20030330 -------------------------------- [1] fixed bug (an enhancement really) 694477 quotes in content-type header [2] fix bug #699886 and #707447 by using a buffered stream reader with infinite mark [3] fixed bug in CompositeTagScanner, filter not being set correctly [4] fixed thread safety issue in TagParser (bug 711073) [5] fixed out of memory error when parsing custom composite tags (bug 709152) [6] fixed bug 701159, 696455 - redesigned script scanner. Javascript parsing is now much more robust. As you can see, a lot of bug fixes have gone in. There are three major fixes - one by Derrick Oswald (#2) addresses the charset issue. The parser should now be able to handle different charsets dynamically. We hope you can test this and give us feedback. The second big change is a redesign of the way Javascript is handled by the parser. It had been riddled with problems for some time, so we've changed its internals. The new implementation is much more robust, and hopefully we can get some feedback on that too. There were some thread safety issues (thanks to Joe Robbins for reporting this). These have been addressed in this release, and the parser should be totally thread-safe now. Regards, Somik |