From: Marc G. <mgu...@ya...> - 2005-05-11 08:50:21
HtmlUnit uses NekoHTML to parse the received HTML into a well-formed tree. It cannot be disabled: how else could the HTML be parsed? If you cannot fix your incorrect HTML, you can implement your own WebConnection to change the received HTML before HtmlUnit parses it.

Marc.

Gael Harbonnier wrote:
> I believe HtmlUnit uses Tidy for cleaning HTML pages, but is it possible
> to disable it?
> HtmlUnit doesn't succeed in finding a submit button because of the
> transformed web page.
> Tidy transforms this:
>
> <form>
> <table>
> <table>
> <input ... >
> </table>
> <form>
>
> </table>
>
> into this:
>
> <form>
> <table>
> </table>
> <form>
>
> <table>
> <input ... >
> </table>
>
> and so HtmlUnit doesn't find the input in the form.
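
A minimal sketch of the "change the received HTML before it is parsed" idea, in plain Java. It does not use HtmlUnit's real classes: `PageSource`, `RewritingPageSource` and `reorderFormAndTable` are hypothetical names made up for illustration, and the string replacement is only an example of a fix-up for one assumed kind of broken markup. In HtmlUnit the same decorator shape would live inside a custom WebConnection, whose exact signatures depend on the version in use and are not shown here.

```java
import java.util.function.UnaryOperator;

/** Hypothetical stand-in for whatever fetches a page; NOT HtmlUnit's WebConnection API. */
interface PageSource {
    String fetch(String url);
}

/** Decorator: fetches through the wrapped source, then rewrites the raw HTML. */
final class RewritingPageSource implements PageSource {
    private final PageSource delegate;
    private final UnaryOperator<String> rewrite;

    RewritingPageSource(PageSource delegate, UnaryOperator<String> rewrite) {
        this.delegate = delegate;
        this.rewrite = rewrite;
    }

    @Override
    public String fetch(String url) {
        // The parser only ever sees the rewritten text.
        return rewrite.apply(delegate.fetch(url));
    }
}

final class HtmlFixupSketch {
    /**
     * Example fix-up (an assumption about the broken page): a form opened
     * directly inside a table is moved so that it wraps the table instead.
     */
    static String reorderFormAndTable(String html) {
        return html
                .replace("<table><form>", "<form><table>")
                .replace("</form></table>", "</table></form>");
    }

    public static void main(String[] args) {
        // Fake "server" that always returns the same badly nested markup.
        PageSource raw = url -> "<table><form><input name=\"go\"></form></table>";
        PageSource fixed = new RewritingPageSource(raw, HtmlFixupSketch::reorderFormAndTable);
        System.out.println(fixed.fetch("http://example.invalid/"));
    }
}
```

The rewrite is passed in as a function so that the fix-up can be adapted to each page; a plain string replacement like this is fragile and only suits markup whose defect is known in advance.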