Thread: Re: [Htmlparser-developer] How parse HTML in spanish?

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 422-6466

 --- On Fri 12/13, Derrick Oswald  wrote:From: Derrick Oswald [mailto: Der...@ro...]To: htm...@li...Date: Fri, 13 Dec 2002 08:22:00 -0500Subject: Re: [Htmlparser-developer] How parse HTML in spanish?Can you be more specific about what isn't being extracted correctly?The best way would be to make a test case that shows the problem and submit it as a bug.

For example, when I try a URL as : "http://www.elmundo.es"
then appears the text:

... Text = El mßs famoso y caro vino de pago de Espa?a, el Pingus, no podrß acceder a una denominaci?n de origen propia ...

The correct text would be:

... Text = El más famoso y caro vino de pago de España, el Pingus, no podrá acceder a una denominación de origen propia ...

what happend?

Juan J

_______________________________________________
Join Excite! - http://www.excite.com
The most personalized portal on the Web!

Thread: Re: [Htmlparser-developer] How parse HTML in spanish?

htmlparser-developer