[Htmlparser-user] Re: Another oddity
Brought to you by:
derrickoswald
From: Somik R. <so...@ya...> - 2002-11-23 19:17:32
|
Hi Steve, Im taking this to the list.. You cant use URLEncoder bcos URLEncoder replaces spaces with +. = However, what works on the browser is %20. OTOH, there are still = problems with replacing spaces with %20 when you come to mailto links.=20 For your case though, the parser now checks if a url has spaces, and = if so, it performs the correct encoding.=20 The latest codebase is in CVS.=20 Cheers, Somik ----- Original Message -----=20 From: Stephen J. Harrington=20 To: Somik Raha=20 Sent: Thursday, November 21, 2002 10:23 AM Subject: Another oddity Somik,=20 Some of the URL's we parse have spaces in them, e.g = http://www.cnn.com/This is a test.html=20 I know this in non-standard, but I just parse what I am told to = parse.....=20 My first response was to URLEncode the URL (URLEncoder.encode()) and = pass it to the htmlparser. This didn't work!?!=20 So I then wrote a quick utility to replace all spaces with the correct = URL encoded character and this worked. Any idea why the parser would = not like a URLEncoded URL?=20 --stephen=20 |