Re: [mod-security-users] Little problems with encoding type
Brought to you by:
victorhora,
zimmerletw
|
From: Ivan R. <iv...@we...> - 2005-07-15 11:51:26
|
Daniel Fdez. Bleda wrote: > Hello, > > I'm having some problems parsing the HTML with libxml2. > I thought I could use the request_rec->content_encoding but I see that > where some web pages specify UTF-8 as encoding this info in not in > this structure My guess that field is populated only when the content encoding information is in the HTTP headers. > when I'm in sec_filter_out() so I don't know how now > the encription type to use (or use some as "ISO-8859-1" by default). The correct procedure may be to start parsing using the default encoding. If a change of the encoding is encountered, go back and start all over. -- Ivan Ristic Apache Security (O'Reilly) - http://www.apachesecurity.net Open source web application firewall - http://www.modsecurity.org |