Yes, it's Latin-1. It's documented there:
Convert html entities into their ISO8859-1 equivalents
I'll add this as a decode_entities_utf8 helper method. Eduardo, can you resubmit the file with a copyright header? Preferably using LGPL as the rest of the project.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I have made some little changes and added the copyright header.
I forgot to login before I submit this and it seems that I am not able to add more attachments, so I uploaded the new file to my website.
Download here: http://eduardo38.netne.net/decode_entities_utf8.cpp
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
The source
Hi Robson,
This patch makes sense since it encodes decoded entities as utf8. What is the current encoding? Is it latin1?
By the way, the code this patch is supposed to replace is the one at html/utils.
View and moderate all "bugs Discussion" comments posted by this user
Mark all as spam, and block user from posting to "Bugs"
This also decodes hexadecimal entities.
Yes, it's Latin-1. It's documented there:
Convert html entities into their ISO8859-1 equivalents
I'll add this as a decode_entities_utf8 helper method. Eduardo, can you resubmit the file with a copyright header? Preferably using LGPL as the rest of the project.
View and moderate all "bugs Discussion" comments posted by this user
Mark all as spam, and block user from posting to "Bugs"
I have made some little changes and added the copyright header.
I forgot to login before I submit this and it seems that I am not able to add more attachments, so I uploaded the new file to my website.
Download here: http://eduardo38.netne.net/decode_entities_utf8.cpp
View and moderate all "bugs Discussion" comments posted by this user
Mark all as spam, and block user from posting to "Bugs"
I made other entitie decoder: http://eduardo38.netne.net/decode_entities_utf8_2.cpp