weird looking characters if html source code has
A drop-in replacement for simple html dom
Status: Beta
Brought to you by:
pguardiario
while trying to scrape https://thepiratebay.org/torrent/3211594/
or any other torrent page from thepiratebay.org
using curl to get page and then
str_get_html
messes with
nbsp;
and giving strange characters in output
e.g.
Size: 357.91Â MiBÂ (375299009Â Bytes)
see weird Â
if i use simple_html_dom, then this doesnt happen.
thanks
i even trying str_replace with blank and then feeding it to str_get_html.., still same issue.