I am trying to scrape https://thepiratebay.org/torrent/3211594/
(or any other torrent page from thepiratebay.org).
I use curl to fetch the page and then pass it to
str_get_html,
which messes up the
&nbsp;
entities and gives strange characters in the output,
for example:
Size:
357.91Â MiBÂ (375299009Â Bytes)
Note the stray Â characters.
If I use simple_html_dom, this doesn't happen.
Thanks.
I even tried using str_replace to swap the &nbsp; for a blank before feeding the page to str_get_html, but I still get the same issue.
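
For reference, the output above looks like what you get when the UTF-8 bytes of a non-breaking space are read as ISO-8859-1. Here is a minimal sketch of that effect, assuming (I have not confirmed this) that the page is served as UTF-8 and that something in my pipeline treats it as ISO-8859-1. The helper name misread_as_latin1 is made up for illustration, and the snippet needs the mbstring extension.

```php
<?php
// Hypothetical helper: simulate UTF-8 bytes being misread as ISO-8859-1
// and then re-encoded to UTF-8 for display.
function misread_as_latin1(string $utf8Bytes): string
{
    return mb_convert_encoding($utf8Bytes, 'UTF-8', 'ISO-8859-1');
}

// Assumption: &nbsp; arrives as U+00A0, which is the two bytes 0xC2 0xA0 in UTF-8.
$nbsp = "\xC2\xA0";
$size = "357.91" . $nbsp . "MiB";

echo bin2hex($nbsp), "\n";                    // c2a0
echo bin2hex(misread_as_latin1($nbsp)), "\n"; // c382c2a0
echo misread_as_latin1($size), "\n";          // shows an "Â" before "MiB"
```

The byte 0xC2 on its own is "Â" in ISO-8859-1, which would explain why one shows up in front of every non-breaking space.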