Menu

#2 Â weird looking characters if html source code has  

1.0
open
None
2017-09-16
2017-09-16
json born
No

while trying to scrape https://thepiratebay.org/torrent/3211594/

or any other torrent page from thepiratebay.org

using curl to get page and then

str_get_html

messes with

nbsp;

and giving strange characters in output

e.g.

Size:
357.91Â MiBÂ (375299009Â Bytes)

see weird Â

if i use simple_html_dom, then this doesnt happen.

thanks

i even trying str_replace   with blank and then feeding it to str_get_html.., still same issue.

Discussion


Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.