[Htmlparser-user] extract flickr tags
From: M M. <me...@gm...> - 2007-10-18 13:40:53
Hello,

I have installed and run the HTML parser, and I can get it to extract all of the text from the following web page:

http://www.flickr.com/photos/mariposa-de-amor/tags/

(mariposa-de-amor is just an example; this could be any user name.)

But I want to extract ONLY the 150 most popular tags. In other words, I want to output a plain text file with the user's 150 most popular tags, like this:

35faves a aberdeen abigfave anawesomeshot aplusphoto athousandwords avianexcellence baby band beach beautiful bedouin belis birds blogthis blueribbonwinner bravo bridge bw castle child children church cindrel city clouds clova cluj colourartaward concert copii criket .............................................

The page source is like this:

<p id="TagCloud">
<a href="/photos/mariposa-de-amor/tags/35faves/" style="font-size: 14px;">35faves</a>
<a href="/photos/mariposa-de-amor/tags/a/" style="font-size: 14px;">a</a>
<a href="/photos/mariposa-de-amor/tags/aberdeen/" style="font-size: 15px;">aberdeen</a>
</p>

So I want only the words 35faves, aberdeen, .........

I could try to get the text between <p id="TagCloud"> and </p>, but that does not work well, because I also extract " <a href............ " and I don't want this additional text in my output. I have tried this using Trimtags, and in any case it doesn't work for me.

Does anybody know of an existing method with which I could do this, or can you offer any advice? All I want is to extract a list of one user's tags from a Flickr (or any other) web page.

Thanks a lot,
Myriam
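
For illustration, here is a minimal sketch of the extraction described above: take only the link text inside the <p id="TagCloud"> block and discard the surrounding <a href...> markup. It deliberately uses plain java.util.regex from the JDK rather than the HTML Parser library's own API, and it assumes the page has already been downloaded into a String and that the markup looks like the snippet quoted in the message. The class name TagCloudExtractor and the method extractTags are hypothetical names chosen for this example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class TagCloudExtractor {
    // Captures everything between <p id="TagCloud"> and the next </p>.
    // DOTALL lets the block span several lines.
    private static final Pattern CLOUD = Pattern.compile(
            "<p\\s+id=\"TagCloud\"[^>]*>(.*?)</p>",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Captures the visible text of each <a ...>text</a> link.
    private static final Pattern LINK = Pattern.compile(
            "<a\\b[^>]*>([^<]*)</a>", Pattern.CASE_INSENSITIVE);

    /** Returns the tag names found in the TagCloud block, in page order. */
    public static List<String> extractTags(String html) {
        List<String> tags = new ArrayList<>();
        Matcher cloud = CLOUD.matcher(html);
        if (!cloud.find()) {
            return tags; // no tag cloud on this page
        }
        Matcher link = LINK.matcher(cloud.group(1));
        while (link.find()) {
            String text = link.group(1).trim();
            if (!text.isEmpty()) {
                tags.add(text);
            }
        }
        return tags;
    }

    public static void main(String[] args) {
        // Sample input copied from the page source quoted in the message.
        String html = "<p id=\"TagCloud\">\n"
                + "<a href=\"/photos/mariposa-de-amor/tags/35faves/\" style=\"font-size: 14px;\">35faves</a>\n"
                + "<a href=\"/photos/mariposa-de-amor/tags/a/\" style=\"font-size: 14px;\">a</a>\n"
                + "<a href=\"/photos/mariposa-de-amor/tags/aberdeen/\" style=\"font-size: 15px;\">aberdeen</a>\n"
                + "</p>";
        // Prints: 35faves a aberdeen
        System.out.println(String.join(" ", extractTags(html)));
    }
}
```

Redirecting the program's standard output to a file would give the plain text list asked for. A regular expression is only adequate here because the quoted markup is flat and regular; if the links contained nested tags or the page layout changed, a real HTML parser that selects the link nodes under the TagCloud element would be the more robust choice.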