magpierss-general Mailing List for MagpieRSS (Page 4)
Status: Beta
Brought to you by:
kellan
You can subscribe to this list here.
2002 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
(2) |
Nov
(6) |
Dec
|
---|---|---|---|---|---|---|---|---|---|---|---|---|
2003 |
Jan
(3) |
Feb
|
Mar
|
Apr
(4) |
May
(10) |
Jun
(8) |
Jul
(6) |
Aug
(6) |
Sep
(6) |
Oct
(2) |
Nov
(7) |
Dec
(10) |
2004 |
Jan
(24) |
Feb
(3) |
Mar
(6) |
Apr
|
May
|
Jun
(8) |
Jul
(5) |
Aug
(8) |
Sep
(9) |
Oct
(17) |
Nov
(20) |
Dec
(27) |
2005 |
Jan
(45) |
Feb
(44) |
Mar
(34) |
Apr
(22) |
May
(39) |
Jun
(16) |
Jul
(20) |
Aug
(35) |
Sep
(34) |
Oct
(56) |
Nov
(48) |
Dec
(10) |
2006 |
Jan
(18) |
Feb
(17) |
Mar
(21) |
Apr
(27) |
May
(9) |
Jun
(11) |
Jul
(9) |
Aug
(13) |
Sep
(10) |
Oct
(4) |
Nov
(2) |
Dec
(22) |
2007 |
Jan
(11) |
Feb
(18) |
Mar
(6) |
Apr
(4) |
May
(10) |
Jun
(13) |
Jul
(3) |
Aug
(20) |
Sep
(7) |
Oct
(1) |
Nov
(4) |
Dec
|
2008 |
Jan
(4) |
Feb
(2) |
Mar
|
Apr
(13) |
May
(5) |
Jun
(6) |
Jul
(2) |
Aug
(3) |
Sep
(2) |
Oct
(3) |
Nov
(2) |
Dec
(3) |
2009 |
Jan
(2) |
Feb
(3) |
Mar
|
Apr
(1) |
May
(1) |
Jun
(10) |
Jul
(7) |
Aug
(2) |
Sep
(1) |
Oct
|
Nov
|
Dec
(3) |
2010 |
Jan
|
Feb
(1) |
Mar
|
Apr
(2) |
May
(1) |
Jun
|
Jul
(2) |
Aug
(1) |
Sep
(2) |
Oct
|
Nov
|
Dec
|
2011 |
Jan
(6) |
Feb
(6) |
Mar
|
Apr
|
May
|
Jun
(1) |
Jul
|
Aug
(1) |
Sep
|
Oct
(2) |
Nov
|
Dec
|
2012 |
Jan
|
Feb
(1) |
Mar
|
Apr
|
May
|
Jun
|
Jul
(1) |
Aug
|
Sep
(1) |
Oct
|
Nov
|
Dec
|
2013 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
(1) |
Jul
|
Aug
(1) |
Sep
|
Oct
|
Nov
|
Dec
|
2014 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
(1) |
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2016 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
(1) |
Sep
|
Oct
|
Nov
(1) |
Dec
|
2020 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
(1) |
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
From: Gareth D. <gar...@he...> - 2008-08-13 06:51:55
|
Hi there, or bonjour, I think I've got a solution for you. Basically it _is_ an encoding issue as you thought. The feed is encoded in UTF-8. I had to set my own content encoding to UTF-8, and then instead of using fetch_rss, I created a magpie object directly (you have to get the feed contents yourself beforehand to do it this way). This lets you tweak the input and output encodings as below. So at the top somewhere: header("Content-Type: text/html; charset=UTF-8"); then I CURL'ed in the feed contents: $ch = curl_init(); // set URL and other appropriate options curl_setopt($ch, CURLOPT_URL, $url); //curl_setopt($ch, CURLOPT_PROXY, "proxy.here.com'); //if there is a proxy, uncomment this line and set as appropriate // curl_setopt($ch, CURLOPT_PROXYPORT, 80);//if there is a proxy, uncomment this line and set (probably 80) curl_setopt($ch, CURLOPT_RETURNTRANSFER, true); curl_setopt($ch, CURLOPT_HEADER, 0); curl_setopt($ch, CURLOPT_TIMEOUT, 30); // grab URL and pass it to the browser $str = curl_exec($ch); // close cURL resource, and free up system resources curl_close($ch); (Recommend you wrap this in a function - or just get hold of the feed contents in your own way). Then, finally, instead of fetch_rss: $rss = new MagpieRSS($str, 'UTF-8', 'UTF-8', true); That seemd to sort it. Cheers et bon chance, G Gareth Dart Web Developer Higher Education Statistics Agency (HESA) 95 Promenade, Cheltenham, Gloucestershire GL50 1HZ T 01242 211128 F 01242 211122 W www.hesa.ac.uk <http://www.hesa.ac.uk/> ________________________________ From: mag...@li... [mailto:mag...@li...] On Behalf Of Thibault Richard Sent: Tuesday 12 August 2008 16:32 To: mag...@li... Subject: [Magpierss-general] Problem with quote (') characters Hello, I've a problem with quote characters using magpierss. I try to parse a feed generated by Wordpress : http://blog.dimitripiot.com/wp-feed.php And I try to display it on a web page : http://www.dimitripiot.com/index.php (on the left side) If you take a look to the display, you 'll see that the quote characters (') are not well parsed. My code is really basic and is working (except this quote problem) : $rss = fetch_rss('http://blog.dimitripiot.com/wp-feed.php'); if ( $rss) { $items = array_slice($rss->items, 0, $nombre_affiche_blog); echo "<div class='title'>Le blog</div>"; echo "<div style=\"padding:10px; color:#FFFF99\">Visitez également le blog de Dimitri Piot pour plus d'infos au jour le jour... </div>"; echo "<ul>\n"; foreach ($items as $item) { $href = $item['link']; $title = $item['title']; echo "<li><a href=\"$href\" target=\"_blank\">$title</a></li>\n"; } echo "</ul>"; } print "</div>\n"; I don't think the problem is on the feed (which is valid) and I have no idea about the problem I've already tried to change the encoding from ISO-8859-1 to UTF-8 without any success. Is it a known problem ? Thank you in advance for your help Regards Thibault ______________________________________________________________ This incoming email was virus scanned for HESA by MessageLabs. ______________________________________________________________ _____________________________________________________________________ Higher Education Statistics Agency Ltd (HESA) is a company limited by guarantee, registered in England at 95 Promenade Cheltenham GL50 1HZ. Registered No. 2766993. The members are Universities UK and GuildHE. Registered Charity No. 1039709. Certified to ISO 9001 and BS 7799. HESA Services Ltd (HSL) is a wholly owned subsidiary of HESA, registered in England at the same address. Registered No. 3109219. _____________________________________________________________________ This outgoing email was virus scanned for HESA by MessageLabs. _____________________________________________________________________ |
From: Thibault R. <TRD@SODEXO-PASS.BE> - 2008-08-12 15:32:02
|
Hello, I've a problem with quote characters using magpierss. I try to parse a feed generated by Wordpress : http://blog.dimitripiot.com/wp-feed.php And I try to display it on a web page : http://www.dimitripiot.com/index.php (on the left side) If you take a look to the display, you 'll see that the quote characters (') are not well parsed. My code is really basic and is working (except this quote problem) : $rss = fetch_rss('http://blog.dimitripiot.com/wp-feed.php'); if ( $rss) { $items = array_slice($rss->items, 0, $nombre_affiche_blog); echo "<div class='title'>Le blog</div>"; echo "<div style=\"padding:10px; color:#FFFF99\">Visitez également le blog de Dimitri Piot pour plus d'infos au jour le jour... </div>"; echo "<ul>\n"; foreach ($items as $item) { $href = $item['link']; $title = $item['title']; echo "<li><a href=\"$href\" target=\"_blank\">$title</a></li>\n"; } echo "</ul>"; } print "</div>\n"; I don't think the problem is on the feed (which is valid) and I have no idea about the problem I've already tried to change the encoding from ISO-8859-1 to UTF-8 without any success. Is it a known problem ? Thank you in advance for your help Regards Thibault |
From: Gavin H. <net...@gm...> - 2008-07-09 14:29:19
|
This is becoming a fairly regular issue when I parse craigslist feeds. In most cases, the feed validates (maybe with warnings about \x95 characters), but failes to parse. I know it says the line number it fails on, but when I've checked a feed repeatedly as new posts come in, it never seems to be the same line. Does it ignore newlines within a datum? (ie - <description>blah\nblah\nblah\n</description> to be one line instead of 4) Because of the regular changing nature of the craigslist feeds, I created copies of one of the error files, as well as the feedvalidator results, you can see them here: http://www.subnets.org/tmp/index.rss http://www.subnets.org/rss_results.html -- In God we trust, Everyone else must have an x.509 certificate. |
From: Ugur S. <ugu...@gm...> - 2008-07-04 14:59:19
|
I have problem with parsing atom feed with magpie. Category attribute term is not assigned properly in the array. Has anyone come up with a solution for that? Regards Ugur -- www.vesaire.com |
From: Neerav <nee...@us...> - 2008-06-15 08:41:49
|
> ** Attempt to retrieve the <name> > <Attempt 2> > $recdesc = str_replace("&","&",$item[author]); First, the key "author" needs to be quoted, else it is first assumed to be a constant. In this instance, it's probably that you need to retrieve $item['author']['name'] The best way to figure this out is to dump the data: print_r($item); It will give you a much clearer idea of the item's structure. If you are looking at the output in a browser, you'll have to look at the source code for the nice print_r formatting. > ** Attempt to retrive the <div> > <Attempt 3> > $jobdesc = str_replace("&","&",$item[content]); Again, "content" needs to be quoted. Before trying a regex, make sure you are getting/using the right key by dumping the item with a print_r -- Neerav |
From: Neerav <nee...@us...> - 2008-06-13 13:25:02
|
> Feed two, ISSUE 1 > > I am having trouble retrieveing the bold data from the following source > > Source: > ----------------------------------------------------- > <description><![CDATA[ <html> <head> <title>Latest [snip] > <div id="rssDes"><strong>Job ad summary: </strong><br>If your looking for a > change I can help you open the door of opportunity</div> <div [snip] > </html>]]> </description> > ----------------------------------------------------- What you are trying to retrieve is only a small part of the description which has html markup in it. The div itself is not an xml entry. You should try along the lines of: $desc = str_replace("&","&",$item[description]); // regex not tested, but should give a general idea if (preg_match("@<div id=\"rssDes\">(.*?)</div>@s", $desc, $matches) ) { $mydiv = $matches[1]; // or use $desc if you really want to get rid of rest of description echo $mydiv; } > > Feed three, ISSUE 2 This is probably going to be similar to issue #1 -- Neerav |
From: <zer...@ho...> - 2008-06-13 03:49:46
|
I wondering if someone might be nice enough to help me with a code issue relating to Magpie rss. I have spent countless hours searching on google and coding and have not managed to come up with the correct solution. Currently I pull in three feeds the method I use to retrieve the description from the first feed is as follows: Source ------------------------------------------------------ <description> This is a basic description </description> ------------------------------------------------------ php code used to retrieve the description: ---------------------------------------------------- $desc = str_replace("&","&",$item[description]);----------------------------------------------------- This method works fine for one of the three feeds but the other two feeds have a different format. Feed two, ISSUE 1 I am having trouble retrieveing the bold data from the following source Source: ----------------------------------------------------- <description><![CDATA[ <html> <head> <title>Latest jobs</title> </head> <body> <div id="rssFeed" style="margin-top:15px; margin-bottom:8px"> <div id="rssListedDate"><strong>Date & time listed: </strong>13/06/2008 1:11:19 PM</div> <div id="rssJobSummary" style="margin-top:10px"> <div id="rssDes"><strong>Job ad summary: </strong><br>If your looking for a change I can help you open the door of opportunity</div> <div id="rssSalary" style="margin-top:10px"> </div> <div id="rssNewFeedLink" style="margin-top:18px"> <a href="http://www.website.com">Create a new feed with different search criteria</a> </div> </div> </div> </body> </html>]]> </description> ----------------------------------------------------- php code I have tried ----------------------------------------------------- $desc = str_replace("&","&",$item[description][rssDes); $desc = str_replace("&","&",$item['description']['rssDes'); $desc = str_replace("&","&",$item[Div][rssDes); ----------------------------------------------------- Feed three, ISSUE 2 I am having trouble retrieveing the <name> and <content> (Bold content below) from the following code Source: ------------------------------------------------------------------ <title>Environmental Engineer - Water - Brisbane</title> <author><name>Skye Recruitment</name></author> <category term="Scientific" /> <content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml">Environmental Engineer - Water - Brisbane - Great mentoring & support - Prestigious projects - Excellent salary & benefits - Finish early on a Friday!</div></content> ------------------------------------------------------------------ I have tried several methods and can not seem to get the syntex right. I appreaciate you taking the time to help. zeroanarchy _________________________________________________________________ Be part of history. Take part in Australia's first e-mail archive with Email Australia. http://emailaustralia.ninemsn.com.au |
From: Gareth D. <gar...@he...> - 2008-06-11 11:02:35
|
The link seems to work fine in its' truncated form. I suspect that the problem is that technically the full-length URL you give below is malformed, or at least malformed to the point where Magpie doesn't like it (it has an equals sign where it probably shouldn't: /ref=tag_...etc), and therefore is truncating it. You might want to try CURLing in the xml of the feed and pre-processing it before you give it to magpie to replace /ref= with /ref__EQUALS__ or something similar, then see if you get the full length link coming through. You can then swap the __EQUALS__ back for = when you're doing whatever it is you're doing to the magpie output. HTH, Gareth Gareth Dart Web Developer Higher Education Statistics Agency (HESA) 95 Promenade, Cheltenham, Gloucestershire GL50 1HZ T 01242 211128 F 01242 211122 W www.hesa.ac.uk -----Original Message----- From: mag...@li... [mailto:mag...@li...] On Behalf Of Brad Stewart Sent: Tuesday 10 June 2008 21:49 To: mag...@li... Subject: [Magpierss-general] Link Parsing Issues I am trying to parse an rss feed from amazon. Here is the xml: http://www.amazon.com/rss/tag/wii/new?tag=fcom-20 This is the raw xml for an item: <item> <link> http://www.amazon.com/TAKE-TWO-Table-Tennis-Wii/dp/B000XWJUBU/ref=tag_rs n_rs_edpp_url?ie=UTF8&tag=fcom-20&creative=381421 </link> </item> The problem i am having is that the link being returned by magpie is: [link] => http://www.amazon.com/TAKE-TWO-Table-Tennis-Wii/dp/B000XWJUBU Or in other words the link is getting truncated. Anyone know how to fix this? Thanks. ------------------------------------------------------------------------ - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php _______________________________________________ Magpierss-general mailing list Mag...@li... https://lists.sourceforge.net/lists/listinfo/magpierss-general ______________________________________________________________ This incoming email was virus scanned for HESA by MessageLabs. ______________________________________________________________ _____________________________________________________________________ Higher Education Statistics Agency Ltd (HESA) is a company limited by guarantee, registered in England at 95 Promenade Cheltenham GL50 1HZ. Registered No. 2766993. The members are Universities UK and GuildHE. Registered Charity No. 1039709. Certified to ISO 9001 and BS 7799. HESA Services Ltd (HSL) is a wholly owned subsidiary of HESA, registered in England at the same address. Registered No. 3109219. _____________________________________________________________________ This outgoing email was virus scanned for HESA by MessageLabs. _____________________________________________________________________ |
From: Brad S. <pu...@br...> - 2008-06-10 20:48:44
|
I am trying to parse an rss feed from amazon. Here is the xml: http://www.amazon.com/rss/tag/wii/new?tag=fcom-20 This is the raw xml for an item: <item> <link> http://www.amazon.com/TAKE-TWO-Table-Tennis-Wii/dp/B000XWJUBU/ref=tag_rsn_rs_edpp_url?ie=UTF8&tag=fcom-20&creative=381421 </link> </item> The problem i am having is that the link being returned by magpie is: [link] => http://www.amazon.com/TAKE-TWO-Table-Tennis-Wii/dp/B000XWJUBU Or in other words the link is getting truncated. Anyone know how to fix this? Thanks. |
From: Rods L. <rod...@gm...> - 2008-06-02 16:45:04
|
Hi I am a new user. How can I install this? Whai is important to do? Thanks |
From: Rods L. <rod...@gm...> - 2008-05-29 14:14:23
|
Hi I am a new user. How can I install this? Whai is important to do? Thanks Rods |
From: Gareth D. <gar...@he...> - 2008-05-23 09:33:52
|
Hi, There's some good, concise dox here: http://www.ilovejackdaniels.com/cheat-sheets/php-cheat-sheet/ Also, the appropriately named website http://www.regular-expressions.info/ has a ton and a half of tutorials on regexes, although not specific to php's PCRE regexes it's still useful. Forward slashes need escaping, maybe tag brackets too if I remember rightly. Cheers, G Gareth Dart Web Developer Higher Education Statistics Agency (HESA) 95 Promenade, Cheltenham, Gloucestershire GL50 1HZ T 01242 211128 F 01242 211122 W www.hesa.ac.uk <http://www.hesa.ac.uk/> ________________________________ From: S T [mailto:mo...@ya...] Sent: Friday 23 May 2008 00:49 To: Gareth Dart; mag...@li... Subject: RE: [Magpierss-general] $item[description] returns "image +htmlformating + description". How do I parse? Thank you very much. This helped me solve 50% of the problem. I need to extract all of the text between </a> and </p>. From the Example #3 in the php.net link you sent, I tried several variations of this line... preg_match('@^(?:</a>)?([^(?:</p>)]+)@i', $description <mailto:'@^(?:</a>)?([^(?:</div>)]+)@i', $description> , $matches3); I think the problem might have to do with escaping characters correctly. I didn't very much documentation on that. Any ideas? Thanks again. Gareth Dart wrote: Hola, If the description node has a predictable structure, you'll probably be able to extract the data you need with regular expressions: see http://uk3.php.net/manual/en/function.preg-match.php for a starting point if regexes are not something you've come across before. They have a bit of a learning curve, but are an extremely valuable tool when parsing strings. For your example below, the following (untested) call would probably work to extract the a's href: preg_match("/a\shref\s?=\s?[\'\"]([^\'\"]*)[\'\"]/i", $description, $matches); //the href of the link should be returned in $matches[1] ...and you should be able to do something similar to extract the img src. If there's only one image, it can be as simple as: preg_match("/src\s?=\s?[\'\"]([^\'\"]*)[\'\"]/i", $description, $matches); //the src of the image should be returned in $matches[1] Just calling strip_tags() on the description node will extract the text - this probably has less overhead than a regex function. Hope this is helpful, G Gareth Dart Web Developer Higher Education Statistics Agency (HESA) 95 Promenade, Cheltenham, Gloucestershire GL50 1HZ T 01242 211128 F 01242 211122 W www.hesa.ac.uk <http://www.hesa.ac.uk/> ________________________________ From: mag...@li... [mailto:mag...@li...] On Behalf Of S T Sent: Wednesday 21 May 2008 19:38 To: mag...@li... Subject: [Magpierss-general] $item[description] returns "image +htmlformating + description". How do I parse? Hello, I am successfully using the Magpie code to parse a spanish xml file. So far so good. You can see my progress here: http://www.hispanocarro.com/test/index3.php There are some CSS hacks there that some listserve members might find useful. I need to take this a step further though. When I request the description node.... $description = $item[description]; ...I get a chunk of formatted html in return. It looks like this... <p> <a href="myImageURL"> <img align="left" alt="photo" border="0" height="myHeight" src="myImageSrc" width="myWidth" /></a> MyDescription </p> <br clear="all" /> I really need to break this down into it's individual elements. How can I request "myImageURL", "myImageSrc" etc. so that i can have more control over the design of my pages? Thank you in advance. John ------------------------------------------------------------------------ ------------ Get Flash Components and Templates at www.thermalstudios.com <http://www.thermalstudios.com/> ______________________________________________________________ This incoming email was virus scanned for HESA by MessageLabs. ______________________________________________________________ _____________________________________________________________________ Higher Education Statistics Agency Ltd (HESA) is a company limited by guarantee, registered in England at 95 Promenade Cheltenham GL50 1HZ. Registered No. 2766993. The members are Universities UK and GuildHE. Registered Charity No. 1039709. Certified to ISO 9001 and BS 7799. HESA Services Ltd (HSL) is a wholly owned subsidiary of HESA, registered in England at the same address. Registered No. 3109219. _____________________________________________________________________ This outgoing email was virus scanned for HESA by MessageLabs. _____________________________________________________________________ ______________________________________________________________ This incoming email was virus scanned for HESA by MessageLabs. ______________________________________________________________ _____________________________________________________________________ Higher Education Statistics Agency Ltd (HESA) is a company limited by guarantee, registered in England at 95 Promenade Cheltenham GL50 1HZ. Registered No. 2766993. The members are Universities UK and GuildHE. Registered Charity No. 1039709. Certified to ISO 9001 and BS 7799. HESA Services Ltd (HSL) is a wholly owned subsidiary of HESA, registered in England at the same address. Registered No. 3109219. _____________________________________________________________________ This outgoing email was virus scanned for HESA by MessageLabs. _____________________________________________________________________ |
From: S T <mo...@ya...> - 2008-05-22 23:48:54
|
Thank you very much. This helped me solve 50% of the problem. I need to extract all of the text between </a> and </p>. From the Example #3 in the php.net link you sent, I tried several variations of this line... preg_match()?([^(?: )]+)@i', $description">'@^(?:</a>)?([^(?:</p>)]+)@i', $description, $matches3); I think the problem might have to do with escaping characters correctly. I didn't very much documentation on that. Any ideas? Thanks again. Gareth Dart wrote: Hola, If the description node has a predictable structure, you'll probably be able to extract the data you need with regular expressions: see http://uk3.php.net/manual/en/function.preg-match.php for a starting point if regexes are not something you've come across before. They have a bit of a learning curve, but are an extremely valuable tool when parsing strings. For your example below, the following (untested) call would probably work to extract the a's href: preg_match("/a\shref\s?=\s?[\'\"]([^\'\"]*)[\'\"]/i", $description, $matches); //the href of the link should be returned in $matches[1] ...and you should be able to do something similar to extract the img src. If there's only one image, it can be as simple as: preg_match("/src\s?=\s?[\'\"]([^\'\"]*)[\'\"]/i", $description, $matches); //the src of the image should be returned in $matches[1] Just calling strip_tags() on the description node will extract the text - this probably has less overhead than a regex function. Hope this is helpful, G Gareth Dart Web Developer Higher Education Statistics Agency (HESA) 95 Promenade, Cheltenham, Gloucestershire GL50 1HZ T 01242 211128 F 01242 211122 W www.hesa.ac.uk --------------------------------- From: mag...@li... [mailto:mag...@li...] On Behalf Of S T Sent: Wednesday 21 May 2008 19:38 To: mag...@li... Subject: [Magpierss-general] $item[description] returns "image +htmlformating + description". How do I parse? Hello, I am successfully using the Magpie code to parse a spanish xml file. So far so good. You can see my progress here: http://www.hispanocarro.com/test/index3.php There are some CSS hacks there that some listserve members might find useful. I need to take this a step further though. When I request the description node.... $description = $item[description]; ...I get a chunk of formatted html in return. It looks like this... <p> <a href="myImageURL"> <img align="left" alt="photo" border="0" height="myHeight" src="myImageSrc" width="myWidth" /></a> MyDescription </p> <br clear="all" /> I really need to break this down into it's individual elements. How can I request "myImageURL", "myImageSrc" etc. so that i can have more control over the design of my pages? Thank you in advance. John ------------------------------------------------------------------------------------ Get Flash Components and Templates at www.thermalstudios.com ______________________________________________________________ This incoming email was virus scanned for HESA by MessageLabs. ______________________________________________________________ _____________________________________________________________________ Higher Education Statistics Agency Ltd (HESA) is a company limited by guarantee, registered in England at 95 Promenade Cheltenham GL50 1HZ. Registered No. 2766993. The members are Universities UK and GuildHE. Registered Charity No. 1039709. Certified to ISO 9001 and BS 7799. HESA Services Ltd (HSL) is a wholly owned subsidiary of HESA, registered in England at the same address. Registered No. 3109219. _____________________________________________________________________ This outgoing email was virus scanned for HESA by MessageLabs. _____________________________________________________________________ |
From: Gareth D. <gar...@he...> - 2008-05-22 06:55:47
|
Hola, If the description node has a predictable structure, you'll probably be able to extract the data you need with regular expressions: see http://uk3.php.net/manual/en/function.preg-match.php for a starting point if regexes are not something you've come across before. They have a bit of a learning curve, but are an extremely valuable tool when parsing strings. For your example below, the following (untested) call would probably work to extract the a's href: preg_match("/a\shref\s?=\s?[\'\"]([^\'\"]*)[\'\"]/i", $description, $matches); //the href of the link should be returned in $matches[1] ...and you should be able to do something similar to extract the img src. If there's only one image, it can be as simple as: preg_match("/src\s?=\s?[\'\"]([^\'\"]*)[\'\"]/i", $description, $matches); //the src of the image should be returned in $matches[1] Just calling strip_tags() on the description node will extract the text - this probably has less overhead than a regex function. Hope this is helpful, G Gareth Dart Web Developer Higher Education Statistics Agency (HESA) 95 Promenade, Cheltenham, Gloucestershire GL50 1HZ T 01242 211128 F 01242 211122 W www.hesa.ac.uk <http://www.hesa.ac.uk/> ________________________________ From: mag...@li... [mailto:mag...@li...] On Behalf Of S T Sent: Wednesday 21 May 2008 19:38 To: mag...@li... Subject: [Magpierss-general] $item[description] returns "image +htmlformating + description". How do I parse? Hello, I am successfully using the Magpie code to parse a spanish xml file. So far so good. You can see my progress here: http://www.hispanocarro.com/test/index3.php There are some CSS hacks there that some listserve members might find useful. I need to take this a step further though. When I request the description node.... $description = $item[description]; ...I get a chunk of formatted html in return. It looks like this... <p> <a href="myImageURL"> <img align="left" alt="photo" border="0" height="myHeight" src="myImageSrc" width="myWidth" /></a> MyDescription </p> <br clear="all" /> I really need to break this down into it's individual elements. How can I request "myImageURL", "myImageSrc" etc. so that i can have more control over the design of my pages? Thank you in advance. John ------------------------------------------------------------------------ ------------ Get Flash Components and Templates at www.thermalstudios.com ______________________________________________________________ This incoming email was virus scanned for HESA by MessageLabs. ______________________________________________________________ _____________________________________________________________________ Higher Education Statistics Agency Ltd (HESA) is a company limited by guarantee, registered in England at 95 Promenade Cheltenham GL50 1HZ. Registered No. 2766993. The members are Universities UK and GuildHE. Registered Charity No. 1039709. Certified to ISO 9001 and BS 7799. HESA Services Ltd (HSL) is a wholly owned subsidiary of HESA, registered in England at the same address. Registered No. 3109219. _____________________________________________________________________ This outgoing email was virus scanned for HESA by MessageLabs. _____________________________________________________________________ |
From: S T <mo...@ya...> - 2008-05-21 18:38:06
|
Hello, I am successfully using the Magpie code to parse a spanish xml file. So far so good. You can see my progress here: http://www.hispanocarro.com/test/index3.php There are some CSS hacks there that some listserve members might find useful. I need to take this a step further though. When I request the description node.... $description = $item[description]; ...I get a chunk of formatted html in return. It looks like this... <p> <a href="myImageURL"> <img align="left" alt="photo" border="0" height="myHeight" src="myImageSrc" width="myWidth" /></a> MyDescription </p> <br clear="all" /> I really need to break this down into it's individual elements. How can I request "myImageURL", "myImageSrc" etc. so that i can have more control over the design of my pages? Thank you in advance. John ------------------------------------------------------------------------------------ Get Flash Components and Templates at www.thermalstudios.com |
From: Atlantis <atl...@gm...> - 2008-04-30 13:45:12
|
Hi, I need help with the following error message: Warning: MagpieRSS: Failed to fetch http://www.pfadiradio.ch/prusik.xml (HTTP Error: connection failed (11) in /home/www/*/html/rss/rss_fetch.inc on line 238 Warning: Invalid argument supplied for foreach() in /home/www/*/html/index.php on line 37 Warning: Invalid argument supplied for foreach() in /home/www/*/html/index.php on line 41 This message is thrown when the page is loaded. The most mysterious thing is that magpierss has been running on exactly this site for about 2 years until it suddenly stopped working. I haven't made any changes to the code at the time it stopped working. My web provider says they didn't change anything on the server either, however the exact same magpierss source code still works on another server of mine, so there must be an issue with the server, somewhere I read it might be wrong php settings. Can anybody tell me what to do about it? If it's a php settings issue, can you please tell me what exactly? Thanks. Kind regards Martin |
From: Farrell, A. <and...@ng...> - 2008-04-22 20:17:04
|
I printed out the response from the rss request to google: We're sorry... ... but your query looks similar to automated requests from a computer virus or spyware application. To protect our users, we can't process your request right now. We'll restore your access as quickly as possible, so try again soon. In the meantime, if you suspect that your computer or network has been infected, you might want to run a virus checker or spyware remover to make sure that your systems are free of viruses and other spurious software. If you're continually receiving this error, you may be able to resolve the problem by deleting your Google cookie and revisiting Google. For browser-specific instructions, please consult your browser's online support center. If your entire network is affected, more information is available in the Google Web Search Help Center. We apologize for the inconvenience, and hope we'll see you again on Google. Any idea what google might be using to come to this conclusion? MAGPIE_USER_AGENT? In my testing, I've hit the server a couple times within a few minutes to test my code... Maybe it's the frequency with which I'm searching??? On Apr 22, 2008, at 3:38 PM, Farrell, Andy wrote: Hi list... I'm using LeafRSS (which uses MagpieRSS) to aggregate a number of news feeds via news.google.com. I'm trying to use google to filter out what news I retrieve. Via the search box at news.google.com I can search for "baseball site:www.mynewspaper.com/sports". You can subscribe to the resulting articles as an RSS feed via google as well. I'm trying to use MagpieRSS to retrieve the google generated RSS feed but it's giving me an error which happens sporadically: Warning: MagpieRSS: Failed to parse RSS file. (not well-formed (invalid token) at line 13, column 18) in /<dirpath>/magpierss/rss_fetch.inc on line 238. The line of code in rss_fetch.inc is : 238: trigger_error( $errormsg, $lvl); Is this en error on the trigger_error call, or an error generated somewhere else? Any way to tell which function called the error function which calls trigger_error? Any idea what the error is? Thanks- Andy ------------------------------------------------------------------------- This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone _______________________________________________ Magpierss-general mailing list Mag...@li... https://lists.sourceforge.net/lists/listinfo/magpierss-general |
From: Farrell, A. <and...@ng...> - 2008-04-22 19:43:29
|
Hi list... I'm using LeafRSS (which uses MagpieRSS) to aggregate a number of news feeds via news.google.com. I'm trying to use google to filter out what news I retrieve. Via the search box at news.google.com I can search for "baseball site:www.mynewspaper.com/sports". You can subscribe to the resulting articles as an RSS feed via google as well. I'm trying to use MagpieRSS to retrieve the google generated RSS feed but it's giving me an error which happens sporadically: Warning: MagpieRSS: Failed to parse RSS file. (not well-formed (invalid token) at line 13, column 18) in /<dirpath>/magpierss/rss_fetch.inc on line 238. The line of code in rss_fetch.inc is : 238: trigger_error( $errormsg, $lvl); Is this en error on the trigger_error call, or an error generated somewhere else? Any way to tell which function called the error function which calls trigger_error? Any idea what the error is? Thanks- Andy |
From: <ch...@gm...> - 2008-04-12 10:38:46
|
Hi mag...@li...,<br/><br/> ch...@gm... has invited you to Spokeo, which finds your friends' updates across the Web. On average, Spokeo finds 87 friends across 30 different social networks. See what Spokeo can find for you now! Click http://www.spokeo.com/public/join?c=7417dddeb95fbf473f5dfcbcdf162d7b18b09f6b to accept your invitation. ----- This invitation was sent with chedong's approval. If you wish to opt out of all future emails, go to http://www.spokeo.com/optout?c=m495355545651485749 Copyright (c) 2008 Spokeo, Inc. All rights reserved. 1685 Plymouth Street #200, Mountain View, CA 94043 |
From: Gandolfi <un...@if...> - 2008-04-07 20:32:03
|
magpierss-general, i'm a french user of magpierss and i have a problem because i want to parse a big file XML (5 Mo) with magpierrs et it seem to be limited. In fact i have just a cache file of 450 ko and not more. i partially parse my file. is it a problem direct with magpierss ? how to resolve it ? is there a option in magpierss to increase the siez of cache or not limited magpierss ? thank you Gandolfi ================================= |
From: Neerav <nee...@us...> - 2008-04-06 04:16:05
|
There were a number of problems. The str_replace wasn't being stored. Defining RSS_URL as a constant doesn't allow it to be changed later. I've made a few changes and sent you the file (from the server without a subject, so please check your spam). Your code in if($rss) does not seem to be working correctly, but $rss is definitely populated with your spider picture item as you can see through the print_r that I added just before it. You'll have to change the magpierss include path. -- Neerav |
From: S H. <sha...@gm...> - 2008-04-05 18:17:01
|
hmm i tried all that still doesn't quite work. sorry, thanks for the help! maybe i'm just doing something stupid, and if i show you the full file it would help: <?php /* * This script is a modified version of getflickr.php. * This modification Created by Shawna Hein, 2008 * First iterations created by the authors below. * * / /* * getflickr.php uses the MagpieRSS Parser to fetch and parse a * tag, user or group pool RSS feed from the photo sharing site www.flickr.com, * and display the pictures. * * V 0.6 renamed back to getflickr.php, and fixed up to account for new flickr * url structure by Tim Bishop (http://www.timbishop.com). * * V 0.5 of flickrrss.php written and released under GPL by * Dave Kellam (http://www.eightface.com) * * V 0.2 of getflickr.php written and released under GPL by * Tim Bishop, http://www.timbishop.com * * This script requires the lastest version of MagpieRSS, which * can be obtained from: http://magpierss.sourceforge.net * * Thanks to Kellan Elliott-McCrea, author of MagpieRSS, and * cip at acme.sinless.org, who wrote the first version of this * script. And Tim Bishop (http://www.timbishop.com) who wrote * the getflickr.php script that I used as a base. * * THIS SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY * KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE * WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR * PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS * OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR * OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE * SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. */ ################################# ## User configurable variables ## ################################# # set what type of tag you're pulling from flickr # 0 = tag; 1 = user; 2 = group pool //$tagtype = 0; $tagtype = $_REQUEST["tagtype"] or $tagtype = 0; # set the tag to pull from flickr (tag, user or group pool) # example: "cats", "44124462494@N01", "circle" //$tag = "55589891@N00"; //$tag = "sapphire07"; //$tag = "Istanbul"; $tag = $_REQUEST["tag"] or $tag = "istanbul"; # set the user to pull from flickr # example: "44124462494@N01" $user = $_REQUEST["user"] or $user = "23254418@N07"; $user = urldecode($user); # set default value for number of pictures to show # (flickr show maximum of 10 photos in feed) //$num_items = 10; $num_items = $_REQUEST["num_items"] or $num_items = 10; # set to true to use medium pics, otherwise it uses small $mediumPics = true; # set location of Magpie RSS files require_once('magpierss/rss_fetch.inc'); require_once('magpierss/rss_utils.inc'); #require_once('./rss_fetch.inc'); #require_once('./rss_utils.inc'); ################################################################ ## End of user defined variables, except for some html cruft ## ################################################################ // get the feeds if ($tagtype == 0) { define('RSS_URL',' http://www.flickr.com/services/feeds/photos_public.gne?tags=' . $tag . '&id=' . $user . '&format=rss_200'); } elseif ($tagtype == 1) { define('RSS_URL',' http://www.flickr.com/services/feeds/photos_public.gne?id=' . $tag . '&format=rss_200'); } //elseif () { define('RSS_URL',' http://api.flickr.com/services/feeds/photoset.gne?set=' . $tag . '&nsid=23254418@N07&lang=en-us' elseif ($tagtype == 2) { define('RSS_URL',' http://api.flickr.com/services/feeds/groups_pool.gne?id=' . $tag . '&format=rss_200'); } else { Print "Invalid tagtype"; } str_replace("&", "&", RSS_URL); print RSS_URL . "</br>"; # get rss file $rss = fetch_rss(RSS_URL); if ($rss) { #echo "test"; # HTML formatting of results $imgurl = ""; # specifies number of pictures $items = array_slice($rss->items, 0, $num_items); foreach ( $items as $item ) { $count = 0; foreach ($item as $element) { #print $count; if($count==6){ $description = $element; #print $description; if(preg_match('<img src="([^"]*)" [^/]*/>',$description,$imgUrlMatches)) { $imgurl = $imgUrlMatches[1]; print "$imgurl|"; } } #if($count==11) #print "$element|"; $count++; } #print $item[0]; #print "NEXT ITEM"; $title = $item['title']; $url = $item['link']; } } else { echo magpie_error(); } ?> On Fri, Apr 4, 2008 at 11:00 PM, Neerav <nee...@us...> wrote: > > > > define('RSS_URL',' > http://www.flickr.com/services/feeds/photos_public.gne?tags=' > > > > . $tag . '&id=' . $user . '&format=rss_200'); } > > On Sat, Apr 5, 2008 at 10:41 AM, S Hein <sha...@gm...> wrote: > > yes, thanks, I tried that already. when i do that, none of the photos > show > > up... > > I HAD to str_replace("&", "&", $url) to get magpierss to work. > The encoded url's that were supplied from elsewhere broke it. > > The url you have given as an example is working in my implementation > -- I'm seeing just the one giant spider from your bathtub. > > Just noticed that the $user has @ in it. Is your $user already > urlencoded? Do a urldecode($user) The urldecoded @ in the url is > working on my system, whereas a urlencoded @ returns "Photos from > everyone tagged spider". I think that's the solution. > > -- > Neerav > > ------------------------------------------------------------------------- > This SF.net email is sponsored by the 2008 JavaOne(SM) Conference > Register now and save $200. Hurry, offer ends at 11:59 p.m., > Monday, April 7! Use priority code J8TLD2. > > http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone > _______________________________________________ > Magpierss-general mailing list > Mag...@li... > https://lists.sourceforge.net/lists/listinfo/magpierss-general > |
From: Neerav <nee...@us...> - 2008-04-05 06:36:27
|
I just discovered a problem with [David] Pogue's Post RSS feed. When using the URL: http://www.nytimes.com/services/xml/rss/nyt/PoguesPosts.xml I get the error "Error: MagpieRSS: Failed to parse RSS file. (Mismatched tag at line 93, column 8)". However, when entering the url in a browser, I am redirected to http://pogue.blogs.nytimes.com/rss2.xml Using that URL in magpieRSS, I get "Error: MagpieRSS: Failed to parse RSS file. (Invalid document end at line 384, column 1)". There is no line 384 in that feed! Both url's are VALID according to feedvalidator.org with the following caveat: ======= Recommendations This feed is valid, but interoperability with the widest range of feed readers could be improved by implementing the following recommendations. * line 27, column 2: wfw:commentRSS should be wfw:commentRss (10 occurrences) [help] <wfw:commentRSS>http://pogue.dblogs.nytimes.com/?feed=rss2&p=444</wfw: ... * line 139, column 3: Missing atom:link with rel="self" [help] </channel> ======= So my guess is that these missing "recommendations" are something that is causing MagpieRSS to fail. Any suggestions on how I can fix magpieRSS? -- Neerav |
From: Neerav <nee...@us...> - 2008-04-05 06:00:35
|
> > > define('RSS_URL','http://www.flickr.com/services/feeds/photos_public.gne?tags=' > > > . $tag . '&id=' . $user . '&format=rss_200'); } On Sat, Apr 5, 2008 at 10:41 AM, S Hein <sha...@gm...> wrote: > yes, thanks, I tried that already. when i do that, none of the photos show > up... I HAD to str_replace("&", "&", $url) to get magpierss to work. The encoded url's that were supplied from elsewhere broke it. The url you have given as an example is working in my implementation -- I'm seeing just the one giant spider from your bathtub. Just noticed that the $user has @ in it. Is your $user already urlencoded? Do a urldecode($user) The urldecoded @ in the url is working on my system, whereas a urlencoded @ returns "Photos from everyone tagged spider". I think that's the solution. -- Neerav |
From: S H. <sha...@gm...> - 2008-04-05 05:11:28
|
yes, thanks, I tried that already. when i do that, none of the photos show up... On Fri, Apr 4, 2008 at 7:56 PM, Neerav <nee...@us...> wrote: > On Sat, Apr 5, 2008 at 6:07 AM, S Hein <sh...@is...> wrote: > > > if ($tagtype == 0) { > > define('RSS_URL',' > http://www.flickr.com/services/feeds/photos_public.gne?tags=' > > . $tag . '&id=' . $user . '&format=rss_200'); } > > Try replacing the & and make it & The ampersand escaping is > needed for display, not for the rss fetching. > > -- > Neerav > > ------------------------------------------------------------------------- > This SF.net email is sponsored by the 2008 JavaOne(SM) Conference > Register now and save $200. Hurry, offer ends at 11:59 p.m., > Monday, April 7! Use priority code J8TLD2. > > http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone > _______________________________________________ > Magpierss-general mailing list > Mag...@li... > https://lists.sourceforge.net/lists/listinfo/magpierss-general > |