From: Phil R. <phi...@go...> - 2011-10-07 17:16:49
|
Like many people I am an avid Slashdot reader, and I would firstly like to thank you for producing such an excellent site. I'm not sure that this is the correct place to ask this question, if not can you please point me in the correct direction. I am writing an application that is pulling RSS feeds from (among others) Slashdot. I am not an expert with RSS, but have not had too many problems until this one. When I pull in the feed from http://rss.slashdot.org/Slashdot/slashdot I received a block of xml (which I tried to attach, but the email bounded). Unfortunately I cannot parse this with the java parser JDOM: the xml is malformed. An example of the problem is: <item rdf:about="http://ask.slashdot.org/story/11/10/07/1438230/Ask-Slashdot-Does-Being-Loyal-Pay-As-a-Developer?utm_source=rss1.0mainlinkanon&utm_medium=feed"> <title> Ask Slashdot: Does Being 'Loyal' Pay As a Developer? </title> As you can see the &utm_medium=feed is malformed. It should be &utm_medium=feed, if I understand the W3C standards correctly. I have tried a number of attempts to clean this up pre feeding into my application, with mixed success. I actually have three questions: "Am I correct that this is malformed" "If so Are you planning on changing your feed so that it is no long malformed". "In anycase are you aware of any Java XML parsers that can accept this broken feed?" Thanks |