From: Rob M. <rob...@gm...> - 2009-08-07 04:38:38
All,

Thanks for all the help. I got the NTRS Fetcher packaged up as a proper plugin. It isn't perfect, but it does a reasonable job given what NTRS spits out. This means:

- XML files with blank lines at the start.
- Lots of fields with more than one entry (date, identifiers, etc.).
- No particular format for things like date.
- Not all information pushed by OAI (keyword, etc.).
- Limited use hours (shut down 8am-8pm EST).

It might be more useful to build an NTRS fetcher based on a pure HTML scraper to get around most of these problems.

I hope someone finds it useful.

Rob
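
For anyone hitting the first two quirks listed above (blank lines before the XML, and repeated fields), here is a minimal Python sketch of one way to cope. It is not the plugin's actual code. The sample record is made up for illustration, parse_record is a hypothetical helper name, and the use of the standard Dublin Core namespace is an assumption about what the OAI feed returns.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Standard Dublin Core element namespace (assumed to be what the feed uses).
DC_NS = "{http://purl.org/dc/elements/1.1/}"


def parse_record(raw: str) -> dict:
    """Parse one record, tolerating leading blank lines and repeated fields.

    Hypothetical helper: returns a dict mapping each Dublin Core field name
    to a list of all the values found for it, in document order.
    """
    # The parser rejects whitespace before an XML declaration, so strip it.
    root = ET.fromstring(raw.lstrip())

    fields = defaultdict(list)
    for elem in root.iter():
        # Keep only Dublin Core elements that carry non-empty text.
        if elem.tag.startswith(DC_NS) and elem.text and elem.text.strip():
            name = elem.tag[len(DC_NS):]
            fields[name].append(elem.text.strip())
    return dict(fields)


# Made-up record: blank lines at the start, two dates, two identifiers.
SAMPLE = """

<?xml version="1.0"?>
<record xmlns:dc="http://purl.org/dc/elements/1.1/">
  <dc:title>Example report</dc:title>
  <dc:date>2009-08-07</dc:date>
  <dc:date>Aug. 2009</dc:date>
  <dc:identifier>ID-1</dc:identifier>
  <dc:identifier>ID-2</dc:identifier>
</record>
"""

record = parse_record(SAMPLE)
```

Keeping every value in a list leaves the choice of which date or identifier to use to the caller, which seems safer than guessing when the dates follow no particular format.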