Work at SourceForge, help us to make it a better place! We have an immediate need for a Support Technician in our San Francisco or Denver office.
Hello all, again!
Just wondering if someone out there may have a backup (doesn't has to be recent) of records harvested from PubMed Central, but with a special twist: metadataPrefix = pmc_fm
Although doing the harvesting at night (in the States), breaking it in the morning and resume it at night or weekends (off-peak times), we are talking about 2.1 million records and that is quite a considerable amount of work to ask pubmed servers to do.
If someone has that and would be so kind to share it (don't think PubMed Central will not like that, quite the opposite)...
But you may ask why metadataPrefix = pmc_fm instead of the usual metadataPrefix = oai_dc...
Well, Demian's lastest patch in http://vufind.org/jira/browse/VUFIND-258 (Use VuFind as article index) makes possible to full take advantage of the healthiness of information PubMed sends in this metadata schema, not present in dc one's, beside that in pmc_fm there is a full set of keywords that in oai_dc is just (usually):
<kwd>hormone replacement therapy</kwd>
Besides that, I really want is this group:
<journal-id journal-id-type="nlm-ta">Breast Cancer Res</journal-id>
<journal-title>Breast Cancer Research</journal-title>
Yes, it all there whilst in oai_dc the record is "just" (at least this random record):
<oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"; xmlns:dc="http://purl.org/dc/elements/1.1/"; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"; xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><identifier>pubmed-13913</identifier><datestamp>2001-02-27</datestamp>
<dc:title>The Million Women Study: design and characteristics of the study population</dc:title>
<dc:rights>Copyright © 1999 Current Science Ltd</dc:rights>
Ok, I guess I will receive an e-mail soon from pubmed central blaming me for the quantity of extra requests from all of you out there that are re-harvesting their repository because of this message of mine... :)
Thanking in advance anyone who has done this harvesting and might share the .tar (.zip, whatever) of it;
All the best,
Filipe Manuel S. Bento | http://about.me/filipeb
Universidade de Aveiro | Campus Universitário Santiago
3810-193 Aveiro | Portugal