From: Tuan N. <tu...@yo...> - 2011-10-13 21:25:02
|
Hi David, thanks for checking this out. Your explanation makes perfect sense. On the other hand, it raises a different issue, I am afraid different cataloguing practices may dictate how much details is coded in these fields. If you got a chance to update the code I'd love to test it out and report my findings. Cheers, T On 2011-10-13, at 3:14 PM, Walker, David wrote: > Hi Tuan, et alles, > > I think the issue here w/ the print count is that getMediaTypes() looks for explicitly declared media types in the 007 or the "form of item" entry in the 008/006. > > I suspect -- although I'll have to look at this more closely -- that most MARC records for print items simply don't have any explicit media type declared, under the assumption that the record is for an item in print unless it states otherwise. > > In which case, we might update the code to default to print if no value is found. > > I need to download and try out the latest (MIxin) version of the code myself. I notice Bob made a few (albeit relatively minor) changes. > > --Dave > > ----------------- > David Walker > Library Web Services Manager > California State University > > > -----Original Message----- > From: sol...@go... [mailto:sol...@go...] On Behalf Of Demian Katz > Sent: Thursday, October 13, 2011 11:27 AM > To: Tuan Nguyen; vuf...@li... Tech > Cc: sol...@go... > Subject: [solrmarc-tech] RE: [VuFind-Tech] well this is strange > > I'm copying this to the solrmarc-tech list in case anyone over there has comments. I haven't tried this myself yet, but I'll see if I can find some time in the next week or two to see how our collection breaks down. > > - Demian > >> -----Original Message----- >> From: Tuan Nguyen [mailto:tu...@yo...] >> Sent: Thursday, October 13, 2011 2:02 PM >> To: vuf...@li... Tech >> Subject: [VuFind-Tech] well this is strange >> >> Well I thought I'd tried out the new methods to see how our collection >> looks in terms of media types and content types, I indexed 2205853 >> marc records. The content types look reasonable, but the media types >> look suspicious. Particularly the Print (3666). I'm sure we have more >> than >> 3666 books in print. Has anyone tried this out? >> >> This is what I get: >> >> >> Content Types: >> ------------------------- >> Book (2043180) >> ComputerFile (378935) >> MusicRecording (39292) >> Periodical (37856) >> Thesis (32405) >> Serial (30109) >> Video (21007) >> MusicalScore (11341) >> Map (4555) >> MotionPicture (3987) >> MapSingle (3722) >> BookSubunit (2657) >> ProjectedMedium (1769) >> SoundRecording (1756) >> BookSeries (513) >> BookComponentPart (497) >> MixedMaterial (421) >> Newspaper (371) >> FlashCard (337) >> Website (324) >> ComputerCombination (289) >> ComputerInteractiveMultimedia (277) >> Atlas (256) >> MapSeries (240) >> Kit (209) >> ComputerDocument (176) >> Realia (159) >> BookCollection (124) >> Database (121) >> ComputerBibliographicData (93) >> ComputerProgram (92) >> SerialIntegratingResource (73) >> MapSerial (72) >> ArtReproduction (38) >> MusicalScoreManuscript (30) >> ComputerNumericData (22) >> ComputerRepresentational (21) >> Image (18) >> Slide (15) >> LooseLeaf (14) >> Model (14) >> Chart (13) >> SerialComponentPart (10) >> ComputerOnlineSystem (9) >> Filmstrip (9) >> PhysicalObject (7) >> Picture (6) >> Toy (6) >> Game (5) >> MapManuscript (4) >> >> >> Media Types: >> -------------------------------- >> Electronic (388049) >> Online (357551) >> Microfiche (124710) >> SoundDisc (27719) >> SoundDiscCD (14120) >> SoundDiscLP (13140) >> Microfilm (10085) >> VideoDVD (8635) >> VideoVHS (8515) >> Map (7123) >> SoundRecordingOther (7042) >> Print (3666) >> Filmstrip (1337) >> ComputerOpticalDisc (1329) >> VideoOther (988) >> MapOther (906) >> ComputerOther (733) >> SoundCassette (355) >> Atlas (347) >> VideoLaserdisc (303) >> SensorImage (136) >> MicrofilmReel (91) >> PrintLarge (85) >> VideoUMatic (85) >> Microform (53) >> Slide (51) >> Microopaque (47) >> SoundTapeReel (43) >> PhotomechanicalPrint (39) >> ComputerFloppyDisk (32) >> Braille (24) >> VideoBeta (22) >> MapView (20) >> Picture (17) >> MapSection (14) >> Chart (11) >> ComputerOpticalDiscCartridge (10) >> MapDiagram (10) >> VideoBluRay (10) >> ElectronicDirect (9) >> ImageOther (9) >> ComputerMagnetoOpticalDisc (7) >> FilmOther (7) >> ComputerDisk (4) >> VideoMII (4) >> FlashCard (3) >> GlobeOther (3) >> VideoEIAJ (3) >> FilmCassette (2) >> ComputerTapeCartridge (1) >> >> >> >> >> ---------------------------------------------------------------------- >> - >> ------- >> All the data continuously generated in your IT infrastructure contains >> a definitive record of customers, application performance, security >> threats, fraudulent activity and more. Splunk takes this data and >> makes sense of it. Business sense. IT sense. Common sense. >> http://p.sf.net/sfu/splunk-d2d-oct >> _______________________________________________ >> Vufind-tech mailing list >> Vuf...@li... >> https://lists.sourceforge.net/lists/listinfo/vufind-tech > > -- > You received this message because you are subscribed to the Google Groups "solrmarc-tech" group. > To post to this group, send email to sol...@go.... > To unsubscribe from this group, send email to sol...@go.... > For more options, visit this group at http://groups.google.com/group/solrmarc-tech?hl=en. > |