Re: [Scalablecr-discuss] User configuration file format
Brought to you by:
kathrynmohror,
moody20
From: Adam T. M. <mo...@ll...> - 2016-01-26 23:45:12
|
Hi Maksym, We replaced CACEDESC with STORE. The user manual is correct in this case (Section 6.2). I've pushed a fix for this: https://github.com/hpc/scr/commit/f8e067ac3057c4c6b9f9a464431c3be913112117 Sorry about that. I obviously forgot to check that the example scr.user.conf file was up-to-date with the latest source code. -Adam Maksym Planeta wrote: > And a follow up question. > > The example proposes using CACHEDESC (or CACHE of you do the same > change as with CKPTDESC) keyword. The documentation proposes using > STORE keyword. > > Both the keyword seems to specify the same thing. > > Which one is the right one? > > On 01/26/2016 12:54 AM, Mohror, Kathryn wrote: > >> Hi Maksym, >> >>> >>> I decided to try out SCR. I compiled it and installed as specified >>> in the manual. >>> No I try to specify checkpoint descriptors in use configuration file. >> >> >> Glad to hear you're trying out SCR! >> >>> It turns out that the documentation describes other format, from >>> what an >>> example at https://github.com/hpc/scr/blob/master/scr.user.conf shows. >>> >>> For example, file doc/scr_users_manual.pdf does not contain keyword >>> CKPTDESC whatsoever. >>> >>> Could you tell me what is the correct format? >> >> >> It looks like you have uncovered a bug in the example scr.user.conf >> file. Please use the keyword CKPT as you found in the user's guide >> for those lines instead of CKPTDESC. >> >>> I tried to use the one which documentation specifies, but I get an >>> error which >>> tells, that I probably don't have enough nodes: >>> >>> SCR v1.1.8 WARNING: rank 10 on taurusi6325: Failed to find partner >>> processes for redundancy descriptor 0, disabling checkpoint, too few >>> nodes? >>> @ scr_reddesc.c:169 >>> >>> I definitely do, because I specify SET_SIZE=1 and create a job with >>> 4 nodes. >> >> >> Yes, this error is related to the SCR_SET_SIZE parameter. Try setting >> it to 8 and see if it works better. I believe the reason you get that >> message is because the set size needs to be greater to 1 for a >> redundancy scheme to work. >> >> Let me know if that helps! If not we can work some more on it. >> >> Kathryn >> >>> >>> I attach all the configuration files for completeness. >>> >>> -- >>> Regards, >>> Maksym Planeta >> > >------------------------------------------------------------------------ > >------------------------------------------------------------------------------ >Site24x7 APM Insight: Get Deep Visibility into Application Performance >APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month >Monitor end-to-end web transactions and take corrective actions now >Troubleshoot faster and improve end-user experience. Signup Now! >http://pubads.g.doubleclick.net/gampad/clk?id=267308311&iu=/4140 > >------------------------------------------------------------------------ > >_______________________________________________ >Scalablecr-discuss mailing list >Sca...@li... >https://lists.sourceforge.net/lists/listinfo/scalablecr-discuss > > |