From: Rui L. <ru...@il...> - 2015-10-16 13:41:42
|
Hi Nilton, Thanks a lot for letting me know, and glad that it works for you! :-) Copying the list for archiving purposes... Thanks, Rui On 10/15/2015 05:36 PM, Nilton Luiz Queiroz Junior wrote: > Hello Rui, > I'm sorry for disturbing you, but i dont see that the rapl component was disabled. > > I just enable the component, and it worked. > > Thanks for help me, > and sorry for the inconvenience. > > Nilton > > > > Subject: Re: [PerfSuite-users] Error running psrun with papi rapl component > > To: nil...@ho... > > From: ru...@il... > > Date: Thu, 15 Oct 2015 11:46:45 -0500 > > > > Hi Nilton, > > > > Sorry for the delay in replying, and sorry you might have received two emails, where in the first one I forgot to attach a file and canceled it while sending. > > > > First a few questions: > > 1. Is it correct that your PerfSuite installation works with regular CPU events, and this issue is just for RAPL events? > > 2. What if you put only 1 RAPL event in your event config file? > > 3. What if you put a PAPI_TOT_CYC and 1 RAPL event in your event config file? > > > > PerfSuite does a dry run before actually running the program and in your case, the error was in the dry run part. From the source code, the problem was after calling PAPI_add_events(...) in do_dryrun() in src/libpshwpc/hwpc-papi.c. It looked like PAPI_add_events(...) did not return PAPI_OK, but the code handled only the error case of PAPI_ECNFLCT (counter conflict, i.e., they can not be measured without multiplexing) and the case where events were partially added. > > > > I added a few lines to print more debug info in the hwpc-papi.c file. Could you please replace the original one in src/libpshwpc/, then do a rebuild, re-install and re-run, and let me know the output? > > > > Sorry now I don't have a machine which supports RAPL events and allows me the root access, so I can not debug/test further. > > > > Thanks, > > Rui > > > > On 10/13/2015 02:59 PM, Nilton Luiz Queiroz Junior wrote: > > > Hello, i was trying to run a program with rapl component and got the following error: > > > > > > "libpsrun fatal error: error reported by performance software layer" > > > > > > then i recompile perfsuit with debug, and set the PS_DEBUG enviroment variable with value 3 > > > > > > i ran again and got the following error: > > > > > > PerfSuite debugging enabled (debug level: PS_DEBUG_INFO) [PID 20980] > > > Library version: threaded > > > [PID 20980] Environment (entry of psrun_init) > > > [PID 20980] PSRUN_DOFORK = (null) > > > [PID 20980] LD_PRELOAD = libpsrun.so.0 > > > [PID 20980] PSRUN_PID = 20980 > > > [PID 20980] PS_HWPC_FILE = energy.out.stats > > > libpsrun.c:201 : [PID 20980] entering psrun_init > > > libpsrun.c:304 : [PID 20980]: unsetting LD_PRELOAD > > > hwpc.c:301 : Initializing real time clock... > > > timers.c:416 : Using timestamp counter for RTC. > > > cpuid-x86.c:134 : Retrieving max CPUID function and vendor string... > > > cpuid-x86.c:114 : d 756e6547 6c65746e 49656e69 > > > cpuid-x86.c:149 : Vendor identified as Intel (GenuineIntel) > > > cpuid-x86.c:190 : Retrieving processor signature... > > > cpuid-x86.c:114 : 306a9 7100800 7fbae3ff bfebfbff > > > cpuid-x86.c:222 : eax = 306a9, ebx=7100800 > > > cpuid-x86.c:226 : family=6, model=58, stepping=9. > > > cpuid-x86.c:235 : Checking for implementation of brand string... > > > cpuid-x86.c:114 : 80000008 0 0 0 > > > cpuid-x86.c:114 : 20202020 20202020 65746e49 2952286c > > > cpuid-x86.c:114 : 726f4320 4d542865 37692029 3737332d > > > cpuid-x86.c:114 : 50432030 20402055 30342e33 7a4847 > > > cpuid-x86.c:267 : Brand string is available > > > cpuid-x86.c:283 : Checking deterministic cache parameters... > > > cpuid-x86.c:114 : 1c004121 1c0003f 3f 0 > > > cpuid-x86.c:114 : 1c004122 1c0003f 3f 0 > > > cpuid-x86.c:114 : 1c004143 1c0003f 1ff 0 > > > cpuid-x86.c:114 : 1c03c163 3c0003f 1fff 6 > > > cpuid-x86.c:114 : 0 0 0 0 > > > cpuid-x86.c:292 : No more caches (iteration 4) > > > cpuid-x86.c:356 : Retrieving cache information... > > > cpuid-x86.c:114 : 76035a01 f0b2ff 0 ca0000 > > > cpuid-x86.c:364 : Need to run CPUID 0 more times to get cache info > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 5a > > > cpuid-x86.c:404 : Cache regval = 3 > > > cpuid-x86.c:404 : Cache regval = 76 > > > cpuid-x86.c:404 : Cache regval = ff > > > cpuid-x86.c:1399 : CPUID reports that leaf 2 lacks cache info > > > cpuid-x86.c:404 : Cache regval = b2 > > > cpuid-x86.c:404 : Cache regval = f0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor f0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = ca > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > hwpc.c:311 : Initialized global data > > > hwpc.c:320 : Signal handling off. > > > dyn-conf.c:86 : arch=0, family=6, model=58, stepping=9, extra=-1. > > > hwpc.c:2389 : Configuration file set to rapl.xml > > > hwpc.c:2434 : Attempting to parse "rapl.xml" > > > xmlparse.c:305 : XML parser: parsing expected XML document element ps_hwpc_eventlist (class = "PAPI") > > > xmlparse.c:494 : XML parser: found 3 events: > > > rapl::RAPL_ENERGY_CORES > > > rapl::RAPL_ENERGY_PKG > > > rapl::RAPL_ENERGY_GPU > > > hwpc.c:334 : Configuration file read > > > hwpc.c:336 : Read 3 events > > > hwpc.c:2259 : Initializing PAPI class... > > > hwpc-papi.c:343 : PAPI initialized successfully > > > hwpc-papi.c:362 : event 'rapl::RAPL_ENERGY_CORES', component index 1. > > > hwpc-papi.c:372 : PAPI_MAX_MPX_CTRS = 128. > > > hwpc.c:365 : Performance software supports 128 events max. > > > hwpc.c:458 : Calling package init routine > > > hwpc-papi.c:409 : PAPI initialized successfully > > > hwpc-papi.c:431 : PAPI version in use: 5.4.1.0 > > > hwpc-papi.c:557 : Available counters: 11 > > > Events requested: 3 > > > Derived events: 0 > > > hwpc-papi.c:1063 : Starting dry run check of event set. > > > hwpc-papi.c:1076 : Created dry run non-mpx event set successfully. > > > hwpc-papi.c:1229 : Dry run PAPI_start failed != ECNFLCT (Invalid argument). > > > hwpc-papi.c:603 : Dry run mpx check failed. > > > libpsrun fatal error: error reported by performance software layer > > > root@lcp-u2:/home/anderson/gerar_bases/benchs# psrun -r -F text -c rapl.xml -o energy.out.stats ./bubblesort > > > PerfSuite debugging enabled (debug level: PS_DEBUG_INFO) [PID 20996] > > > Library version: threaded > > > [PID 20996] Environment (entry of psrun_init) > > > [PID 20996] PSRUN_DOFORK = (null) > > > [PID 20996] LD_PRELOAD = libpsrun.so.0 > > > [PID 20996] PSRUN_PID = 20996 > > > [PID 20996] PS_HWPC_FILE = energy.out.stats > > > libpsrun.c:201 : [PID 20996] entering psrun_init > > > libpsrun.c:304 : [PID 20996]: unsetting LD_PRELOAD > > > hwpc.c:301 : Initializing real time clock... > > > timers.c:416 : Using timestamp counter for RTC. > > > cpuid-x86.c:134 : Retrieving max CPUID function and vendor string... > > > cpuid-x86.c:114 : d 756e6547 6c65746e 49656e69 > > > cpuid-x86.c:149 : Vendor identified as Intel (GenuineIntel) > > > cpuid-x86.c:190 : Retrieving processor signature... > > > cpuid-x86.c:114 : 306a9 1100800 7fbae3ff bfebfbff > > > cpuid-x86.c:222 : eax = 306a9, ebx=1100800 > > > cpuid-x86.c:226 : family=6, model=58, stepping=9. > > > cpuid-x86.c:235 : Checking for implementation of brand string... > > > cpuid-x86.c:114 : 80000008 0 0 0 > > > cpuid-x86.c:114 : 20202020 20202020 65746e49 2952286c > > > cpuid-x86.c:114 : 726f4320 4d542865 37692029 3737332d > > > cpuid-x86.c:114 : 50432030 20402055 30342e33 7a4847 > > > cpuid-x86.c:267 : Brand string is available > > > cpuid-x86.c:283 : Checking deterministic cache parameters... > > > cpuid-x86.c:114 : 1c004121 1c0003f 3f 0 > > > cpuid-x86.c:114 : 1c004122 1c0003f 3f 0 > > > cpuid-x86.c:114 : 1c004143 1c0003f 1ff 0 > > > cpuid-x86.c:114 : 1c03c163 3c0003f 1fff 6 > > > cpuid-x86.c:114 : 0 0 0 0 > > > cpuid-x86.c:292 : No more caches (iteration 4) > > > cpuid-x86.c:356 : Retrieving cache information... > > > cpuid-x86.c:114 : 76035a01 f0b2ff 0 ca0000 > > > cpuid-x86.c:364 : Need to run CPUID 0 more times to get cache info > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 5a > > > cpuid-x86.c:404 : Cache regval = 3 > > > cpuid-x86.c:404 : Cache regval = 76 > > > cpuid-x86.c:404 : Cache regval = ff > > > cpuid-x86.c:1399 : CPUID reports that leaf 2 lacks cache info > > > cpuid-x86.c:404 : Cache regval = b2 > > > cpuid-x86.c:404 : Cache regval = f0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor f0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > cpuid-x86.c:404 : Cache regval = ca > > > cpuid-x86.c:404 : Cache regval = 0 > > > cpuid-x86.c:422 : Skipping cache alloc for descriptor 0 > > > hwpc.c:311 : Initialized global data > > > hwpc.c:320 : Signal handling off. > > > dyn-conf.c:86 : arch=0, family=6, model=58, stepping=9, extra=-1. > > > hwpc.c:2389 : Configuration file set to rapl.xml > > > hwpc.c:2434 : Attempting to parse "rapl.xml" > > > xmlparse.c:305 : XML parser: parsing expected XML document element ps_hwpc_eventlist (class = "PAPI") > > > xmlparse.c:494 : XML parser: found 3 events: > > > rapl::RAPL_ENERGY_CORES > > > rapl::RAPL_ENERGY_PKG > > > rapl::RAPL_ENERGY_GPU > > > hwpc.c:334 : Configuration file read > > > hwpc.c:336 : Read 3 events > > > hwpc.c:2259 : Initializing PAPI class... > > > hwpc-papi.c:343 : PAPI initialized successfully > > > hwpc-papi.c:362 : event 'rapl::RAPL_ENERGY_CORES', component index 1. > > > hwpc-papi.c:372 : PAPI_MAX_MPX_CTRS = 128. > > > hwpc.c:365 : Performance software supports 128 events max. > > > hwpc.c:458 : Calling package init routine > > > hwpc-papi.c:409 : PAPI initialized successfully > > > hwpc-papi.c:431 : PAPI version in use: 5.4.1.0 > > > hwpc-papi.c:557 : Available counters: 11 > > > Events requested: 3 > > > Derived events: 0 > > > hwpc-papi.c:1063 : Starting dry run check of event set. > > > hwpc-papi.c:1076 : Created dry run non-mpx event set successfully. > > > hwpc-papi.c:1229 : Dry run PAPI_start failed != ECNFLCT (Invalid argument). > > > hwpc-papi.c:603 : Dry run mpx check failed. > > > libpsrun fatal error: error reported by performance software layer > > > > > > > > > ps: i'm using PAPI version 5.4.1 and perfsuite 1.1.4 and as root (because the rapl events only works with root user) in a linux kernel 3.16.0-41-generic (ubuntu 14.10) > > > > > > > > > Nilton > > > > > > > > > ------------------------------------------------------------------------------ > > > > > > > > > > > > _______________________________________________ > > > PerfSuite-users mailing list > > > Per...@li... > > > https://lists.sourceforge.net/lists/listinfo/perfsuite-users > > > |