RRD file only gets NaN values from a service

IT Dept
2011-12-02
2013-05-20
  • IT Dept

    IT Dept - 2011-12-02

    I've installed nagiosgraph 1.4.4 in order to chart NetApp disk use but I'm not getting any graphs  and need help debugging the problem.  I use the check_netapp plugin from nagios exchange.  The plugin puts data like this in the perfdata.log file:

    1322788407||cbs-fs1||DISK_vol0||DISKUSED64 OK - /vol/vol0/ - total: 33554432 Kb - used 268736 Kb (1%) - free: 33285696 Kb||NetApp /vol/vol0/ Used Space=268736KB;30198988;31876710;0;33554432

    With nagiosgraph set to debug level 5, the following shows up in nagiosgraph.log:

    Thu Dec  1 17:14:07 2011 insert.pl debug insert.pl processing started
    Thu Dec  1 17:14:07 2011 insert.pl debug getrules(/usr/local/nagiosgraph/etc/map)
    Thu Dec  1 17:14:07 2011 insert.pl debug readperfdata: perflog is /tmp/perfdata.log
    Thu Dec  1 17:14:07 2011 insert.pl info read 7 lines from perflog
    Thu Dec  1 17:14:07 2011 insert.pl debug processdata (7 lines)
    Thu Dec  1 17:14:07 2011 insert.pl debug processdata data = [
      '1322788407',
      'cbs-fs1',
      'DISK_vol0',
      'DISKUSED64 OK - /vol/vol0/ - total: 33554432 Kb - used 268736 Kb (1%) - free: 33285696 Kb',
      'NetApp /vol/vol0/ Used Space=268736KB;30198988;31876710;0;33554432'
    ];
    Thu Dec  1 17:14:07 2011 insert.pl info processing output/perfdata:
    hostname:cbs-fs1
    servicedesc:DISK_vol0
    output:DISKUSED64 OK - /vol/vol0/ - total: 33554432 Kb - used 268736 Kb (1%) - free: 33285696 Kb
    perfdata:NetApp /vol/vol0/ Used Space=268736KB;30198988;31876710;0;33554432
    Thu Dec  1 17:14:07 2011 insert.pl debug createrrd(1322788406,cbs-fs1,DISK_vol0,NetApp /vol/vol0/ Used Space)
    Thu Dec  1 17:14:07 2011 insert.pl debug createrrd rrdfile is /usr/local/nagiosgraph/var/rrd/cbs-fs1/DISK_vol0___NetApp%20%2Fvol%2Fvol0%2F%20Used%20Space.rrd
    Thu Dec  1 17:14:07 2011 insert.pl debug createrrd step=300 heartbeat=600 xff=0.5 resolutions=4 steps=1 6 24 288
    Thu Dec  1 17:14:07 2011 insert.pl debug labels-> = [
      'data',
      'GAUGE',
      275185664
    ];
    Thu Dec  1 17:14:07 2011 insert.pl debug labels-> = [
      'warn',
      'GAUGE',
      '30923763712'
    ];
    Thu Dec  1 17:14:07 2011 insert.pl debug labels-> = [
      'crit',
      'GAUGE',
      '32641751040'
    ];
    Thu Dec  1 17:14:07 2011 insert.pl debug labels-> = [
      'min',
      'GAUGE',
      0
    ];
    Thu Dec  1 17:14:07 2011 insert.pl debug labels-> = [
      'max',
      'GAUGE',
      '34359738368'
    ];
    Thu Dec  1 17:14:07 2011 insert.pl debug createrrd filenames = [
      'DISK_vol0___NetApp%20%2Fvol%2Fvol0%2F%20Used%20Space.rrd'
    ];
    Thu Dec  1 17:14:07 2011 insert.pl debug createrrd datasets = [
      [
        0,
        1,
        2,
        3,
        4
      ]
    ];
    Thu Dec  1 17:14:07 2011 insert.pl info runupdate dataset = [
      '/usr/local/nagiosgraph/var/rrd/cbs-fs1/DISK_vol0___NetApp%20%2Fvol%2Fvol0%2F%20Used%20Space.rrd',
      '1322788407:275185664:30923763712:32641751040:0:34359738368'
    ];
    Thu Dec  1 17:14:07 2011 insert.pl debug createrrd(1322788406,cbs-fs1,DISK_vol0,netappdisk)
    Thu Dec  1 17:14:07 2011 insert.pl debug createrrd rrdfile is /usr/local/nagiosgraph/var/rrd/cbs-fs1/DISK_vol0___netappdisk.rrd
    Thu Dec  1 17:14:07 2011 insert.pl debug createrrd step=300 heartbeat=600 xff=0.5 resolutions=4 steps=1 6 24 288
    Thu Dec  1 17:14:07 2011 insert.pl debug labels-> = [
      'diskused',
      'GAUGE',
      '275185664'
    ];
    Thu Dec  1 17:14:07 2011 insert.pl debug labels-> = [
      'diskwarn',
      'GAUGE',
      '30923763712'
    ];
    Thu Dec  1 17:14:07 2011 insert.pl debug labels-> = [
      'diskcrit',
      'GAUGE',
      '32641751040'
    ];
    Thu Dec  1 17:14:07 2011 insert.pl debug labels-> = [
      'diskmaxi',
      'GAUGE',
      '34359738368'
    ];
    Thu Dec  1 17:14:07 2011 insert.pl debug createrrd filenames = [
      'DISK_vol0___netappdisk.rrd'
    ];
    Thu Dec  1 17:14:07 2011 insert.pl debug createrrd datasets = [
      [
        0,
        1,
        2,
        3
      ]
    ];
    Thu Dec  1 17:14:07 2011 insert.pl info runupdate dataset = [
      '/usr/local/nagiosgraph/var/rrd/cbs-fs1/DISK_vol0___netappdisk.rrd',
      '1322788407:275185664:30923763712:32641751040:34359738368'
    ];

    I use the following map file entry to parse the data:

    /perfdata:NetApp.*Space=(\d+)KB;(\d+);(\d+);\d+;(\d+)/
    and push @s, [ 'netappdisk',
        ,
        ,
        ,
         ];
    };

    All that ends up for values in the RRD file is NaN.  Here's part of the output from "rrdtool dump":

    <rrd>
            <version>0003</version>
            <step>300</step> <!- Seconds ->
            <lastupdate>1322788407</lastupdate> <!- 2011-12-01 17:13:27 PST ->

            <ds>
                    <name> diskused </name>
                    <type> GAUGE </type>
                    <minimal_heartbeat>600</minimal_heartbeat>
                    <min>NaN</min>
                    <max>NaN</max>

                    <!- PDP Status ->
                    <last_ds>275185664</last_ds>
                    <value>NaN</value>
                    <unknown_sec> 207 </unknown_sec>
            </ds>

            <ds>
                    <name> diskwarn </name>
                    <type> GAUGE </type>
                    <minimal_heartbeat>600</minimal_heartbeat>
                    <min>NaN</min>
                    <max>NaN</max>

                    <!- PDP Status ->
                    <last_ds>30923763712</last_ds>
                    <value>NaN</value>
                    <unknown_sec> 207 </unknown_sec>
            </ds>

            <ds>
                    <name> diskcrit </name>
                    <type> GAUGE </type>
                    <minimal_heartbeat>600</minimal_heartbeat>
                    <min>NaN</min>
                    <max>NaN</max>

                    <!- PDP Status ->
                    <last_ds>32641751040</last_ds>
                    <value>NaN</value>
                    <unknown_sec> 207 </unknown_sec>
            </ds>

            <ds>
                    <name> diskmaxi </name>
                    <type> GAUGE </type>
                    <minimal_heartbeat>600</minimal_heartbeat>
                    <min>NaN</min>
                    <max>NaN</max>

                    <!- PDP Status ->
                    <last_ds>34359738368</last_ds>
                    <value>NaN</value>
                    <unknown_sec> 207 </unknown_sec>
            </ds>

    Any ideas on what I'm doing wrong?

    Thanks,
    Carl

     
  • Matthew Wall

    Matthew Wall - 2011-12-07

    it looks like there are two different rrd files involved, one called '/usr/local/nagiosgraph/var/rrd/cbs-fs1/DISK_vol0___NetApp%20%2Fvol%2Fvol0%2F%20Used%20Space.rrd' that is created by the default map rule, and another called '/usr/local/nagiosgraph/var/rrd/cbs-fs1/DISK_vol0___netappdisk.rrd' that is created by the map rule that you defined.

    in both cases, the number of elements in the update looks ok.  for example, the first rrd file defines data, warn, crit, min, max.  the update contains 6 elements (timestamp followed by data, warn, crit, min).  the second rrd file looks ok as well.

    what is the nagios check interval for this plugin?  does it align with the step=300 heartbeat=600?  also, remember that you'll need a couple of updates before the NaN go away.

     

Log in to post a comment.

Get latest updates about Open Source Projects, Conferences and News.

Sign up for the SourceForge newsletter:





No, thanks