#297 New adminmail stats, # of seconds DBs late

Slash 2.3/2.4
closed-wont-fix
MySQL (16)
5
2003-09-09
2003-06-22
No

A "SHOW PROCESSLIST" on a slave DB tells us how many
seconds behind that DB is on replication, in the Time
column of the Slave SQL thread. E.g., on one of our slaves:

| Id | User | Host | db | Command | Time
| State | Info |
| 8 | system user | localhost | NULL | Connect | 1
| Slave: waiting for binlog update | NULL |

This slave is 1 second behind the master.

I would suggest that a new task be written which runs
every minute and sees how far behind each slave DB is
(readers and log_slaves). Store this info in a new
daily table. Then adminmail.pl would find the median,
90th-, and 99th-percentile values for each slave, and
write that into stats_daily (and then purge the
previous day's values from the new daily table of course).

If we get a baseline now on how well this is working,
then as we change to new DB machines, as load changes,
and as code changes, we can see the effects.

Discussion

  • Jamie McCarthy

    Jamie McCarthy - 2003-09-04

    Logged In: YES
    user_id=3889

    There's a fuzzy line between what stats Slash should keep and
    what stats are better suited for MON. I think this is on the MON
    side.

    I'm going to work with jab to get this into MON, and not bother
    having it accessible with stats.pl. I'll see if we can't get some
    Slash admins access to the graphs that AFAIK currently only
    netops can see...

     
  • Jamie McCarthy

    Jamie McCarthy - 2003-09-04
    • assigned_to: jamiemccarthy --> cmdrtaco
    • status: open --> open-wont-fix
     
  • Rob Malda

    Rob Malda - 2003-09-09
    • status: open-wont-fix --> closed-wont-fix
     

Log in to post a comment.