#3766 SMART status only shows first drive on 3ware RAID controller

Milestone: 1.520
Status: closed-fixed
Owner: Jamie Cameron
Priority: 5
Updated: 2010-10-07
Created: 2010-09-30
Creator: Steve McGrath
Private: No

I am opening a new issue for this, as the others are already closed and somewhat old. On my Ubuntu 10.04 server with a 3ware 9550SX-8LP card, the SMART status page only shows the first disk on my RAID controller.

I believe I saw something in another report saying that the current logic attempts to read each 3ware port sequentially until it gets a failure, to determine how many drives are present. If this is the case, it is likely the cause of my problem; I may have something of an edge case here, but I have drives on ports 0, 3, 4, and 5.

Is there a better solution, besides re-arranging my drives? I'll do it if I have to, but I'm also very willing to assist in finding a better way.

Discussion

(Page 1 of 2)
  • Jamie Cameron
    2010-09-30

    More recent versions of Webmin attempt to count the number of disks in a 3ware array using the command:

    tw_cli info c0

    What does that output on your system?

     
  • Steve McGrath
    2010-09-30

    Actually, for whatever reason (maybe the slot it's installed in?), my controller is c2. Here is the output from tw_cli info c2:

    steve@cthulu:/dev$ sudo tw_cli info c2

    Unit  UnitType  Status  %RCmpl  %V/I/M  Stripe  Size(GB)  Cache  AVrfy
    ------------------------------------------------------------------------------
    u0    SINGLE    OK      -       -       -       232.82    OFF    OFF
    u1    RAID-5    OK      -       -       64K     1862.62   OFF    OFF

    Port  Status       Unit  Size       Blocks      Serial
    ---------------------------------------------------------------
    p0    OK           u0    232.88 GB  488397168   9VY94PFZ
    p1    NOT-PRESENT  -     -          -           -
    p2    NOT-PRESENT  -     -          -           -
    p3    OK           u1    931.51 GB  1953525168  S246J1CZ700977
    p4    OK           u1    931.51 GB  1953525168  S246J1CZ700987
    p5    OK           u1    931.51 GB  1953525168  S246J1CZ700978
    p6    NOT-PRESENT  -     -          -           -
    p7    NOT-PRESENT  -     -          -           -

     
  • Jamie Cameron
    2010-09-30

    Attachment: smart-status.wbm.gz

  • Jamie Cameron
    2010-09-30

    Thanks - I think I see the issue, which is that Webmin assumes that all the disks in the array are numbered 0, 1, 2, etc.

    Attached to this bug is an update to the Smart Status module that should fix the issue. You can install it at Webmin -> Webmin Configuration -> Webmin Modules.
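
To illustrate the numbering problem, here is a minimal sketch (not the code in the attached module) of reading the populated port numbers from the port table of `tw_cli info cN` instead of counting up from 0. The sample text is the port table posted earlier in this thread; the function name `present_ports` is made up for this example.

```python
import re

# Port table as posted earlier in this thread (output of "tw_cli info c2").
SAMPLE = """\
Port  Status       Unit  Size       Blocks      Serial
---------------------------------------------------------------
p0    OK           u0    232.88 GB  488397168   9VY94PFZ
p1    NOT-PRESENT  -     -          -           -
p2    NOT-PRESENT  -     -          -           -
p3    OK           u1    931.51 GB  1953525168  S246J1CZ700977
p4    OK           u1    931.51 GB  1953525168  S246J1CZ700987
p5    OK           u1    931.51 GB  1953525168  S246J1CZ700978
p6    NOT-PRESENT  -     -          -           -
p7    NOT-PRESENT  -     -          -           -
"""


def present_ports(info_output):
    """Return the port numbers that have a drive attached.

    Hypothetical helper: it keeps every "pN" row whose status column
    is anything other than NOT-PRESENT, so gaps in the numbering
    (ports 1 and 2 here) do not end the scan early.
    """
    ports = []
    for line in info_output.splitlines():
        match = re.match(r"\s*p(\d+)\s+(\S+)", line)
        if match and match.group(2) != "NOT-PRESENT":
            ports.append(int(match.group(1)))
    return ports


if __name__ == "__main__":
    print(present_ports(SAMPLE))  # prints [0, 3, 4, 5]
```

A sequential probe that stops at the first failure would report only port 0 for this layout, which matches the symptom described in the original report.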

     
  • Steve McGrath
    2010-09-30

    Well, I installed the attached module, and it seems to make no difference. I still only see "3ware physical disk 0" in the drive selection dropdown.

     
  • Jamie Cameron
    2010-09-30

    Hmm .. what does it show as the /dev device name for the 3ware drive in the menu in the SMART module?

     
  • Steve McGrath
    2010-09-30

    It actually doesn't show a device name in the menu, which is odd now that you mention it. If I proceed to select it and view the SMART info, it gives the device name as "/dev/twa0, disk 0".

     
  • Jamie Cameron
    2010-09-30

    So I guess you don't have c0 or c1 3ware devices?

     
  • Steve McGrath
    2010-09-30

    Correct, there are no other RAID controllers in this server, and this one, for whatever reason, is c2. That's what the controller configuration shows as well, ID 2.

    Must be assigned based on PCI-X slot number, phase of the moon, or fair dice roll...

     
  • Jamie Cameron
    2010-09-30

    I think the controller being c2 is the issue, as Webmin expects the first 3ware device to be c0.

    In your /etc/fstab file, what device name do you use for the mounts from the 3ware RAID array?

     
  • I'm using LVM, so my fstab has /dev/mapper devices listed rather than physical partitions.

    However, the devices visible to the operating system are /dev/sda and /dev/sdb.

    sda is 3ware port 0, a single drive for the operating system.
    sdb is 3ware ports 3, 4, 5 in a RAID-5 for storage.

    I will be taking this server down for maintenance later today, so while I'm at it I'll see if I can determine why it is being assigned ID 2.

    I note that the kernel sees this card as scsi2, with the onboard primary and secondary IDE being assigned scsi0 and scsi1, respectively. Perhaps this is why the controller is showing up as c2.

    It could still also be the slot number, or perhaps configurable in the controller's BIOS. I can check both during maintenance and experiment a bit.

    Thank you for looking into this so quickly. Webmin is an amazing piece of software!

     
  • Jamie Cameron
    2010-09-30

    I'm pretty sure that if the 3ware device was on c0, Webmin would recognize it OK. Automatic detection that it is on c2 would be better though .. is there some 3ware command that can be used to discover that /dev/sda maps to c2?

     
  • I'm not aware of any specific command that would easily allow mapping /dev/sdX devices to their parent controller. But perhaps the output of "tw_cli show" could be parsed to determine what controller ids are present? Here's the output of that command on my server:

    Ctl  Model       (V)Ports  Drives  Units  NotOpt  RRate  VRate  BBU
    ------------------------------------------------------------------------
    c2   9550SX-8LP  8         4       2      0       1      1      -
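
As a rough sketch of that parsing idea, the controller ids could be pulled out of the `tw_cli show` table like this. The sample text is the output above; `controller_ids` is a hypothetical name, and this is only an illustration, not what Webmin does.

```python
import re

# Output of "tw_cli show" as posted above.
SHOW_OUTPUT = """\
Ctl  Model       (V)Ports  Drives  Units  NotOpt  RRate  VRate  BBU
------------------------------------------------------------------------
c2   9550SX-8LP  8         4       2      0       1      1      -
"""


def controller_ids(show_output):
    """Return the numeric ids of all controllers listed by "tw_cli show".

    Hypothetical helper: a controller row is any line that starts with
    "c" followed by digits. The "Ctl" header does not match because it
    has no digits after the first letter.
    """
    pattern = re.compile(r"^[ \t]*c(\d+)[ \t]", re.MULTILINE)
    return [int(m.group(1)) for m in pattern.finditer(show_output)]


if __name__ == "__main__":
    print(controller_ids(SHOW_OUTPUT))  # prints [2]
```

Each id found this way could then be passed to `tw_cli info cN` rather than assuming that the first controller is c0.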

     
  • Jamie Cameron
    2010-10-02

    OK, thanks .. I think we are getting closer now.

    So I presume that u0 is /dev/sda, and u1 is /dev/sdb?

     
  • Steve McGrath
    2010-10-02

    Correct, at least in my case. These are the "units" that are created in the 3ware BIOS, and made visible as disks to the host system.

    I've done some more pondering and researching, and I have some thoughts:

    I believe the 3ware driver calls this card c2 because it is scsi controller 2 in the system. It appears, from my googling, that the scsi controller id is assigned in the order that the kernel loads drivers. It apparently probes IDE first, so the onboard IDE channels get scsi0 and scsi1, and the 3ware driver, loaded afterwards, gets scsi2.

    As far as easily mapping /dev/sdX names to controller numbers, I'm not sure of the best way. In my dmesg at least, I can see lines like these:

    [ 1.871715] sd 2:0:0:0: [sda] 488259584 512-byte logical blocks: (249 GB/232 GiB)
    [ 1.873636] sd 2:0:1:0: [sdb] 3906207744 512-byte logical blocks: (1.99 TB/1.81 TiB)

    Those lines show that sda is controller 2, channel 0, unit 0, LUN 0. I'm not sure how to retrieve that information any other way, though.
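
For illustration, a small sketch of extracting that mapping from dmesg text. The two sample lines are the ones quoted above; `scsi_addresses` is a made-up name, and the assumption that the first number equals the tw_cli controller id is only the hypothesis from this post, not something confirmed here.

```python
import re

# The two dmesg lines quoted above.
DMESG = """\
[ 1.871715] sd 2:0:0:0: [sda] 488259584 512-byte logical blocks: (249 GB/232 GiB)
[ 1.873636] sd 2:0:1:0: [sdb] 3906207744 512-byte logical blocks: (1.99 TB/1.81 TiB)
"""

# Matches "sd H:C:T:L: [sdX]" as printed by the kernel's sd driver.
SD_LINE = re.compile(r"sd (\d+):(\d+):(\d+):(\d+): \[(sd[a-z]+)\]")


def scsi_addresses(dmesg_text):
    """Map each sdX name to its (host, channel, target, lun) tuple.

    Hypothetical helper. If the same disk is mentioned on several
    lines, the last occurrence wins.
    """
    result = {}
    for match in SD_LINE.finditer(dmesg_text):
        address = tuple(int(part) for part in match.groups()[:4])
        result[match.group(5)] = address
    return result


if __name__ == "__main__":
    for name, address in sorted(scsi_addresses(DMESG).items()):
        print(name, address)
```

On Linux systems with sysfs mounted, the same four numbers usually appear as the last component of the target of the /sys/block/sdX/device symlink, which would avoid depending on the dmesg buffer still holding the boot messages.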

     
  • Jamie Cameron
    2010-10-05

    Thanks for the info - I have one more question though. Does your system have /dev/twe* or /dev/twa* device files?

     
  • Steve McGrath
    2010-10-05

    I have the /dev/twa* devices.

     
  • Jamie Cameron
    2010-10-06

    Attachment: smart-status.wbm.gz version 2

  • Jamie Cameron
    2010-10-06

    Ok, I have attached a new module update that should help.

    If not, please let me know what lines get logged to /var/webmin/miniserv.error when you open the SMART module page.

     
  • Steve McGrath
    2010-10-06

    I think we're getting closer now. The SMART module now lists the 4 physical disks in the drop-down box. However, it reports that they do not support SMART, even after I manually enabled it and retrieved data with smartctl.

    I am attaching the tail end of my miniserv.error
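
For reference, smartctl addresses a disk behind a 3ware controller by port number through its `-d 3ware,N` device type option. The sketch below only builds the command lines for the populated ports; it does not run them. The device path /dev/twa0 is an assumption taken from the device name the module displayed earlier in this thread, and `smartctl_commands` is a hypothetical helper.

```python
def smartctl_commands(device, ports):
    """Build one "smartctl -a" argument list per 3ware port.

    Hypothetical helper: `device` is the controller character device
    (assumed here to be /dev/twa0) and `ports` are the populated port
    numbers, for example as read from "tw_cli info cN".
    """
    return [["smartctl", "-a", "-d", "3ware,%d" % port, device]
            for port in ports]


if __name__ == "__main__":
    # Ports 0, 3, 4 and 5 are the populated ones on this server.
    for command in smartctl_commands("/dev/twa0", [0, 3, 4, 5]):
        print(" ".join(command))
```

Comparing the output of one of these commands run by hand with what the module reports may help narrow down whether the wrong device file or the wrong port number is being passed.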

     
  • Steve McGrath
    2010-10-06

    Attachment: miniserv.error after installing latest module

  • Jamie Cameron
    2010-10-06

    Attachment: smart-status.wbm.gz version 3

  • Jamie Cameron
    2010-10-06

    Ok, thanks .. try out this even newer update.

    Also, what /dev/twa* files exist on your system?

     
  • I have /dev/twa0 through /dev/twa15.

    For some reason that last file wouldn't extract properly. I had to gunzip it on the command line, where it extracted but with a "trailing garbage ignored" message. Once gunzipped, it installed, but still indicated no SMART support.

     
  • Jamie Cameron
    2010-10-06

    Attachment: smart-status.wbm.gz version 4
