From: Israel G. <iga...@gm...> - 2009-11-13 20:54:36
|
Hi list: I have a supermicro server with an adaptec 4x10 raid card and I want to monitor health status of the 4 hdd. This is the lspci output: 06:00.0 RAID bus controller: Adaptec AAC-RAID (rev 09) Subsystem: Super Micro Computer Inc AOC-USAS-S4i Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 32 bytes Interrupt: pin A routed to IRQ 28 Region 0: Memory at d8000000 (64-bit, non-prefetchable) [size=2M] [virtual] Expansion ROM at d8500000 [disabled] [size=512K] Capabilities: [98] Power Management version 2 Flags: PMEClk- DSI- D1+ D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [a0] Message Signalled Interrupts: Mask- 64bit+ Queue=0/1 Enable- Address: 0000000000000000 Data: 0000 Capabilities: [d0] Express (v1) Endpoint, MSI 00 DevCap: MaxPayload 512 bytes, PhantFunc 0, Latency L0s unlimited, L1 <1us ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset- DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported- RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+ MaxPayload 256 bytes, MaxReadReq 512 bytes DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr- TransPend- LnkCap: Port #0, Speed 2.5GT/s, Width x8, ASPM L0s, Latency L0 <128ns, L1 unlimited ClockPM- Suprise- LLActRep- BwNot- LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk+ ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt- LnkSta: Speed 2.5GT/s, Width x8, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt- Capabilities: [90] Vital Product Data <?> Capabilities: [100] Advanced Error Reporting <?> Kernel driver in use: aacraid Kernel modules: aacraid As I know adaptec raid card is not supported by smartmontool, I read somewhere I can monitor every SATA disk accesing directly using scsi driver from the kernel, this way #smartctl -a -d sat /dev/sg1 #smartctl -a -d sat /dev/sg2 #smartctl -a -d sat /dev/sg3 #smartctl -a -d sat /dev/sg4 This way I get all SMART info from 4 hdds. Questions: Is any chance or possibility to harm the RAID or any data on any disk I'm accessing the way I show above? Is there any other way to get SMART health of the 4 disk of my server? thanks in advance -- Regards; Israel Garcia |
From: Martin B. <Mar...@ic...> - 2009-11-16 08:18:29
|
Hi, > Kernel modules: aacraid > As I know adaptec raid card is not supported by smartmontool, I read > somewhere I can monitor every SATA disk accesing directly using scsi > driver from the kernel, this way Since the aacraid driver exposes the /dev/sg* devices just to allow programs like smartmontool to work, I'd say the combination is quite well supported. > #smartctl -a -d sat /dev/sg1 > #smartctl -a -d sat /dev/sg2 > #smartctl -a -d sat /dev/sg3 > #smartctl -a -d sat /dev/sg4 > This way I get all SMART info from 4 hdds. > Questions: > Is any chance or possibility to harm the RAID or any data on any disk > I'm accessing the way I show above? > Is there any other way to get SMART health of the 4 disk of my server? The (r/o) access to the /dev/sg* devices by smartmontool should be save. There is another way to get overall SMART status of the devices using adaptecs storage manager software: /usr/StorMan/arcconf GETCONFIG 1 will show you controller and device configuration including SMART status and number of SMATRT warning for each device, here's an excerpt for just one disk: ---------------------------------------------------------------------- Physical Device information ---------------------------------------------------------------------- Device #0 Device is a Hard drive State : Online Supported : Yes Transfer Speed : SATA 3.0 Gb/s Reported Channel,Device(T:L) : 0,0(0:0) Reported Location : Connector 0, Device 0 Vendor : Model : ST31500341AS Firmware : CC1H Serial number : 9VS29Q3X Size : 1430799 MB Write Cache : Enabled (write-back) FRU : None S.M.A.R.T. : No S.M.A.R.T. warnings : 0 Power State : Full rpm Supported Power States : Full rpm,Powered off SSD : No MaxIQ Cache Capable : No MaxIQ Cache Assigned : No NCQ status : Enabled Bye, Martin |
From: Israel G. <iga...@gm...> - 2009-11-16 15:04:24
|
On 11/16/09, Martin Bene <Mar...@ic...> wrote: > Hi, > >> Kernel modules: aacraid > >> As I know adaptec raid card is not supported by smartmontool, I read >> somewhere I can monitor every SATA disk accesing directly using scsi >> driver from the kernel, this way > > Since the aacraid driver exposes the /dev/sg* devices just to allow programs > like smartmontool to work, I'd say the combination is quite well supported. Hi Martin, Thanks for your answer, I see you can do some self-test to disks using smartctl, but I want to ask you if this self-test are necesary to get the health status of all disks. When do I have to run this self-test to disks? From time to time? Or only when problems are detected on disks? thanks again. regards, Israel. > >> #smartctl -a -d sat /dev/sg1 >> #smartctl -a -d sat /dev/sg2 >> #smartctl -a -d sat /dev/sg3 >> #smartctl -a -d sat /dev/sg4 > >> This way I get all SMART info from 4 hdds. > >> Questions: > >> Is any chance or possibility to harm the RAID or any data on any disk >> I'm accessing the way I show above? > >> Is there any other way to get SMART health of the 4 disk of my server? > > The (r/o) access to the /dev/sg* devices by smartmontool should be save. > > There is another way to get overall SMART status of the devices using > adaptecs storage manager software: > > /usr/StorMan/arcconf GETCONFIG 1 > > will show you controller and device configuration including SMART status and > number of SMATRT warning for each device, here's an excerpt for just one > disk: > > ---------------------------------------------------------------------- > Physical Device information > ---------------------------------------------------------------------- > Device #0 > Device is a Hard drive > State : Online > Supported : Yes > Transfer Speed : SATA 3.0 Gb/s > Reported Channel,Device(T:L) : 0,0(0:0) > Reported Location : Connector 0, Device 0 > Vendor : > Model : ST31500341AS > Firmware : CC1H > Serial number : 9VS29Q3X > Size : 1430799 MB > Write Cache : Enabled (write-back) > FRU : None > S.M.A.R.T. : No > S.M.A.R.T. warnings : 0 > Power State : Full rpm > Supported Power States : Full rpm,Powered off > SSD : No > MaxIQ Cache Capable : No > MaxIQ Cache Assigned : No > NCQ status : Enabled > > Bye, Martin > > -- Regards; Israel Garcia |