From: Eric R. <er...@ro...> - 2010-08-26 07:57:19
|
I just got some OCZ Vertex 2 SSD's (240 gig) today and I went looking for a way to monitor how quickly I will exhaust the write lifetime of the drive under our workload.

I tried running the version of smartctl that is available by default in CentOS 5.5, but many of the attributes do not make sense (see below).

I then downloaded 5.39.1 along with the latest drivedb.h (from SVN) but I still am having issues making heads or tails of many of the attributes (see way below).

I am going to make the wild and crazy assumption that all of the disks that I got are not simultaneously getting millions of read errors. ;-)

Please let me know if I can run any more tests or provide other output in order to get this device recognized in the database properly.

Thanks in advance!

[root@qa01 ~]# smartctl -A /dev/sdb
smartctl version 5.38 [x86_64-redhat-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   117   110   050    Pre-fail  Always       -       162733300
  5 Reallocated_Sector_Ct   0x0033   094   094   003    Pre-fail  Always       -       2304
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       17815524343817
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       3
171 Unknown_Attribute       0x0000   000   000   000    Old_age   Offline  FAILING_NOW 0
172 Unknown_Attribute       0x0000   000   000   000    Old_age   Offline  FAILING_NOW 0
174 Unknown_Attribute       0x0030   000   000   000    Old_age   Offline  FAILING_NOW 1
177 Unknown_Attribute       0x0000   000   000   000    Old_age   Offline  FAILING_NOW 0
177 Unknown_Attribute <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
181 Unknown_Attribute <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
181 Unknown_Attribute       0x0000   000   000   000    Old_age   Offline  FAILING_NOW 0
181 Unknown_Attribute <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
182 Unknown_Attribute <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
182 Unknown_Attribute       0x0000   000   000   000    Old_age   Offline  FAILING_NOW 0
182 Unknown_Attribute <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
187 Reported_Uncorrect <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
187 Reported_Uncorrect <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
194 Temperature_Celsius <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
194 Temperature_Celsius     0x0022   000   000   000    Old_age   Always   FAILING_NOW 0
194 Temperature_Celsius <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
195 Hardware_ECC_Recovered <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
195 Hardware_ECC_Recovered  0x001c   117   110   003    Old_age   Offline      -       162733300
195 Hardware_ECC_Recovered <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
196 Reallocated_Event_Count <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
196 Reallocated_Event_Count 0x0033   100   100   010    Pre-fail  Always       -       0
196 Reallocated_Event_Count <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
231 Temperature_Celsius <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
231 Temperature_Celsius     0x0013   090   090   000    Pre-fail  Always       -       1
231 Temperature_Celsius <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
233 Unknown_Attribute <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
233 Unknown_Attribute       0x0000   000   000   000    Old_age   Offline  FAILING_NOW 128
233 Unknown_Attribute <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
234 Unknown_Attribute <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
234 Unknown_Attribute       0x0000   000   000   000    Old_age   Offline  FAILING_NOW 640
234 Unknown_Attribute <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
241 Unknown_Attribute <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
241 Unknown_Attribute       0x0032   000   000   000    Old_age   Always   FAILING_NOW 640
241 Unknown_Attribute <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
242 Unknown_Attribute <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
242 Unknown_Attribute       0x0032   000   000   000    Old_age   Always   FAILING_NOW 320
242 Unknown_Attribute <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
  0 Unknown_Attribute <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA

[root@qa01 smartmontools-5.39.1]# ./smartctl --all -B drivedb.h /dev/sdb
smartctl 5.39.1 2010-01-28 r3054 [x86_64-unknown-linux-gnu] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Device Model:     OCZ-VERTEX2 3.5
Serial Number:    OCZ-<removed>
Firmware Version: 1.11
User Capacity:    240,057,409,536 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 6
Local Time is:    Thu Aug 26 00:21:25 2010 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 (   0) seconds.
Offline data collection
capabilities:                    (0x7f) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Abort Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        (  48) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   117   110   050    Pre-fail  Always       -       169450146
  5 Reallocated_Sector_Ct   0x0033   094   094   003    Pre-fail  Always       -       2304
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       171601123344395
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       3
171 Unknown_Attribute       0x0000   000   000   000    Old_age   Offline      -       0
172 Unknown_Attribute       0x0000   000   000   000    Old_age   Offline      -       0
174 Unknown_Attribute       0x0030   000   000   000    Old_age   Offline      -       1
177 Wear_Leveling_Count     0x0000   000   000   ---    Old_age   Offline      -       0
177 Wear_Leveling_Count <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
181 Program_Fail_Cnt_Total <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
181 Program_Fail_Cnt_Total  0x0000   000   000   ---    Old_age   Offline      -       0
181 Program_Fail_Cnt_Total <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
182 Erase_Fail_Count_Total <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
182 Erase_Fail_Count_Total  0x0000   000   000   ---    Old_age   Offline      -       0
182 Erase_Fail_Count_Total <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
187 Reported_Uncorrect <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
187 Reported_Uncorrect      0x0032   100   100   ---    Old_age   Always       -       0
187 Reported_Uncorrect <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
194 Temperature_Celsius <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
194 Temperature_Celsius     0x0022   000   000   ---    Old_age   Always       -       0
194 Temperature_Celsius <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
195 Hardware_ECC_Recovered <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
195 Hardware_ECC_Recovered  0x001c   117   110   ---    Old_age   Offline      -       169450146
195 Hardware_ECC_Recovered <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
196 Reallocated_Event_Count <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
196 Reallocated_Event_Count 0x0033   100   100   ---    Pre-fail  Always       -       0
196 Reallocated_Event_Count <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
231 Temperature_Celsius <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
231 Temperature_Celsius     0x0013   090   090   ---    Pre-fail  Always       -       1
231 Temperature_Celsius <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
233 Media_Wearout_Indicator <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
233 Media_Wearout_Indicator 0x0000   000   000   ---    Old_age   Offline      -       128
233 Media_Wearout_Indicator <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
234 Unknown_Attribute <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
234 Unknown_Attribute       0x0000   000   000   ---    Old_age   Offline      -       640
234 Unknown_Attribute <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
241 Total_LBAs_Written <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
241 Total_LBAs_Written      0x0032   000   000   ---    Old_age   Always       -       640
241 Total_LBAs_Written <== Data Page      |  WARNING: PREVIOUS ATTRIBUTE HAS TWO
242 Total_LBAs_Read <== Threshold Page |  INCONSISTENT IDENTITIES IN THE DATA
242 Total_LBAs_Read         0x0032   000   000   ---    Old_age   Always       -       320

Error SMART Error Log Read failed: Input/output error
Smartctl: SMART Error Log Read Failed
Error SMART Error Self-Test Log Read failed: Input/output error
Smartctl: SMART Self Test Log Read Failed
SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

[root@qa01 smartmontools-5.39.1]#

-Eric
|
From: Christian F. <Chr...@t-...> - 2010-08-26 11:45:55
|
Eric Rosenberry wrote:
> I just got some OCZ Vertex 2 SSD's (240 gig) today and I went
> looking for a way to monitor how quickly I will exhaust the write
> lifetime of the drive under our workload.
> I tried running the version of smartctl that is available by default
> in CentOS 5.5, but many of the attributes do not make sense (see
> below).
> I then downloaded 5.39.1 along with the latest drivedb.h (from SVN)
> but I still am having issues making heads or tails of many of the
> attributes (see way below).

Raw values of attributes 1, 9 and 195 use 56 bits instead of 48 bits and provide two values (errors/total, hours+milliseconds).

> 1 Raw_Read_Error_Rate ...162733300
> 5 Reallocated_Sector_Ct ...2304
> 9 Power_On_Hours ...17815524343817

New print formats have been added to current SVN, as smartctl 5.39.1 is not able to print these properly. The drivedb.h entry from the 5.39 branch uses hex48 as a workaround.

> 177 ... <== Data Page | WARNING: PREVIOUS ATTRIBUTE HAS TWO
> 181 ... <== Threshold Page | INCONSISTENT IDENTITIES IN THE DATA

With current SVN, the above messages will no longer appear. The root of the problem is a missing threshold for attribute 177. According to SandForce this will be fixed in future firmware versions.

I would recommend updating to current SVN.

> Device Model: OCZ-VERTEX2 3.5

This doesn't match the regex in drivedb.h. Older drives use "OCZ-VERTEX2"; OCZ apparently started to append version numbers.

Could you please change the drivedb.h (trunk or branch) entry as follows and test it:

--- drivedb.h (revision 3141)
+++ drivedb.h (working copy)
@@ -90,7 +90,7 @@
   },
   { "SandForce Driven SSDs",
     "SandForce 1st Ed\\.|" // Demo Drive, ...
-    "OCZ[ -](AGILITY2|VERTEX2|VERTEX-LE)", // ...
+    "OCZ[ -](AGILITY2|VERTEX2|VERTEX-LE)( .*)?", // ...
     "", "",
     "-v 1,raw24/raw32,Raw_Read_Error_Rate "
     "-v 5,raw48,Retired_Block_Count "

If this works, I will fix it in SVN on trunk and branch soon.

Thanks,
Christian
|
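The two-values-in-one-raw-field point can be made concrete with a short Python sketch. It assumes the lower 32 bits hold the total (or the hours) and the upper 24 bits hold the error count (or the milliseconds into the current hour); that layout is inferred from the raw24/raw32 format name and from the numbers reported in this thread, not from SandForce documentation. The function names are hypothetical and are not part of smartmontools.

```python
from typing import Tuple


def split_raw56(raw: int) -> Tuple[int, int]:
    """Split a raw attribute value into (upper 24 bits, lower 32 bits)."""
    return (raw >> 32) & 0xFFFFFF, raw & 0xFFFFFFFF


def format_hours_msec(hours: int, msec: int) -> str:
    """Render hours plus milliseconds into the hour, e.g. 19h+26m+44.550s."""
    minutes, ms_left = divmod(msec, 60_000)
    seconds, millis = divmod(ms_left, 1000)
    return f"{hours}h+{minutes:02d}m+{seconds:02d}.{millis:03d}s"


# Attribute 9 as printed by the older smartctl (only 48 bits were read, so
# the upper part is just the low 16 bits of the millisecond field).
partial_msec, hours = split_raw56(17815524343817)
print(hours)  # 9

# Attribute 1: no errors in the upper bits, the whole number is the total.
errors, total = split_raw56(162733300)
print(f"{errors}/{total}")  # 0/162733300
```

Read this way, the huge Power_On_Hours number is simply 9 hours with some millisecond bits stacked above it, which is why the old single-number print format looks nonsensical.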
From: Eric R. <er...@ro...> - 2010-08-26 15:33:06
|
Christian-

Thanks for the quick turnaround on this!

Success!

FYI- The 3.5 in the version number indicates that it is a 3.5 inch form factor (rather than 2.5 in, etc...)

[root@qa01 smartmontools]# ./smartctl --all -B drivedb.h /dev/sdb
smartctl 5.40 2010-08-24 r3141 [x86_64-unknown-linux-gnu] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     SandForce Driven SSDs
Device Model:     OCZ-VERTEX2 3.5
Serial Number:    OCZ-<removed>
Firmware Version: 1.11
User Capacity:    240,057,409,536 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 6
Local Time is:    Thu Aug 26 08:24:34 2010 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 (   0) seconds.
Offline data collection
capabilities:                    (0x7f) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Abort Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        (  48) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   118   110   050    Pre-fail  Always       -       0/196456147
  5 Retired_Block_Count     0x0033   094   094   003    Pre-fail  Always       -       2304
  9 Power_On_Hours_and_Msec 0x0032   100   100   000    Old_age   Always       -       19h+26m+44.550s
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       3
171 Program_Fail_Count      0x0000   000   000   000    Old_age   Offline      -       0
172 Erase_Fail_Count        0x0000   000   000   000    Old_age   Offline      -       0
174 Unexpect_Power_Loss_Ct  0x0030   000   000   000    Old_age   Offline      -       1
177 Wear_Range_Delta        0x0000   000   000   ---    Old_age   Offline      -       0
181 Program_Fail_Count      0x0000   000   000   000    Old_age   Offline      -       0
182 Erase_Fail_Count        0x0000   000   000   000    Old_age   Offline      -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   000   000   000    Old_age   Always       -       0
195 ECC_Uncorr_Error_Count  0x001c   118   110   000    Old_age   Offline      -       0/196456147
196 Reallocated_Event_Count 0x0033   100   100   003    Pre-fail  Always       -       0
231 SSD_Life_Left           0x0013   090   090   010    Pre-fail  Always       -       1
233 SandForce_Internal      0x0000   000   000   000    Old_age   Offline      -       128
234 SandForce_Internal      0x0000   000   000   000    Old_age   Offline      -       640
241 Lifetime_Writes_GiB     0x0032   000   000   000    Old_age   Always       -       640
242 Lifetime_Reads_GiB      0x0032   000   000   000    Old_age   Always       -       384

Error SMART Error Log Read failed: Input/output error
Smartctl: SMART Error Log Read Failed
Error SMART Error Self-Test Log Read failed: Input/output error
Smartctl: SMART Self Test Log Read Failed
SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

[root@qa01 smartmontools]#

-Eric
|
From: Christian F. <Chr...@t-...> - 2010-08-26 17:01:30
|
Eric Rosenberry wrote:
> Thanks for the quick turnaround on this!

Thanks for quick testing!

> Success!

r3142, r3143 !-)

> FYI- The 3.5 in the version number indicates that it is a 3.5 inch
> form factor (rather than 2.5 in, etc...)

I see, thanks for the info.

> [root@qa01 smartmontools]# ./smartctl --all -B drivedb.h /dev/sdb
> smartctl 5.40 2010-08-24 r3141 [x86_64-unknown-linux-gnu] (local build)
> Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net
>
> === START OF INFORMATION SECTION ===
> Model Family: SandForce Driven SSDs
> Device Model: OCZ-VERTEX2 3.5
> Serial Number: OCZ-<removed>

BTW: You could use '-q noserial' to suppress serial number output.

> [...]
> Error SMART Error Log Read failed: Input/output error
> Smartctl: SMART Error Log Read Failed
> Error SMART Error Self-Test Log Read failed: Input/output error
> Smartctl: SMART Self Test Log Read Failed

This is due to http://sourceforge.net/apps/trac/smartmontools/ticket/89

'smartctl -x' should read the new logs.

Thanks,
Christian
|
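Why the original drivedb.h entry missed "OCZ-VERTEX2 3.5" while the patched one catches it can be checked with a few lines of Python. This is only an illustration: smartctl uses POSIX extended regular expressions rather than Python's `re` module, and the sketch assumes the pattern has to match the whole model string, which is consistent with the old entry failing on the suffixed name. The helper `matches_model` is hypothetical.

```python
import re

# Patterns taken from the diff earlier in the thread, with the C string
# escaping removed and the adjacent string literals joined.
OLD_PATTERN = r"SandForce 1st Ed\.|OCZ[ -](AGILITY2|VERTEX2|VERTEX-LE)"
NEW_PATTERN = r"SandForce 1st Ed\.|OCZ[ -](AGILITY2|VERTEX2|VERTEX-LE)( .*)?"


def matches_model(pattern: str, model: str) -> bool:
    """Return True only if the entire model string matches the pattern."""
    return re.fullmatch(pattern, model) is not None


print(matches_model(OLD_PATTERN, "OCZ-VERTEX2"))      # True
print(matches_model(OLD_PATTERN, "OCZ-VERTEX2 3.5"))  # False
print(matches_model(NEW_PATTERN, "OCZ-VERTEX2 3.5"))  # True
```

The optional `( .*)?` group accepts any suffix that starts with a space, so form-factor tags such as " 3.5" no longer prevent a match, and the bare model name still matches.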
From: Eric R. <er...@ro...> - 2010-08-26 23:51:23
|
How does everything look to you now with the -x?

P.S. I see some reference to the SSD_Life_Left attribute perhaps being incorrect in these drives on the current firmware. This is just FYI for anyone that stumbles across this thread:
http://www.ocztechnologyforum.com/forum/showthread.php?76447-Drive-life-attribute-231-is-reporting-incorrectly

-Eric

[root@qa01 smartmontools]# ./smartctl -x -q noserial --all -B drivedb.h /dev/sdb
smartctl 5.40 2010-08-24 r3141 [x86_64-unknown-linux-gnu] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     SandForce Driven SSDs
Device Model:     OCZ-VERTEX2 3.5
Firmware Version: 1.11
User Capacity:    240,057,409,536 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 6
Local Time is:    Thu Aug 26 16:37:32 2010 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 (   0) seconds.
Offline data collection
capabilities:                    (0x7f) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Abort Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        (  48) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   118   110   050    Pre-fail  Always       -       0/193408503
  5 Retired_Block_Count     0x0033   094   094   003    Pre-fail  Always       -       2304
  9 Power_On_Hours_and_Msec 0x0032   100   100   000    Old_age   Always       -       27h+39m+42.240s
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       3
171 Program_Fail_Count      0x0000   000   000   000    Old_age   Offline      -       0
172 Erase_Fail_Count        0x0000   000   000   000    Old_age   Offline      -       0
174 Unexpect_Power_Loss_Ct  0x0030   000   000   000    Old_age   Offline      -       1
177 Wear_Range_Delta        0x0000   000   000   ---    Old_age   Offline      -       0
181 Program_Fail_Count      0x0000   000   000   000    Old_age   Offline      -       0
182 Erase_Fail_Count        0x0000   000   000   000    Old_age   Offline      -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   000   000   000    Old_age   Always       -       0
195 ECC_Uncorr_Error_Count  0x001c   118   110   000    Old_age   Offline      -       0/193408503
196 Reallocated_Event_Count 0x0033   100   100   003    Pre-fail  Always       -       0
231 SSD_Life_Left           0x0013   090   090   010    Pre-fail  Always       -       1
233 SandForce_Internal      0x0000   000   000   000    Old_age   Offline      -       192
234 SandForce_Internal      0x0000   000   000   000    Old_age   Offline      -       832
241 Lifetime_Writes_GiB     0x0032   000   000   000    Old_age   Always       -       832
242 Lifetime_Reads_GiB      0x0032   000   000   000    Old_age   Always       -       704

General Purpose Logging (GPL) feature set supported
General Purpose Log Directory Version 1
SMART Log Directory Version 1 [multi-sector log support]
GP/S  Log at address 0x00 has  1 sectors [Log Directory]
GP    Log at address 0x07 has  1 sectors [Extended self-test log]
SMART Log at address 0x09 has  1 sectors [Selective self-test log]
GP    Log at address 0x10 has  1 sectors [NCQ Command Error]
GP/S  Log at address 0x11 has  1 sectors [SATA Phy Event Counters]
GP/S  Log at address 0x80 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x81 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x82 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x83 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x84 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x85 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x86 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x87 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x88 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x89 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x8a has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x8b has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x8c has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x8d has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x8e has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x8f has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x90 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x91 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x92 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x93 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x94 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x95 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x96 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x97 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x98 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x99 has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x9a has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x9b has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x9c has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x9d has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x9e has 16 sectors [Host vendor specific log]
GP/S  Log at address 0x9f has 16 sectors [Host vendor specific log]
GP/S  Log at address 0xb7 has 16 sectors [Device vendor specific log]
GP/S  Log at address 0xe0 has  1 sectors [SCT Command/Status]
GP/S  Log at address 0xe1 has  1 sectors [SCT Data Transfer]

SMART Extended Comprehensive Error Log (GP Log 0x03) not supported
Error SMART Error Log Read failed: Input/output error
Smartctl: SMART Error Log Read Failed
SMART Extended Self-test Log Version: 1 (1 sectors)
No self-tests have been logged. [To run self-tests, use: smartctl -t]

Error SMART Error Self-Test Log Read failed: Input/output error
Smartctl: SMART Self Test Log Read Failed
SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       0 (0x0000)
SCT Support Level:                   1
Device State:                        Active (0)
Current Temperature:                 ? Celsius
Power Cycle Min/Max Temperature:     127/-127 Celsius
Lifetime Min/Max Temperature:        0/ 0 Celsius
Under/Over Temperature Limit Count:  0/0
SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        1 minute
Min/Max recommended Temperature:     ?/ ? Celsius
Min/Max Temperature Limit:           ?/ ? Celsius
Temperature History Size (Index):    478 (231)

Index    Estimated Time   Temperature Celsius
 232    2010-08-26 08:40     ?  -
 ...    ..(476 skipped).    ..  -
 231    2010-08-26 16:37     ?  -

SCT Error Recovery Control:
           Read:    256 (25.6 seconds)
          Write:    256 (25.6 seconds)

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x000a  2            5  Device-to-host register FISes sent due to a COMRESET

[root@qa01 smartmontools]#

-Eric
|
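For anyone who wants to track these counters over time, the attribute rows are easy to pick apart. The Python sketch below parses text shaped like the `smartctl -A` table in this thread; it assumes the ten-column layout shown above, does not invoke smartctl itself, and the names `Attr` and `parse_attributes` are hypothetical.

```python
from typing import Dict, NamedTuple


class Attr(NamedTuple):
    name: str
    value: int  # normalized VALUE column
    raw: str    # RAW_VALUE column, kept as text (it may look like "0/193408503")


def parse_attributes(text: str) -> Dict[int, Attr]:
    """Collect attribute rows keyed by ID, skipping headers and warning lines."""
    attrs: Dict[int, Attr] = {}
    for line in text.splitlines():
        fields = line.split()
        # A data row starts with a numeric ID and has a hex FLAG in column 3.
        if len(fields) >= 10 and fields[0].isdigit() and fields[2].startswith("0x"):
            attrs[int(fields[0])] = Attr(fields[1], int(fields[3]), " ".join(fields[9:]))
    return attrs


SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  9 Power_On_Hours_and_Msec 0x0032   100   100   000    Old_age   Always       -       27h+39m+42.240s
231 SSD_Life_Left           0x0013   090   090   010    Pre-fail  Always       -       1
241 Lifetime_Writes_GiB     0x0032   000   000   000    Old_age   Always       -       832
"""

attrs = parse_attributes(SAMPLE)
print(attrs[231].value, attrs[241].raw)  # 90 832
```

Keeping the raw column as text avoids guessing at formats such as `0/193408503` or `27h+39m+42.240s`; a caller can convert only the attributes it understands.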
From: Christian F. <Chr...@t-...> - 2010-08-27 13:27:37
|
Eric Rosenberry wrote:
> How does everything look to you now with the -x?

Looks good. The temperature table is empty because these drives don't have a sensor.

>   9 Power_On_Hours_and_Msec 0x0032 100 100 000 Old_age Always - 27h+39m+42.240s
>  12 Power_Cycle_Count       0x0032 100 100 000 Old_age Always - 3
> ...
> 241 Lifetime_Writes_GiB     0x0032 000 000 000 Old_age Always - 832
> 242 Lifetime_Reads_GiB      0x0032 000 000 000 Old_age Always - 704

Do the above raw values look reasonable?

> P.S. I see some reference to the SSD_Life_Left attribute perhaps
> being incorrect in these drives on the current firmware. This is just
> FYI for anyone that stumbles across this thread:
> http://www.ocztechnologyforum.com/forum/showthread.php?76447-Drive-life-attribute-231-is-reporting-incorrectly

The raw value is not documented; the normalized value should be the percentage of remaining drive life.

> 231 SSD_Life_Left 0x0013 090 090 010 Pre-fail Always - 1

90% life left after ~27h uptime and ~800GiB writes? The problem is probably that the value decreases too fast.

Thanks,
Christian
|
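The back-of-the-envelope check above can be written out. The Python sketch below assumes, purely for illustration, that the normalized SSD_Life_Left value falls linearly with use; nothing in this thread confirms that the firmware behaves this way, and the function name is hypothetical.

```python
def project_total(life_left_pct: float, consumed_so_far: float) -> float:
    """Extrapolate the total budget (GiB written, hours, ...) at 0% life left."""
    used_pct = 100.0 - life_left_pct
    if used_pct <= 0:
        raise ValueError("no wear recorded yet; nothing to extrapolate from")
    return consumed_so_far * 100.0 / used_pct


# Values from the smartctl -x output: 90% left, 832 GiB written,
# 27h+39m+42.240s of power-on time.
uptime_hours = 27 + 39 / 60 + 42.240 / 3600

print(project_total(90, 832))                         # 8320.0
print(f"{project_total(90, uptime_hours) / 24:.1f}")  # 11.5
```

Taken at face value, the drive would be worn out after roughly 8 TiB of writes, or about eleven and a half days at this workload. That looks far too pessimistic, which is consistent with the suspicion that the attribute drops too fast on this firmware.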
From: Eric R. <er...@ro...> - 2010-08-27 16:45:58
|
On Fri, Aug 27, 2010 at 6:27 AM, Christian Franke wrote:
> Do the above raw values look reasonable ?

Yup. The drive just got into my hot hands a little over 24 hours ago and we have written about that amount of data to it already.

> 90% life left after ~27h uptime and ~800GiB writes? The problem is
> probably that the value decreases too fast.

Indeed. Seems too aggressive. One of the drives was at 88% after only having it for one day...

Thanks again!

-Eric
|