From: Ivan D. <im...@fy...> - 2003-10-09 16:16:57
|
Dear madam/sir, we are using your software and first of all thank you for this. Soft is great. Just small question. When I am getting the information about my scsi device with smartctl -a /dev/sdf1 between others it gives ... Manufactured in week 00 of year 2000 ... what it means? Where from smartctl takes this date info? My problem is that harddrive which we bought is labeled by date: 28 Apr 2003 and 'smartctl' gives year 2000. Who is wrong? full output: ------------------------------------------- # smartctl -a /dev/sdb1 smartctl version 5.20 Copyright (C) 2002-3 Bruce Allen Home page is http://smartmontools.sourceforge.net/ Device: MAXTOR ATLAS15K_73SCA Version: DT60 Serial number: C80356MK Device type: disk Local Time is: Thu Oct 9 19:05:50 2003 EEST Device supports SMART and is Enabled Temperature Warning Enabled SMART Health Status: OK Current Drive Temperature: 36 C Manufactured in week 00 of year 2000 Current start stop count: 182 times Recommended start stop count: 4294967295 times Error counter log: Errors Corrected Total Total Correction Gigabytes Total delay: [rereads/ errors algorithm processed uncorrected minor | major rewrites] corrected invocations [10^9 bytes] errors read: 349618 0 0 0 0 772.206 0 write: 0 0 0 0 0 668.155 0 Non-medium error count: 4 No self-tests have been logged Long (extended) Self Test duration: 2056 seconds [34.3 minutes] ------------------------------------------ thank you in advance, Ivan Degtyarenko --------- Ivan Degtyarenko ----------- im...@fy... ---------- Laboratory of Physics Helsinki University of Technology tel +358-9-451 3103 P.O.Box 1100 fax +358-9-451 3116 02015 Espoo, Finland ---------------------------------- http://www.fyslab.hut.fi/ ------ |
From: Harry G. M. J. <w5...@wo...> - 2004-02-21 20:11:15
|
Fcc: +sent Reply-to: w5...@wo... Subject: 5.27 Build Problem -------- I just downloaded and built version 5.27 specifying --prefix=/usr/local in configure. The build worked, but config.h had: /* smartmontools System Configuration Directory */ #define SMARTMONTOOLS_SYSCONFDIR "${prefix}/etc" instead of #define SMARTMONTOOLS_SYSCONFDIR "/usr/local/etc" Having edited that by hand, the rest of the build seems to be ok and to run ok so far... so I'm not sure if there are other places that ${prefix} didn't get expanded. The install put everything in the right places, so ${prefix} is expanded properly for some things at least. Please forgive me if this has already been mentioned before on the list, but I sent this in case it had not... Harry -- Harry Work: Internet e-mail: hg...@la... (Harry G. McGavran, Jr.) Los Alamos National Laboratory, Los Alamos, New Mexico 87545 Phone: 505/667-4050 Home: Internet e-mail: w5...@wo... -- Harry Work: Internet e-mail: hg...@la... (Harry G. McGavran, Jr.) Los Alamos National Laboratory, Los Alamos, New Mexico 87545 Phone: 505/667-4050 Home: Internet e-mail: w5...@wo... |
From: Bruce A. <ba...@gr...> - 2004-02-22 20:12:17
|
Hi Harry, This is a new problem -- thanks for the report. Guido, I can reproduce this with: ./configure --prefix=/usr/local Do you understand what's wrong, and if so, can you check in a fix? I've looked and don't see what's wrong. Cheers, Bruce On Sat, 21 Feb 2004, Harry G. McGavran Jr. wrote: > Fcc: +sent > Reply-to: w5...@wo... > Subject: 5.27 Build Problem > -------- > > I just downloaded and built version 5.27 specifying > --prefix=/usr/local in configure. The build worked, > but config.h had: > > /* smartmontools System Configuration Directory */ > #define SMARTMONTOOLS_SYSCONFDIR "${prefix}/etc" > > instead of > > #define SMARTMONTOOLS_SYSCONFDIR "/usr/local/etc" > > Having edited that by hand, the rest of the build seems to be ok > and to run ok so far... so I'm not sure if there are other > places that ${prefix} didn't get expanded. > > The install put everything in the right places, so ${prefix} > is expanded properly for some things at least. > > Please forgive me if this has already been mentioned before > on the list, but I sent this in case it had not... > > Harry > > > > -- > > > Harry > > Work: > Internet e-mail: hg...@la... (Harry G. McGavran, Jr.) > Los Alamos National Laboratory, Los Alamos, New Mexico 87545 > Phone: 505/667-4050 > > Home: > Internet e-mail: w5...@wo... > > > -- > > > Harry > > Work: > Internet e-mail: hg...@la... (Harry G. McGavran, Jr.) > Los Alamos National Laboratory, Los Alamos, New Mexico 87545 > Phone: 505/667-4050 > > Home: > Internet e-mail: w5...@wo... > > > > > > ------------------------------------------------------- > SF.Net is sponsored by: Speed Start Your Linux Apps Now. > Build and deploy apps & Web services for Linux with > a free DVD software kit from IBM. Click Now! > http://ads.osdn.com/?ad_id=1356&alloc_id=3438&op=click > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support > > |
From: Guido G. <ag...@si...> - 2004-02-23 10:29:11
|
On Sun, Feb 22, 2004 at 02:04:46PM -0600, Bruce Allen wrote: > Guido, I can reproduce this with: > ./configure --prefix=3D/usr/local > Do you understand what's wrong, and if so, can you check in a fix? I've > looked and don't see what's wrong. Well ${sysconfdir} is passed down to Makefile as '${prefix}/etc' (have a look at the resulting Makefile), so what I'd do is to _not_ write it into config.h but into the Makefile: --- configure.in.old 2004-02-22 22:45:35.000000000 +0100 +++ configure.in 2004-02-22 22:46:24.000000000 +0100 @@ -139,8 +139,8 @@ fi =20 AC_DEFINE_UNQUOTED(SMARTMONTOOLS_BUILD_HOST, "${host}", = [smartmontools Build Host]) -AC_DEFINE_UNQUOTED(SMARTMONTOOLS_SYSCONFDIR, "${sysconfdir}", = [smartmontools System Configuration Directory]) =20 +CFLAGS=3D"$CFLAGS -DSMARTMONTOOLS_SYSCONFDIR=3D${sysconfdir}" AC_SUBST(CFLAGS) =20 AC_OUTPUT(Makefile examplescripts/Makefile smartd.initd) This will make sure we even catch cases where the user does weird things like: ./configure --prefix=3D/usr/local && make prefix=3D/usr/mylocal sourceforge refuses my logins so go ahead and commit the fix if you think it's o.k. Cheeers, -- Guido |
From: Bruce A. <ba...@gr...> - 2004-02-23 16:53:15
|
Hi Guido, On Sun, 22 Feb 2004, Guido Guenther wrote: > On Sun, Feb 22, 2004 at 02:04:46PM -0600, Bruce Allen wrote: > > Guido, I can reproduce this with: > > ./configure --prefix=/usr/local > > Do you understand what's wrong, and if so, can you check in a fix? I've > > looked and don't see what's wrong. > Well ${sysconfdir} is passed down to Makefile as '${prefix}/etc' (have a > look at the resulting Makefile), so what I'd do is to _not_ write it > into config.h but into the Makefile: > > --- configure.in.old 2004-02-22 22:45:35.000000000 +0100 > +++ configure.in 2004-02-22 22:46:24.000000000 +0100 > @@ -139,8 +139,8 @@ > fi > > AC_DEFINE_UNQUOTED(SMARTMONTOOLS_BUILD_HOST, "${host}", [smartmontools Build Host]) > -AC_DEFINE_UNQUOTED(SMARTMONTOOLS_SYSCONFDIR, "${sysconfdir}", [smartmontools System Configuration Directory]) > > +CFLAGS="$CFLAGS -DSMARTMONTOOLS_SYSCONFDIR=${sysconfdir}" > AC_SUBST(CFLAGS) > > AC_OUTPUT(Makefile examplescripts/Makefile smartd.initd) > > This will make sure we even catch cases where the user does weird things > like: > ./configure --prefix=/usr/local && make prefix=/usr/mylocal > sourceforge refuses my logins so go ahead and commit the fix if you > think it's o.k. Will do, but a question first. Wouldn't it be better to put -DSMARTMONTOOLS_SYSCONFDIR=${sysconfdir} into Makefile.am? That way if the user changes CFLAGS when they do 'make', they won't lose the define. Cheers, Bruce |
From: Guido G. <ag...@si...> - 2004-02-23 17:38:52
|
On Mon, Feb 23, 2004 at 10:45:13AM -0600, Bruce Allen wrote: > Will do, but a question first. Wouldn't it be better to put > -DSMARTMONTOOLS_SYSCONFDIR=${sysconfdir} > into Makefile.am? That way if the user changes CFLAGS when they do > 'make', they won't lose the define. Well, the string is _appended_ to the CFLAGS passed to configure. If you want the flexibility to change the CFLAGS later on, I won't object. Cheers, -- Guido |
From: <pz...@fr...> - 2004-02-26 19:18:02
|
I'm a programmer. Please send me a list of S.M.A.R.T. attributes if it's possibile. I found some list in the net, but not the whole ID 1 upto 255. Thanx ____________________________________________________________________ Miert fizetsz az internetert? Korlatlan, ingyenes internet hozzaferes a FreeStarttol. Probald ki most! http://www.freestart.hu |
From: Bruce A. <ba...@gr...> - 2004-02-26 23:32:10
|
> I'm a programmer. Please send me a list of S.M.A.R.T. attributes if > it's possibile. I found some list in the net, but not the whole ID 1 > upto 255. Thanx Look at ataPrintSmartAttribName() in: http://cvs.sourceforge.net/viewcvs.py/smartmontools/sm5/atacmds.c?view=markup |
From: <sse...@cs...> - 2005-04-20 10:56:29
|
Hello Here is pissible test status messages: "Completed without error " "Aborted by host " "Interrupted (host reset) " "Fatal or unknown error " "Completed: unknown failure " "Completed: electrical failure" "Completed: servo/seek failure" "Completed: read failure " "Completed: handling damage?? " "Self-test routine in progress" "Unknown/reserved test status " Most of them are intuitively clear, but some not much. Could you please give little explonation regarding: "Completed: electrical failure" "Completed: servo/seek failure" "Completed: read failure " "Completed: handling damage?? " Or specification/documentation if it is difficult to say few words about that. The difference between "Fatal or unknown error " "Completed: unknown failure " is that first one informs about not finished test and second says that test completes with unknown status, correct? Thank you a lot Regards, Stas |
From: Phunk <ph...@ma...> - 2005-04-22 08:17:20
|
Hello. I have a couple of Cheetah's combined into hardware RAID1 on MegaRAID card and really would like to know if something goes wrong with those discs, but after reading FAQ's I still have no idea how can I configure smartd for such monitoring: array present as /dev/sda in system, how can I reach separate drives? It's a Redhat-a-like on 2.6 kernel. Thank you. |
From: Michael M. <mi...@np...> - 2005-04-22 11:20:32
|
Hi, > Hello. > I have a couple of Cheetah's combined into hardware RAID1 on > MegaRAID card and really would like to know if something goes wrong > with those discs, but after reading FAQ's I still have no idea how > can I configure smartd for such monitoring: array present as > /dev/sda in system, how can I reach separate drives? It's a Redhat-a- > like on 2.6 kernel. Thank you. The way I monitor these is through the Dell monitor utilities (Server Manager etc), you can still find the ftp site through a google search and install them into Linux to monitor the arrays. To the best of my knowledge, smartd won't handle them as the controller "hides" the drives from the OS. Someone correct me if I'm wrong. Michael. |
From: <sou...@tv...> - 2005-04-22 12:18:30
|
On Fri, 22 Apr 2005, Michael Mansour wrote: > > Hello. > > I have a couple of Cheetah's combined into hardware RAID1 on > > MegaRAID card and really would like to know if something goes wrong > > with those discs, but after reading FAQ's I still have no idea how > > can I configure smartd for such monitoring: array present as > > /dev/sda in system, how can I reach separate drives? It's a Redhat-a- > > like on 2.6 kernel. Thank you. >=20 > The way I monitor these is through the Dell monitor utilities (Server > Manager etc), you can still find the ftp site through a google search a= nd > install them into Linux to monitor the arrays. >=20 > To the best of my knowledge, smartd won't handle them as the > controller "hides" the drives from the OS. Someone correct me if I'm wr= ong. Correct. The megaraid driver has a feature to pass commands through to the physica= l=20 drives, but it is sparingly documented and only used by the vendor=20 utilities so far. My megaraid-using computer is also my main workstation, which disinclines= =20 me from doing too adventurous experiments. --=20 Erik I. Bols=F8 |
From: Anton L. <ph...@ma...> - 2005-04-22 19:19:32
|
Thank you. One more question, after short test fails, i'm getting such log messages each time another short test on the same HDD passes: Apr 22 20:21:37 217 smartd[2330]: Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error Apr 22 20:21:37 217 smartd[2330]: # 2 Short offline Completed: read failure 60% 3218 552882021 Apr 22 20:21:37 217 smartd[2330]: #10 Short offline Completed: read failure 60% 3050 552890173 Also a mail message from smartd itself arrives, and it would be enough for me, but server keeps informing me about past errors for 24 cycles of tests (even when sector was relocated and there's no failures in fresh tests), and it's quite annoying. How can I keep smartd mails but disable showing failures in system logs? Or decrease 24 cycles to 1-2 (i'm checking disks daily, so 24 cycles means 24 days of such reports). I've found -U option -U ID [ATA only] Report if the number of offline uncorrectable sectors is non-zero. Here ID is the id number of the Attribute whose raw value is the Offline Uncorrectable sector count. The default value of ID is 198. To turn off this reporting, use ID = 0. The allowed range of ID is 0 to 255 inclusive. An offline uncorrectable sector is a disk sector which was not readable during an off-line scan or a self-test. This is important to know, because if you have data stored in this disk sector, and you need to read it, the read will fail. Please see the previous ?-C? option for more details. but it's not so clear how much warnings would be supressed, at least 1 mail message is necessary of course. |
From: Bruce A. <ba...@gr...> - 2005-04-22 19:45:38
|
> Thank you. One more question, after short test fails, i'm getting such log > messages each time another short test on the same HDD passes: > > Apr 22 20:21:37 217 smartd[2330]: Num Test_Description Status > Remaining LifeTime(hours) LBA_of_first_error > Apr 22 20:21:37 217 smartd[2330]: # 2 Short offline Completed: read > failure 60% 3218 552882021 > Apr 22 20:21:37 217 smartd[2330]: #10 Short offline Completed: read > failure 60% 3050 552890173 I don't know where this message is coming from. Smartd logs a message, but not of this form. Apparently someone has modified smartd or is using it to run some external script. > Also a mail message from smartd itself arrives, and it would be enough for > me, but server keeps informing me about past errors for 24 cycles of tests > (even when sector was relocated and there's no failures in fresh tests), and > it's quite annoying. How can I keep smartd mails but disable showing > failures in system logs? Or decrease 24 cycles to 1-2 (i'm checking disks > daily, so 24 cycles means 24 days of such reports). I've found -U option > > -U ID [ATA only] Report if the number of offline uncorrectable > sectors is non-zero. Here ID is the id number of the Attribute whose raw > value is the Offline Uncorrectable > sector count. The default value of ID is 198. To turn off > this reporting, use ID = 0. The allowed range of ID is 0 to 255 inclusive. > > An offline uncorrectable sector is a disk sector which was not > readable during an off-line scan or a self-test. This is important to know, > because if you have data > stored in this disk sector, and you need to read it, the read > will fail. Please see the previous ?-C? option for more details. > > but it's not so clear how much warnings would be supressed, at least 1 mail > message is necessary of course. I'm confused about the mail messages from smartd. Perhaps you could send a couple of examples. Bruce |
From: Anton L. <ph...@ma...> - 2005-04-22 20:04:27
|
Hm, you gave me an idea. I have a script executing daily to check temperature of all disks, like echo -n echo -n sda smartctl -a /dev/twe0 -d 3ware,0 | grep Tempe | cut --bytes=87- smartctl -a /dev/twe0 -d 3ware,0 | grep result | cut --bytes=50- echo -n sdi smartctl -a /dev/twe1 -d 3ware,0 | grep Tempe | cut --bytes=87- smartctl -a /dev/twe1 -d 3ware,0 | grep result | cut --bytes=50- echo -n sdb smartctl -a /dev/twe0 -d 3ware,1 | grep Tempe | cut --bytes=87- smartctl -a /dev/twe0 -d 3ware,1 | grep result | cut --bytes=50- ... .. . I will disable it now and see if it helps. But mail message is a standard thing I guess, every time short test fails, i'm getting it via sms, looks like ------- This email was generated by the smartd daemon running on: host name: 12.12.12.12 DNS domain: 12.12.12 NIS domain: (none) The following warning/error was logged by the smartd daemon: Device: /dev/twe0 [3ware_disk_03], Self-Test Log error count increased from 1 to 2 For details see host's SYSLOG (default: /var/log/messages). You can also use the smartctl utility for further investigation. The original email about this issue was sent at Fri Apr 15 08:51:37 2005 MSD Another email message will be sent in 24 hours if the problem persists. --------- Here's part of my smartd.conf /dev/twe0 -d 3ware,0 -a -I 194 -I 8 -m sm...@do... -M daily -s S/../.././07 /dev/twe0 -d 3ware,1 -a -I 194 -I 8 -m sm...@do... -M daily -s S/../.././07 /dev/twe0 -d 3ware,2 -a -I 194 -I 8 -m sm...@do... -M daily -s S/../.././08 /dev/twe0 -d 3ware,3 -a -I 194 -I 8 -m sm...@do... -M daily -s S/../.././08 ... .. . So two drives on diff cards are being checked every hour and if something goes wrong, scary sms wakes me in the morning. ----- Original Message ----- From: "Bruce Allen" <ba...@gr...> To: "Anton Losev" <ph...@ma...> Cc: <sma...@li...> Sent: 22 ?????? 2005 ?. 23:44 Subject: Re: [smartmontools-support]turning off log warnings but keeping mails >> Thank you. One more question, after short test fails, i'm getting such >> log >> messages each time another short test on the same HDD passes: >> >> Apr 22 20:21:37 217 smartd[2330]: Num Test_Description Status >> Remaining LifeTime(hours) LBA_of_first_error >> Apr 22 20:21:37 217 smartd[2330]: # 2 Short offline Completed: >> read >> failure 60% 3218 552882021 >> Apr 22 20:21:37 217 smartd[2330]: #10 Short offline Completed: >> read >> failure 60% 3050 552890173 > > I don't know where this message is coming from. Smartd logs a message, > but not of this form. Apparently someone has modified smartd or is using > it to run some external script. > >> Also a mail message from smartd itself arrives, and it would be enough >> for >> me, but server keeps informing me about past errors for 24 cycles of >> tests >> (even when sector was relocated and there's no failures in fresh tests), >> and >> it's quite annoying. How can I keep smartd mails but disable showing >> failures in system logs? Or decrease 24 cycles to 1-2 (i'm checking disks >> daily, so 24 cycles means 24 days of such reports). I've found -U option >> >> -U ID [ATA only] Report if the number of offline uncorrectable >> sectors is non-zero. Here ID is the id number of the Attribute whose raw >> value is the Offline Uncorrectable >> sector count. The default value of ID is 198. To turn off >> this reporting, use ID = 0. The allowed range of ID is 0 to 255 >> inclusive. >> >> An offline uncorrectable sector is a disk sector which was >> not >> readable during an off-line scan or a self-test. This is important to >> know, >> because if you have data >> stored in this disk sector, and you need to read it, the >> read >> will fail. Please see the previous ?-C? option for more details. >> >> but it's not so clear how much warnings would be supressed, at least 1 >> mail >> message is necessary of course. > > I'm confused about the mail messages from smartd. Perhaps you could send > a couple of examples. > > Bruce > > > > ------------------------------------------------------- > SF email is sponsored by - The IT Product Guide > Read honest & candid reviews on hundreds of IT Products from real users. > Discover which products truly live up to the hype. Start reading now. > http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support > |
From: Anton L. <ph...@ma...> - 2005-04-22 20:07:38
|
p.s. but I really sure that those log messages are from smartd, not from smartctl, so if anyone has more ideas, please advice. What usuallyhappens when one of your drives fails on a short test, how smartd informs you about a problem? > Hm, you gave me an idea. I have a script executing daily to check > temperature of all disks, like > > echo -n > echo -n sda > smartctl -a /dev/twe0 -d 3ware,0 | grep Tempe | cut --bytes=87- > smartctl -a /dev/twe0 -d 3ware,0 | grep result | cut --bytes=50- > echo -n sdi > smartctl -a /dev/twe1 -d 3ware,0 | grep Tempe | cut --bytes=87- > smartctl -a /dev/twe1 -d 3ware,0 | grep result | cut --bytes=50- > echo -n sdb > smartctl -a /dev/twe0 -d 3ware,1 | grep Tempe | cut --bytes=87- > smartctl -a /dev/twe0 -d 3ware,1 | grep result | cut --bytes=50- > ... > .. > . > > I will disable it now and see if it helps. > But mail message is a standard thing I guess, every time short test > fails, i'm getting it via sms, looks like > ------- > This email was generated by the smartd daemon running on: > > host name: 12.12.12.12 > DNS domain: 12.12.12 > NIS domain: (none) > > The following warning/error was logged by the smartd daemon: > > Device: /dev/twe0 [3ware_disk_03], Self-Test Log error count increased > from 1 to 2 > > For details see host's SYSLOG (default: /var/log/messages). > > You can also use the smartctl utility for further investigation. > The original email about this issue was sent at Fri Apr 15 08:51:37 2005 > MSD > Another email message will be sent in 24 hours if the problem persists. > --------- > > Here's part of my smartd.conf > /dev/twe0 -d 3ware,0 -a -I 194 -I 8 -m sm...@do... -M daily -s > S/../.././07 > /dev/twe0 -d 3ware,1 -a -I 194 -I 8 -m sm...@do... -M daily -s > S/../.././07 > > /dev/twe0 -d 3ware,2 -a -I 194 -I 8 -m sm...@do... -M daily -s > S/../.././08 > /dev/twe0 -d 3ware,3 -a -I 194 -I 8 -m sm...@do... -M daily -s > S/../.././08 > ... > .. > . > > So two drives on diff cards are being checked every hour and if something > goes wrong, scary sms wakes me in the morning. > > > ----- Original Message ----- > From: "Bruce Allen" <ba...@gr...> > To: "Anton Losev" <ph...@ma...> > Cc: <sma...@li...> > Sent: 22 ?????? 2005 ?. 23:44 > Subject: Re: [smartmontools-support]turning off log warnings but keeping > mails > > >>> Thank you. One more question, after short test fails, i'm getting such >>> log >>> messages each time another short test on the same HDD passes: >>> >>> Apr 22 20:21:37 217 smartd[2330]: Num Test_Description Status >>> Remaining LifeTime(hours) LBA_of_first_error >>> Apr 22 20:21:37 217 smartd[2330]: # 2 Short offline Completed: >>> read >>> failure 60% 3218 552882021 >>> Apr 22 20:21:37 217 smartd[2330]: #10 Short offline Completed: >>> read >>> failure 60% 3050 552890173 >> >> I don't know where this message is coming from. Smartd logs a message, >> but not of this form. Apparently someone has modified smartd or is using >> it to run some external script. >> >>> Also a mail message from smartd itself arrives, and it would be enough >>> for >>> me, but server keeps informing me about past errors for 24 cycles of >>> tests >>> (even when sector was relocated and there's no failures in fresh tests), >>> and >>> it's quite annoying. How can I keep smartd mails but disable showing >>> failures in system logs? Or decrease 24 cycles to 1-2 (i'm checking >>> disks >>> daily, so 24 cycles means 24 days of such reports). I've found -U option >>> >>> -U ID [ATA only] Report if the number of offline uncorrectable >>> sectors is non-zero. Here ID is the id number of the Attribute whose >>> raw >>> value is the Offline Uncorrectable >>> sector count. The default value of ID is 198. To turn >>> off >>> this reporting, use ID = 0. The allowed range of ID is 0 to 255 >>> inclusive. >>> >>> An offline uncorrectable sector is a disk sector which was >>> not >>> readable during an off-line scan or a self-test. This is important to >>> know, >>> because if you have data >>> stored in this disk sector, and you need to read it, the >>> read >>> will fail. Please see the previous ?-C? option for more details. >>> >>> but it's not so clear how much warnings would be supressed, at least 1 >>> mail >>> message is necessary of course. >> >> I'm confused about the mail messages from smartd. Perhaps you could send >> a couple of examples. >> >> Bruce >> >> >> >> ------------------------------------------------------- >> SF email is sponsored by - The IT Product Guide >> Read honest & candid reviews on hundreds of IT Products from real users. >> Discover which products truly live up to the hype. Start reading now. >> http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click >> _______________________________________________ >> Smartmontools-support mailing list >> Sma...@li... >> https://lists.sourceforge.net/lists/listinfo/smartmontools-support >> > > > > ------------------------------------------------------- > SF email is sponsored by - The IT Product Guide > Read honest & candid reviews on hundreds of IT Products from real users. > Discover which products truly live up to the hype. Start reading now. > http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support > |
From: Michael M. <mi...@np...> - 2005-04-27 06:32:20
|
Hi Anton, Just in regards to your email below: > p.s. > but I really sure that those log messages are from smartd, not from > smartctl, so if anyone has more ideas, please advice. What > usuallyhappens when one of your drives fails on a short test, how > smartd informs you about a problem? > > > Hm, you gave me an idea. I have a script executing daily to check > > temperature of all disks, like > > > > echo -n > > echo -n sda > > smartctl -a /dev/twe0 -d 3ware,0 | grep Tempe | cut --bytes=87- > > smartctl -a /dev/twe0 -d 3ware,0 | grep result | cut --bytes=50- > > echo -n sdi > > smartctl -a /dev/twe1 -d 3ware,0 | grep Tempe | cut --bytes=87- > > smartctl -a /dev/twe1 -d 3ware,0 | grep result | cut --bytes=50- > > echo -n sdb > > smartctl -a /dev/twe0 -d 3ware,1 | grep Tempe | cut --bytes=87- > > smartctl -a /dev/twe0 -d 3ware,1 | grep result | cut --bytes=50- > > ... > > .. > > . > > > > I will disable it now and see if it helps. > > But mail message is a standard thing I guess, every time short test > > fails, i'm getting it via sms, looks like I'm wondering, how do you get that onto your phone via SMS? I'm looking at this at the moment and would like to consider options. Thanks. Michael. |
From: Anton L. <ph...@ma...> - 2005-04-27 09:00:08
|
My mobile operator provides function of mail2sms redirects, so I'm just receiving messages to some technical mailbox at mobile provider's server, and those messages are being automatically forwarded to my phone as sms. Besides original service, we have many of 3rd party redirectors, but non of them are such cheap and straight as the operator's one. >> p.s. >> but I really sure that those log messages are from smartd, not from >> smartctl, so if anyone has more ideas, please advice. What >> usuallyhappens when one of your drives fails on a short test, how >> smartd informs you about a problem? >> >> > Hm, you gave me an idea. I have a script executing daily to check >> > temperature of all disks, like >> > >> > echo -n >> > echo -n sda >> > smartctl -a /dev/twe0 -d 3ware,0 | grep Tempe | cut --bytes=87- >> > smartctl -a /dev/twe0 -d 3ware,0 | grep result | cut --bytes=50- >> > echo -n sdi >> > smartctl -a /dev/twe1 -d 3ware,0 | grep Tempe | cut --bytes=87- >> > smartctl -a /dev/twe1 -d 3ware,0 | grep result | cut --bytes=50- >> > echo -n sdb >> > smartctl -a /dev/twe0 -d 3ware,1 | grep Tempe | cut --bytes=87- >> > smartctl -a /dev/twe0 -d 3ware,1 | grep result | cut --bytes=50- >> > ... >> > .. >> > . >> > >> > I will disable it now and see if it helps. >> > But mail message is a standard thing I guess, every time short test >> > fails, i'm getting it via sms, looks like > > I'm wondering, how do you get that onto your phone via SMS? > > I'm looking at this at the moment and would like to consider options. > > Thanks. > > Michael. > > > > ------------------------------------------------------- > SF.Net email is sponsored by: Tell us your software development plans! > Take this survey and enter to win a one-year sub to SourceForge.net > Plus IDC's 2005 look-ahead and a copy of this survey > Click here to start! http://www.idcswdc.com/cgi-bin/survey?id=105hix > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support > |
From: Ken D. <ken...@gm...> - 2005-12-04 16:11:15
|
Hi Smartmontools: I've spent the last few days (newbie) downloading and installing smartmontools on my G4PB OS X (Darwin) and then scouring the Net for anything and everything about how to set it up and then how to interpret the meaning (attributes) of the reports generated. I discovered the fantastic article written by Bruce Allen some time ago in Linux Magazine - next to the description written by SpeedFan it's the best and clearest yet (that I've come across). I have one question. When I run: /usr/local/sbin/smartctl -l error /dev/disk0 I get the following (abstracted) and indications in the report and log that this error occurs again and again (I'm motivated to do this because my performance with Tiger 10.4.2 is treacly slow and it seems to be associated with disk accesses). In particular EVERY error points to (or has the same signature) i.e. 40 51 01 7f f8 04 e0 Error: UNC 1 sectors at LBA = 0x0004f87f = 325759. Please, any ideas about what this means? At the bottom of this note are my overall disk scores from smartmontools. Thanks for a fantastic tool and really cool and clear explanations in your articles and on sourceforge! Cheers (and hope to hear from you), Ken David Error 1102 occurred at disk power-on lifetime: 3368 hours (140 days + 8 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 01 7f f8 04 e0 Error: UNC 1 sectors at LBA = 0x0004f87f = 325759 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 01 7f f8 04 e0 00 00:14:46.300 READ DMA EXT 25 00 01 7e f8 04 e0 00 00:14:46.200 READ DMA EXT 25 00 01 7d f8 04 e0 00 00:14:46.100 READ DMA EXT 25 00 01 7c f8 04 e0 00 00:14:46.100 READ DMA EXT 25 00 01 7b f8 04 e0 00 00:14:46.000 READ DMA EXT OVERALL SCORES === START OF INFORMATION SECTION === Device Model: HTS548080M9AT00 Serial Number: MRL421L4GB1NHB Firmware Version: MG4OA53A User Capacity: 80,026,361,856 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 6 ATA Standard is: ATA/ATAPI-6 T13 1410D revision 3a Local Time is: Sun Dec 4 14:03:13 2005 GMT SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VAL WRS THR TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 090 090 062 Pre- fail Always - 2359323 2 Throughput_Performance 0x0005 100 100 040 Pre- fail Offline - 0 3 Spin_Up_Time 0x0007 154 154 033 Pre- fail Always - 2 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 1513 5 Reallocated_Sector_Ct 0x0033 054 054 005 Pre- fail Always - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre- fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 040 Pre- fail Offline - 0 9 Power_On_Hours 0x0012 093 093 000 Old_age Always - 3407 10 Spin_Retry_Count 0x0013 100 100 060 Pre- fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 1233 191 G-Sense_Error_Rate 0x000a 100 100 000 Old_age Always - 0 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 165 193 Load_Cycle_Count 0x0012 090 090 000 Old_age Always - 108543 194 Temperature_Celsius 0x0002 152 152 000 Old_age Always - 36 (Lifetime Min/Max 18/55) 196 Reallocated_Event_Count 0x0032 067 067 000 Old_age Always - 1498 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 2 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0 |
From: Mark P. <ema...@ho...> - 2008-05-27 03:07:30
|
From: tail -n25 /var/log/syslog May 17 09:57:33 Lexington-19 smartd[6901]: Try adding '-d sat' to the device line in the smartd.conf file. May 17 09:57:33 Lexington-19 smartd[6901]: For example: '/dev/sda -a -d sat' my smartd.conf file shows a number of lines and I can't properly distinguish which need having the # removed. The drive is Seagate/Maxtor Drive Model STM3320620A 320 gig PATA. I have only one hard drive in the computer. _________________________________________________________________ E-mail for the greater good. Join the i’m Initiative from Microsoft. http://im.live.com/Messenger/IM/Join/Default.aspx?source=EML_WL_ GreaterGood |
From: Jeremy F. <ja...@lc...> - 2006-01-05 17:43:40
|
Hi all, I'm confused about "captive mode" in smartctl. i.e. How does the command smartctl -C -t long /dev/sda differ from the command smartctl -t long /dev/sda =3F The docs say "don't run smartctl in captive mode while partitions are mounted on the disk", but they don't say why that is a bad idea or what the consequences of doing so would be. Would doing that corrupt the disk=3F Skew the results=3F Cause problems for other programs=3F etc. Thanks, Jeremy |
From: <sou...@tv...> - 2006-01-06 14:26:00
|
On Thu, 5 Jan 2006, Jeremy Friesner wrote: > Hi all, >=20 > I'm confused about "captive mode" in smartctl. i.e. How does the comm= and >=20 > smartctl -C -t long /dev/sda >=20 > differ from the command >=20 > smartctl -t long /dev/sda >=20 > ? The docs say "don't run smartctl in captive mode while partitions ar= e=20 >mounted on the disk", but they don't say why that is a bad idea or what=20 >the consequences of doing so would be. Would doing that corrupt the=20 >disk? Skew the results? Cause problems for other programs? etc. It keeps the disk occupied and unresponsive to other commands until the=20 test is finished. You cannot access anything on it. So for a typical=20 long test, it will be totally unresponsive for, say, two hours. --=20 Erik I. Bols=F8 |
From: Diego M. <die...@bo...> - 2007-02-21 11:18:47
|
Hello I hope this is the correct place to ask this.. I have this SPLINDLE IMPENDING... lots of time in the System log, I dont know what it means, I have tried to find with google something, which didnt help. I found this thread: http://sourceforge.net/mailarchive/message.php?msg_id=4290755 My knowledge is to limited about this, so I dont know if there has been posted the solution, allthough I read it through... Afaik there is no solution posted.. Well what I want to know, if there is some action needed, or if the drive is wrongly configured, my best guess is that it is incorrect configured in SCSI utility, or maybe even a wrong jumper.. The server runs since some weeks, so that error seems not critical.. ----- # /usr/sbin/smartctl -H /dev/sda smartctl version 5.36 [i386-redhat-linux-gnu] Copyright (C) 2002-6 Bruce Allen Home page is http://smartmontools.sourceforge.net/ SMART Health Status: SPINDLE IMPENDING FAILURE START UNIT TIMES TOO HIGH [asc=5d, ascq=56] ----- # /usr/sbin/smartctl -a /dev/sda smartctl version 5.36 [i386-redhat-linux-gnu] Copyright (C) 2002-6 Bruce Allen Home page is http://smartmontools.sourceforge.net/ Device: IBM DDYS-T09170N Version: S80D Serial number: VDF6X526 Device type: disk Transport protocol: Fibre channel (FCP-2) Local Time is: Wed Feb 21 12:01:15 2007 CET Device supports SMART and is Enabled Temperature Warning Disabled or Not Supported SMART Health Status: SPINDLE IMPENDING FAILURE START UNIT TIMES TOO HIGH [asc=5d, ascq=56] Current Drive Temperature: 48 C Drive Trip Temperature: 85 C Manufactured in week 28 of year 2000 Current start stop count: 776 times Recommended maximum start stop count: 10000 times Elements in grown defect list: 0 Error counter log: Errors Corrected by Total Correction Gigabytes Total ECC rereads/ errors algorithm processed uncorrected fast | delayed rewrites corrected invocations [10^9 bytes] errors read: 0 0 0 0 0 145.358 0 write: 0 0 0 0 0 241.556 0 verify: 0 0 0 0 0 12.397 0 Non-medium error count: 0 No self-tests have been logged Long (extended) Self Test duration: 350 seconds [5.8 minutes] ----- Thanks allot Diego Mathis -- No virus found in this outgoing message. Checked by AVG Free Edition. Version: 7.5.441 / Virus Database: 268.18.3/694 - Release Date: 20.02.2007 13:44 |
From: fred f. <fl...@ya...> - 2007-03-21 08:04:00
|
Ce message et toutes les pièces jointes sont confidentiels et établis à l'attention exclusive de ses destinataires. Toute utilisation de ce message non conforme à sa destination, toute diffusion ou toute publication, totale ou partielle,est interdite, sauf autorisation expresse. Si vous recevez ce message par erreur, merci de le détruire et d'en avertirimmédiatement l'expéditeur. Je décline toute responsabilité au titre de ce message s'il a été altéré,déformé ou falsifié. This message and all attachments are confidential and intended solely for the addressees. Any use not in accord with its purpose, any dissemination or disclosure, either whole or partial, is prohibited except formal approval. If you receive this message in error, please delete it and immediately notify the sender. ___________________________________________________________________________ Découvrez une nouvelle façon d'obtenir des réponses à toutes vos questions ! Profitez des connaissances, des opinions et des expériences des internautes sur Yahoo! Questions/Réponses http://fr.answers.yahoo.com |
From: <ro...@vo...> - 2007-12-29 23:00:52
|
Hallo, I have smartmontools installed on laptop, but sheduling options in smartd.conf are not very "laptop friendly". In discussion, however, I found post from 16.04.2004 that doing sleduling in a more clever way (like test after 7 days of run ...) is not possible due to faulty firmware and/or the danger of self-test high frequency if laptops are in sleep state most of the time. I want to ask, if the reasons are still topical, or if there is a plan to enhance sheduling support for laptops. Was any solution proposed? There are more and more laptops all around :-) Best regards, Dan |