|
From: Corey L. <cle...@gm...> - 2010-12-11 21:23:41
|
I'm having a problem with Bacula and my Autoloader/tape drive.
OS = Fedora 10
Bacula version = 2.4.2
Tape Drive = Quantum Superloader 3 - LTO3
Here is the problem up until a week ago everything was working fine. When I
was trying to label a new tape for Bacula to use the tape broke off into the
drive.
I called Quantum tech support and they sent me a replacement unit. The only
thing that has changed was the autoloader and drive. Now I am getting write
errors on all of my backup tapes old or new. The drive also requires
cleaning after every write error .Here is an example of the output.
10-Dec 21:00 tsunami-dir JobId 41915: Start Backup JobId 41915,
Job=Aquarius-daily.2010-12-10_21.00.28
10-Dec 21:00 tsunami-dir JobId 41915: Using Device "LTO-3"
10-Dec 21:00 neptune-sd JobId 41915: Volume "LTDAILY0011" previously
written, moving to end of data.
10-Dec 21:02 neptune-sd JobId 41915: Ready to append to end of Volume
"LTDAILY0011" at file=16.
10-Dec 21:09 neptune-sd JobId 41915: Error: block.c:568 Write error at
16:62091 on device "LTO-3" (/dev/nst0). ERR=Input/output error.
10-Dec 21:09 neptune-sd JobId 41915: Error: Error writing final EOF to tape.
This Volume may not be readable.
dev.c:1681 ioctl MTWEOF error on "LTO-3" (/dev/nst0). ERR=Input/output
error.
10-Dec 21:09 neptune-sd JobId 41915: End of medium on Volume "LTDAILY0011"
Bytes=53,956,482,048 Blocks=836,378 at 10-Dec-2010 21:09.
10-Dec 21:09 neptune-sd JobId 41915: 3307 Issuing autochanger "unload slot
2, drive 0" command.
10-Dec 21:11 tsunami-dir JobId 41915: Recycled volume "LTDAILY0020"
10-Dec 21:11 neptune-sd JobId 41915: 3301 Issuing autochanger "loaded? drive
0" command.
10-Dec 21:11 neptune-sd JobId 41915: 3302 Autochanger "loaded? drive 0",
result: nothing loaded.
10-Dec 21:11 neptune-sd JobId 41915: 3304 Issuing autochanger "load slot 1,
drive 0" command.
10-Dec 21:12 neptune-sd JobId 41915: 3305 Autochanger "load slot 1, drive
0", status is OK.
10-Dec 21:13 neptune-sd JobId 41915: Recycled volume "LTDAILY0020" on device
"LTO-3" (/dev/nst0), all previous data lost.
10-Dec 21:13 neptune-sd JobId 41915: New volume "LTDAILY0020" mounted on
device "LTO-3" (/dev/nst0) at 10-Dec-2010 21:13.
I called Quantum support to test and see if maybe they sent me a faulty
autoloader. They had me install their diagnostic software called xTalk. I
loaded a new tape and they had me run a few tests. I ran the health check
and a Full Read/Write test. They both passed. Their were no errors in the
xTalk log that pointed to hardware problems. So they would not replace the
unit. I tried backing up to that same tape that night and it also gave the
same write error message. So I ran btape for some more testing. The test job
ran fine with no errors. I ran the fill command and it failed. Here is the
output.
Wrote Volume label for volume "TestVolume1".
Wrote Start of Session label.
09:22:40 Begin writing Bacula records to first tape ...
Wrote blk_block=5000, dev_blk_num=4999 VolBytes=322,495,488 rate=40311.9
KB/s
Wrote blk_block=10000, dev_blk_num=9999 VolBytes=645,055,488 rate=49619.7
KB/s
Wrote blk_block=15000, dev_blk_num=14999 VolBytes=967,615,488 rate=53756.4
KB/s
Wrote blk_block=20000, dev_blk_num=19999 VolBytes=1,290,175,488 rate=56094.6
KB/s
Wrote blk_block=25000, dev_blk_num=24999 VolBytes=1,612,735,488 rate=59730.9
KB/s
Wrote blk_block=30000, dev_blk_num=29999 VolBytes=1,935,295,488 rate=60478.0
KB/s
09:23:14 Flush block, write EOF
-----------------------------------------------------------------------------------------------------------
Truncated
-----------------------------------------------------------------------------------------------------------
Wrote blk_block=1250000, dev_blk_num=2000 VolBytes=80,639,935,488
rate=65774.8 KB/s
Wrote blk_block=1255000, dev_blk_num=7000 VolBytes=80,962,495,488
rate=65823.2 KB/s
Wrote blk_block=1260000, dev_blk_num=12000 VolBytes=81,285,055,488
rate=65817.9 KB/s
09-Dec 09:44 btape JobId 0: Error: block.c:568 Write error at 40:14437 on
device "LTO-3" (/dev/nst0). ERR=Input/output error.
09-Dec 09:44 btape JobId 0: Error: Error writing final EOF to tape. This
Volume may not be readable.
dev.c:1681 ioctl MTWEOF error on "LTO-3" (/dev/nst0). ERR=Input/output
error.
btape: btape.c:2349 Last block at: 40:14436 this_dev_block_num=14437
btape: btape.c:2383 End of tape 40:14437. VolumeCapacity=81,442,271,232.
Write rate = 62312.4 KB/s
09-Dec 09:44 btape JobId 0: End of medium on Volume "TestVolume1"
Bytes=81,442,271,232 Blocks=1,262,436 at 09-Dec-2010 09:44.
09-Dec 09:44 btape JobId 0: 3307 Issuing autochanger "unload slot 1, drive
0" command.
09-Dec 09:46 btape JobId 0: 3301 Issuing autochanger "loaded? drive 0"
command.
09-Dec 09:46 btape JobId 0: 3302 Autochanger "loaded? drive 0", result:
nothing loaded.
09-Dec 09:46 btape JobId 0: 3304 Issuing autochanger "load slot 2, drive 0"
command.
09-Dec 09:47 btape JobId 0: 3305 Autochanger "load slot 2, drive 0", status
is OK.
09-Dec 09:47 btape: Fatal Error at dev.c:1663 because:
dev.c:1662 Attempt to WEOF on non-appendable Volume
Wrote Volume label for volume "TestVolume2".
09-Dec 09:48 btape JobId 0: Wrote label to prelabeled Volume "TestVolume2"
on device "LTO-3" (/dev/nst0)
09-Dec 09:48 btape JobId 0: New volume "TestVolume2" mounted on device
"LTO-3" (/dev/nst0) at 09-Dec-2010 09:48.
Done writing 0 records ...
Wrote End of Session label.
Wrote state file last_block_num1=14436 last_block_num2=11
09:48:05 Done filling tapes at 0:13. Now beginning re-read of first tape ...
09-Dec 09:48 btape JobId 0: 3307 Issuing autochanger "unload slot 2, drive
0" command.
09-Dec 09:49 btape JobId 0: 3304 Issuing autochanger "load slot 1, drive 0"
command.
09-Dec 09:50 btape JobId 0: 3305 Autochanger "load slot 1, drive 0", status
is OK.
09-Dec 09:50 btape JobId 0: Ready to read from volume "TestVolume1" on
device "LTO-3" (/dev/nst0).
Rewinding.
Reading the first 10000 records from 0:0.
10000 records read now at 1:5084
Reposition from 1:5084 to 40:14436
Reposition error. ERR=dev.c:1483 ioctl MTFSR 14436 error on "LTO-3"
(/dev/nst0). ERR=Input/output error.
I also tried running the rawfill command, it failed as well.
*rawfill
btape: btape.c:2500 Begin writing raw blocks of 64512 bytes.
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Write failed at block 1892705. stat=-1 ERR=Input/output error
btape: btape.c:501 Bad status from weof. ERR=dev.c:1681 ioctl MTWEOF error
on "LTO-3" (/dev/nst0). ERR=Input/output error.
Here is a copy of my bacula-sd.conf file.
#
# Default Bacula Storage Daemon Configuration file
#
# For Bacula release 2.4.2 (26 July 2008) -- redhat
#
# You may need to change the name of your tape drive
# on the "Archive Device" directive in the Device
# resource. If you change the Name and/or the
# "Media Type" in the Device resource, please ensure
# that dird.conf has corresponding changes.
#
Storage { # definition of myself
Name = neptune-sd
SDPort = 9103 # Director's port
WorkingDirectory = "/var/lib/bacula"
Pid Directory = "/var/run"
Maximum Concurrent Jobs = 20
}
#
# List Directors who are permitted to contact Storage daemon
#
Director {
Name = tsunami-dir
Password = "A0lD0q0pqE78lNjqlarC0QQFnEntiUr49n4kUzqAS2Kx"
}
#
# Restricted Director, used by tray-monitor to get the
# status of the storage daemon
#
Director {
Name = tsunami-mon
Password = "A0lD0q0pqE78lNjqlarC0QQFnEntiUr49n4kUzqAS2Kx"
Monitor = yes
}
#
# Devices supported by this Storage daemon
# To connect, the Director's bacula-dir.conf must have the
# same Name and MediaType.
#
Device {
Name = FileStorage
Media Type = File
Archive Device = /tmp/bacula
LabelMedia = yes; # lets Bacula label unlabeled media
Random Access = Yes;
AutomaticMount = yes; # when device opened, read it
RemovableMedia = no;
AlwaysOpen = no;
}
Autochanger {
Name = SuperLoader3
Device = LTO-3
Changer Command = "/usr/lib/bacula/mtx-changer %c %o %S %a %d"
Changer Device = /dev/sg5
}
Device {
Name = LTO-3 #
Media Type = LTO3
Drive Index = 0
Device Type = Tape
Archive Device = /dev/nst0
AutomaticMount = yes; # when device opened, read it
AlwaysOpen = yes;
RemovableMedia = yes;
RandomAccess = no;
AutoChanger = yes
# Alert Command = "sh -c 'tapeinfo -f %c |grep TapeAlert|cat'"
# If you have smartctl, enable this, it has more info than tapeinfo
# Alert Command = "sh -c 'smartctl -H -l error %c'"
}
#
# Send all messages to the Director,
# mount messages also are sent to the email address
#
Messages {
Name = Standard
director = tsunami-dir = all
}
Please let me know if you have any ideas. Sorry for the large post.
Corey
|