Thread: [Jfs-discussion] first experiences with JFS

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 454-5900

Having had some time and feeling patient I have recently gone
thru a few days of Linux IO subsystem and filesystem analysis
and testing, as summarized (with lots of numbers and references)
here:

  http://WWW.sabi.co.UK/Notes/swhwAnno05.html#050906
  http://WWW.sabi.co.UK/Notes/swhwAnno05.html#050907
  http://WWW.sabi.co.UK/Notes/swhwAnno05.html#050908
  http://WWW.sabi.co.UK/Notes/swhwAnno05.html#050909
  http://WWW.sabi.co.UK/Notes/swhwAnno05.html#050910
  http://WWW.sabi.co.UK/Notes/swhwAnno05.html#050911
  http://WWW.sabi.co.UK/Notes/swhwAnno05.html#050912
  http://WWW.sabi.co.UK/Notes/swhwAnno05.html#050913
  http://WWW.sabi.co.UK/Notes/swhwAnno05.html#050914

and as a result of such tests I have decided to switch my
filesystems to JFS to see how much performance degrades with
time and files get deleted/added/rewritten.

The good news is that JFS seemed to me one of the two (with
'ext3' with 1KiB blocks) most desirable choices, and it has
turned out to have some unexpected boons too.

The bad news is that I have already suffered from several
crashes and one bizarre performance problem... My setup
consists of an Athlon Xp 2000+, 512MB, 2x80GB and 2x160GB hard
discs, running a mainline 2.6.13 kernel, with 1.1.18 'jfsprogs'.

The incidents so far:

* Some of my tests were tree traversals, that generate a flood of
  inode updates because, which hit the journal hard. So I wondered
  what would the timings be with '-o noatime', unfortunately I
  got a crash because of that.

* When converting from 'ext3' to JFS file systems, I did this by
  copying things around, and I got a couple of lockups. It may
  be that these were related to high buffer cache traffic (I was
  doing a large 'dd' between partitions at one time) and races
  thereof.

* When restoring a '.tar.bz2' held on a 'vfat' file system to a
  newly formatted 'jfs' one I got a dtree corruption, with no
  device errors. I 'fsck'ed it to fix that and redid the restore
  and it did not happen again. There was again a 'dd' between
  two partitions running at the same time.

* Making a file system with a 30MiB log instead of the default
  32MiB makes reading it with 'tar' over twice as slow. This for
  the same partition on the same hard disc with the same content
  freshly loaded (it was so strange I checked several times).

All which leads to think that not many people have used non
default log sizes, or used JFS with FAT32 or massive 'dd'ing, or
with 'noatime'... :-)

Some more context and some data... I was in multiuser but not
GUI mode when the incidents above happened, with only a few
d=E6mons running.

The output of 'jfs=5Ffsck' after the =ABDT=5FGETPAGE: dtree page
corrupt=BB errors:

----------------------------------------------------------------
jfs=5Ffsck version 1.1.8, 03-May-2005
processing started: 9/14/2005 18.52.55
Using default parameter: -p
The current device is:  /dev/hdb11
Block size in bytes:  4096
Filesystem size in blocks:  1028152
**Phase 0 - Replay Journal Log
**Phase 1 - Check Blocks, Files/Directories, and  Directory Entries
**Phase 2 - Count links
Incorrect link counts have been detected. Will correct.
**Phase 3 - Duplicate Block Rescan and Directory Connectedness
**Phase 4 - Report Problems
File system object DF20499 is linked as: /var
cannot repair the data format error(s) in this directory.
cannot repair DF20499.  Will release.
File system object DF20512 is linked as: /dev/ida
cannot repair DF20512.  Will release.
**Phase 5 - Check Connectivity
**Phase 6 - Perform Approved Corrections
768 files reconnected to /lost+found/.
**Phase 7 - Rebuild File/Directory Allocation Maps
**Phase 8 - Rebuild Disk Allocation Maps
  4112608 kilobytes total disk space.
    35465 kilobytes in 13747 directories.
  2756001 kilobytes in 135907 user files.
        0 kilobytes in extended attributes
   100356 kilobytes reserved for system use.
  1291716 kilobytes are available for use.
Filesystem is clean.
----------------------------------------------------------------

The one ''oops'' that got logged (it happened twice):

----------------------------------------------------------------
Unable to handle kernel paging request at virtual address cc05b9a4
 printing eip:
c0251f5d
*pde =3D 00030067
*pte =3D 0c05b000
Oops: 0000 [#1]
DEBUG=5FPAGEALLOC
Modules linked in: binfmt=5Fmisc snd=5Fcmipci snd=5Fopl3=5Flib snd=5Fhw=
dep snd=5Fseq=5Foss snd=5Fseq=5Fmidi snd=5Fseq=5Fmidi=5Fevent snd=5Fseq=
 snd=5Fvia82xx gameport snd=5Fac97=5Fcodec snd=5Fpcm=5Foss snd=5Fmixer=5F=
oss snd=5Fpcm snd=5Ftimer snd=5Fpage=5Falloc snd=5Fmpu401=5Fuart snd=5F=
rawmidi snd=5Fseq=5Fdevice snd soundcore 3c59x mii parport=5Fpc lp parp=
ort video thermal processor fan container button battery ac it87 eeprom=
 i2c=5Fsensor i2c=5Fisa i2c=5Fdev i2c=5Fcore ntfs nls=5Fiso8859=5F1 nls=
=5Fcp437 sg sr=5Fmod ide=5Fscsi scsi=5Fmod 8250 serial=5Fcore nvram rtc=

CPU:    0
EIP:    0060:[txUpdateMap+333/656]    Not tainted VLI
EFLAGS: 00010246   (2.6.13p)=20
EIP is at txUpdateMap+0x14d/0x290
eax: cc05b97c   ebx: e0996990   ecx: e08366c8   edx: 00000900
esi: 00000001   edi: e0996980   ebp: dfdc7f48   esp: dfdc7f10
ds: 007b   es: 007b   ss: 0068
Process jfsCommit (pid: 139, threadinfo=3Ddfdc7000 task=3Dc15725d0)
Stack: e084be30 0000060c dfdc7f48 c024f181 00000000 00000040 d94596fc d=
befc2fc=20
       00000202 00000000 00000000 dc64d160 e08366c8 e08366c8 dfdc7f74 c=
02529b2=20
       e08366c8 00000286 e0861514 dfdc7fe4 00000000 0000007b 0000007b d=
c64d160=20
Call Trace:
 [show=5Fstack+127/160] show=5Fstack+0x7f/0xa0
 [show=5Fregisters+343/448] show=5Fregisters+0x157/0x1c0
 [die+332/688] die+0x14c/0x2b0
 [do=5Fpage=5Ffault+921/1791] do=5Fpage=5Ffault+0x399/0x6ff
 [error=5Fcode+79/84] error=5Fcode+0x4f/0x54
 [txLazyCommit+34/688] txLazyCommit+0x22/0x2b0
 [jfs=5Flazycommit+844/1200] jfs=5Flazycommit+0x34c/0x4b0
 [kernel=5Fthread=5Fhelper+5/16] kernel=5Fthread=5Fhelper+0x5/0x10
Code: f6 47 04 02 0f 85 4f 01 00 00 8d 5f 10 0f b6 43 03 85 c0 74 4d 89=
 c6 8d b4 26 00 00 00 00 f6 43 04 f0 0f 85 16 01 00 00 8b 47 0c <0f> b7=
 40 28 25 00 f0 00 00 3d 00 40 00 00 0f 84 ef 00 00 00 8b=20
----------------------------------------------------------------
Unable to handle kernel paging request at virtual address c2cc8804
 printing eip:
c0251f5d
*pde =3D 0000b067
*pte =3D 02cc8000
Oops: 0000 [#1]
DEBUG=5FPAGEALLOC
Modules linked in: videodev loop binfmt=5Fmisc snd=5Fcmipci snd=5Fopl3=5F=
lib snd=5Fhwdep snd=5Fseq=5Foss snd=5Fseq=5Fmidi snd=5Fseq=5Fmidi=5Feve=
nt snd=5Fseq snd=5Fvia82xx gameport snd=5Fac97=5Fcodec snd=5Fpcm=5Foss =
snd=5Fmixer=5Foss snd=5Fpcm snd=5Ftimer snd=5Fpage=5Falloc snd=5Fmpu401=
=5Fuart snd=5Frawmidi snd=5Fseq=5Fdevice snd soundcore 3c59x mii parpor=
t=5Fpc lp parport video thermal processor fan container button battery =
ac it87 eeprom i2c=5Fsensor i2c=5Fisa i2c=5Fdev i2c=5Fcore ntfs nls=5Fi=
so8859=5F1 nls=5Fcp437 sg sr=5Fmod ide=5Fscsi scsi=5Fmod 8250 serial=5F=
core nvram rtc
CPU:    0
EIP:    0060:[txUpdateMap+333/656]    Not tainted VLI
EFLAGS: 00010246   (2.6.13p)=20
EIP is at txUpdateMap+0x14d/0x290
eax: c2cc87dc   ebx: e08b6b10   ecx: e0828700   edx: 00000900
esi: 00000001   edi: e08b6b00   ebp: dfdc7f48   esp: dfdc7f10
ds: 007b   es: 007b   ss: 0068
Process jfsCommit (pid: 139, threadinfo=3Ddfdc7000 task=3Dc15725d0)
Stack: e0879094 00000b85 dfdc7f48 c024f181 00000000 00000040 db26571c d=
9c48cfc=20
       00000206 00000000 00000000 d3ea84c0 e0828700 e0828700 dfdc7f74 c=
02529b2=20
       e0828700 00000000 b2f1fb80 00989dee c04d0c60 c157270c dfdc7000 d=
3ea84c0=20
Call Trace:
 [show=5Fstack+127/160] show=5Fstack+0x7f/0xa0
 [show=5Fregisters+343/448] show=5Fregisters+0x157/0x1c0
 [die+332/688] die+0x14c/0x2b0
 [do=5Fpage=5Ffault+921/1791] do=5Fpage=5Ffault+0x399/0x6ff
 [error=5Fcode+79/84] error=5Fcode+0x4f/0x54
 [txLazyCommit+34/688] txLazyCommit+0x22/0x2b0
 [jfs=5Flazycommit+844/1200] jfs=5Flazycommit+0x34c/0x4b0
 [kernel=5Fthread=5Fhelper+5/16] kernel=5Fthread=5Fhelper+0x5/0x10
Code: f6 47 04 02 0f 85 4f 01 00 00 8d 5f 10 0f b6 43 03 85 c0 74 4d 89=
 c6 8d b4 26 00 00 00 00 f6 43 04 f0 0f 85 16 01 00 00 8b 47 0c <0f> b7=
 40 28 25 00 f0 00 00 3d 00 40 00 00 0f 84 ef 00 00 00 8b=20
----------------------------------------------------------------

Sorry for the relative lack of details, I hope that there is
enough to start an investigation.

Thread: [Jfs-discussion] first experiences with JFS

jfs-discussion