Thread: [fuse-devel] bypassing read/write for mirror fs

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 454-5900

Hello,

I would like to avoid having the read() and write() calls go through
my user fs, but rather go directly to the underlying filesystem that
I'm mirroring. Essentially I want to let fuse know of the "real" file
descriptor somehow in the open() call, and when the fuse kernel side
gets a read() or write(), it will go back to the VFS layer with that
file descriptor instead of going out to my process (which just does a
pread() or pwrite() anyway). I would still like to get any other
calls, such as unlink(), chown(), access(), etc. This has been
discussed a while back:
http://thread.gmane.org/gmane.comp.file-systems.fuse.devel/5946/focus=5947
- has it been tried at all and is not actually workable? Or is there
just not much interest?

Some other similar threads I've found suggested a few other
performance improvements for write(), such as using -obig_writes,
using write_buf() instead of write(), and using direct_io. I tried
these approaches all in the same test scenario with my program and
came up with the following results:

4.200s: Default fuse fs (4k writes) with 582176 write() calls
4.252s: write_buf with 582176 write() calls
2.063s: direct_io (seems this enables 32k writes automatically?) with
72776 write() calls
2.510s: -obig_writes (32k writes) 72776 write() calls
0.419s: baseline without fuse at all (the closer to this, the better :)

(Note the absolute values are meaningless - this is just to compare
one against another)

So write_buf doesn't seem to help my program at all, while increasing
the buffer size helps quite a bit due to the fewer write calls.

I then ran callgrind on the direct_io version to see where the time is
actually going. The total Ir for this test is 258M, with 142M (55%)
going to fuse_lib_write_buf() and 49M (19%) going to fuse_lib_read().
Within fuse_lib_write_buf(), the major parts are:

67M: get_path_null()
17M: fuse_fs_write_buf()
22M: free_path()
27M: fuse_reply_write()

It seems a lot of effort is going into getting/freeing the path for
each 32k chunk of data written. In my case I don't care about the path
at all for read & write, so this is unnecessary overhead for me. As a
quick test I tried to comment out the calls to get_path_nullok() and
free_path() in both fuse_lib_read() and fuse_lib_write_buf(), but this
only shaved off ~100ms or so.

Do you think it is feasible to be able to provide fuse with a "real"
fd during open() so that read/write in the userspace side of things
can be skipped entirely? I'd be happy to try to work up a patch,
though any guidance would be appreciated :)

Thanks!
-Mike