Thread: [fuse-devel] fuse - mmap/writepages/other queries

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 454-5900

Miklos,
 first of all thank you SO MUCH for keeping the fuse project in such a
good state! I admire the way it is being managed.

 I am one of the authors of GlusterFS (www.gluster.org), a distributed
cluster filesystem which uses FUSE for its client.

 I have a few questions

1. shared writable mmap. - I saw your patches on the linux-kernel on
feb-28th for this. but i dont see the fixes gone into the mainstream
kernel yet. (is it present in the -mm?) neither do i see it in the
fuse cvs too (obviously since it depends on complementary changes in
the mm/?). when can one expect mmap() to work smoothly over files
opened with O_RDWR ?

2. fuse_writepages - this i presume will help improve write
performance for filesystems. does this mean that if an application
does write(fd,buf,131072) then filesystem gets entire 128kb, or, even
if application does multipel write(fd,buf,4k) then it would get
aggregated to some extent and filesystem gets an aggregated chunk?
also same as previous, when will fuse_writepages be available, atleast
in cvs? is this available in some other private repository of yours?
I'm very anxious for this!

3. is there any way to get inode's i_generation access to the
filesystem? (instead of fuse sending only fuse_ino_t). in glusterfs,
the glusterfs server uses underlying filesystem for storing files and
folders. glusterfs server can detect inode number recycling (reiserfs
does very aggressive inode number recycling) but the glusterfs client
(based on fuse) cannot express whether the inode number sent by the
fuse kernel is the latest or older generation. i also understand that
the generation sent by filesystem to fuse kernel is currently not
being used, correct me if i'm wrong.

4. the readahead and channel is peaked at 128kb, by changing few thing
in the kernel module (fuse_conn->bdi.ra_pages) i was able to increase
this to bigger readahead value (ofcourse by increasing channel size
too). why is the 128kb hard limit? what is the side-effect of having
ra_pages beyond VM_MAX_READAHEAD pages? can this lead to any issues?

5. how can a fuse based filesystem detect if the application did an
fcntl(fd,F_SETFL, O_DIRECT) on an fd which was not opened with
O_DIRECT during the read() and write() operations? it would be
convinient if the fi.flags be set with direct_io for all file based
operations.

6. (unrelated to fuse) is the VM's readahead careful enough not to
readahead into locked regions (hence causing an wrong 'block' in case
mandatory locks was enabled). taking this to the next level, is there
a trick for a fuse based filesystem detect what part of the read()
request is belonging to a valid application's request and how much is
from the VM's readahead logic (to ensure it doesnt
accidentally+wrongly get blocked into a mandatory locked region held
by another client machine?.) I understand disabling kernel's readahead
is one of the ways around, but kernel's readahead is tremondously
increasing performance by drastically reducing context switches.

thanks!
avati

-- 
Anand V. Avati