From: Robert <rsa...@ne...> - 2011-06-29 20:33:14
|
Yes, it is rather disappointing how Linux slows down on large folders. Not only that, but some file systems like ext2/3/4 limit the number of files in a folder to a low number (16k/16k/64k). Generally we store files in a hash-based structure where the first n characters of the file's hash form the first folder name, the next m characters of the file's hash form the second folder name, and the filename is the hash of the file. This prevents us from having too large a number of files in a folder, avoids the per-folder file limits and slowdowns, and prevents a too-deep directory structure. It also gives us some deduplication.

Yes, we use CentOS, but installing and using the ktune package generally resolves most of the performance issues and differences I have seen with Ubuntu/Debian. It also helps to disable the cpuspeed daemon if you know the server does not have much down time.

I don't understand the comment on hitting the metadata a lot. What is "a lot"? Why would it make a difference? All the metadata is in RAM anyway. The biggest limit to speed seems to be the number of IOPS that you can get out of the disks available to you; looking up the metadata from RAM should be several orders of magnitude faster than that.

The activity reported through the CGI interface on the master is around 2,400 opens per minute on average. Reads and writes are also around 2,400 per minute, alternating with each other. mknod has some peaks around 2,800 per minute but is generally much lower. Lookups are around 8,000 per minute and getattr is around 700 per minute. Chunk replication and deletion is around 50 per minute. The other numbers are generally very low.

Is there a guide or are there hints specific to MooseFS on which IO/Net/Process parameters would be good to investigate for mfsmaster?

Robert

-----Original Message----- From: Robert Dye <ro...@in...> To: 'Robert' <rsa...@ne...>; moosefs-users <moo...@li...> Sent: Wed, Jun 29, 2011 3:00 pm Subject: RE: [Moosefs-users] Write starvation

As Ricardo mentioned below, you could be hitting the metadata, a lot. When I ran benchmarks for enumerations in a directory with more than 10,000 files, it was an obvious slowdown in comparison to a directory with only a few files. Not sure which OS you are running, but I would tweak the many IO/Net/Process parameters. From personal experience, I had a huge slowdown with the mfsmaster server when running on CentOS. I have since moved to Ubuntu, which appears to be faster out of the box. -Rob

From: Robert [mailto:rsa...@ne...] Sent: Wednesday, June 29, 2011 11:29 AM To: ric...@da...; moo...@li... Subject: Re: [Moosefs-users] Write starvation

Yes, the master server is working hard. But I would still expect a somewhat fair distribution of load between read and write. The specs: 2 x quad core Xeon E5405 @ 2 GHz, 64 GB of RAM, 32 x 2 TB 7200 RPM SATA disks, 68 million file system objects, 65.4 million files. No swap is being used. mfsmaster is using 23 GB of RAM. Robert

-----Original Message----- From: Ricardo J. Barberis <ric...@da...> To: moosefs-users <moo...@li...> Sent: Wed, Jun 29, 2011 12:08 pm Subject: Re: [Moosefs-users] Write starvation

On Tuesday, 28 June 2011, Robert Sandilands wrote:
> Write traffic does not stop completely, but seems to slow down to < 10
> kB per second under high read traffic conditions. When the read traffic
> decreases the write traffic will increase to normal levels.
>
> Is this a known problem? Is there something I can do to ensure that
> write traffic is not starved by read traffic?
>
> Robert

What are the master machine specs, and how many files do you have already on MFS? You might be hitting the master too hard with metadata. Regards, -- Ricardo J. Barberis Senior SysAdmin / ITI Dattatec.com :: Soluciones de Web Hosting Tu Hosting hecho Simple! |
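A minimal shell sketch of the hash-based layout Robert describes above. The two-character split widths, the SHA-1 choice, and the /srv/store path are illustrative assumptions, not his exact scheme:

    #!/bin/bash
    # Store a file under /srv/store/<h1>/<h2>/<hash>, where h1 and h2 are the
    # first and second pairs of characters of the file's content hash.
    f="$1"
    hash=$(sha1sum "$f" | awk '{print $1}')
    dir="/srv/store/${hash:0:2}/${hash:2:2}"
    mkdir -p "$dir"
    cp "$f" "$dir/$hash"   # identical content maps to the same name, giving some deduplication

With two levels of two hex characters each, no directory ever holds more than 256 subdirectories, which avoids the per-folder limits and lookup slowdowns mentioned in the post while keeping the tree shallow.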
From: Zachary A W. <zw...@ii...> - 2011-06-29 19:11:14
|
So after successfully building a test network last week for my professor (consisting of 1 master, 1 meta, 2 chunkservers, and 1 client), I arrived this week to find the machines turned off. After restarting, I started the machines in the order master, meta, chunk1, chunk2, client. However, now the client will not mount to the master no matter what I do. Has anybody ever had this problem before? Is it something I need to change on my MooseFS machines or do you think it may have to do with my network? FYI I am very new to Linux and extremely new to MooseFS. Thank you, Zach |
From: Robert D. <ro...@in...> - 2011-06-29 19:00:33
|
As Ricardo mentioned below, you could be hitting the metadata, a lot. When I ran benchmarks for enumerations in a directory with more than 10,000 files, it was an obvious slowdown in comparison to a directory with only a few files. Not sure which OS you are running, but I would tweak the many IO/Net/Process parameters. From personal experience, I had a huge slowdown with the mfsmaster server when running on CentOS. I have since moved to Ubuntu, which appears to be faster out of the box. -Rob

_____

From: Robert [mailto:rsa...@ne...] Sent: Wednesday, June 29, 2011 11:29 AM To: ric...@da...; moo...@li... Subject: Re: [Moosefs-users] Write starvation

Yes, the master server is working hard. But I would still expect a somewhat fair distribution of load between read and write. The specs: 2 x quad core Xeon E5405 @ 2 GHz, 64 GB of RAM, 32 x 2 TB 7200 RPM SATA disks, 68 million file system objects, 65.4 million files. No swap is being used. mfsmaster is using 23 GB of RAM. Robert

-----Original Message----- From: Ricardo J. Barberis <ric...@da...> To: moosefs-users <moo...@li...> Sent: Wed, Jun 29, 2011 12:08 pm Subject: Re: [Moosefs-users] Write starvation

On Tuesday, 28 June 2011, Robert Sandilands wrote:
> Write traffic does not stop completely, but seems to slow down to < 10
> kB per second under high read traffic conditions. When the read traffic
> decreases the write traffic will increase to normal levels.
>
> Is this a known problem? Is there something I can do to ensure that
> write traffic is not starved by read traffic?
>
> Robert

What are the master machine specs, and how many files do you have already on MFS? You might be hitting the master too hard with metadata. Regards, -- Ricardo J. Barberis Senior SysAdmin / ITI Dattatec.com :: Soluciones de Web Hosting Tu Hosting hecho Simple! |
From: Robert <rsa...@ne...> - 2011-06-29 18:45:25
|
Yes, the master server is working hard. But I would still expect a somewhat fair distribution of load between read and write. The specs: 2 x quad core Xeon E5405 @ 2 GHz, 64 GB of RAM, 32 x 2 TB 7200 RPM SATA disks, 68 million file system objects, 65.4 million files. No swap is being used. mfsmaster is using 23 GB of RAM. Robert

-----Original Message----- From: Ricardo J. Barberis <ric...@da...> To: moosefs-users <moo...@li...> Sent: Wed, Jun 29, 2011 12:08 pm Subject: Re: [Moosefs-users] Write starvation

On Tuesday, 28 June 2011, Robert Sandilands wrote:
> Write traffic does not stop completely, but seems to slow down to < 10
> kB per second under high read traffic conditions. When the read traffic
> decreases the write traffic will increase to normal levels.
>
> Is this a known problem? Is there something I can do to ensure that
> write traffic is not starved by read traffic?
>
> Robert

What are the master machine specs, and how many files do you have already on MFS? You might be hitting the master too hard with metadata. Regards, -- Ricardo J. Barberis Senior SysAdmin / ITI Dattatec.com :: Soluciones de Web Hosting Tu Hosting hecho Simple! |
From: Sébastien M. <seb...@gm...> - 2011-06-29 18:35:22
|
Hi, I'm currently interested in deploying a distributed filesystem and have read a paper about MooseFS. I have a few questions before starting the job: 1/ Why MooseFS instead of GlusterFS or XtreemFS? 2/ A paper I read from a French university described GlusterFS as much faster than MooseFS; did you benchmark them too? 3/ Why FUSE and not kernel mode (which should be faster)? Thanks in advance, Sébastien |
From: Ricardo J. B. <ric...@da...> - 2011-06-29 16:08:48
|
On Tuesday, 28 June 2011, Robert Sandilands wrote:
> Write traffic does not stop completely, but seems to slow down to < 10
> kB per second under high read traffic conditions. When the read traffic
> decreases the write traffic will increase to normal levels.
>
> Is this a known problem? Is there something I can do to ensure that
> write traffic is not starved by read traffic?
>
> Robert

What are the master machine specs, and how many files do you have already on MFS? You might be hitting the master too hard with metadata. Regards, -- Ricardo J. Barberis Senior SysAdmin / ITI Dattatec.com :: Soluciones de Web Hosting Tu Hosting hecho Simple! |
From: Ricardo J. B. <ric...@da...> - 2011-06-29 16:03:58
|
On Wednesday, 29 June 2011, youngcow wrote:
> I think you can use RAID technology to protect against drive failures on one
> chunkserver.

Yes, but as Laurent said, you (still) lose fault tolerance. Disks aren't the only components that can fail; you also have to take into consideration the power supply, memory, processor, motherboard, etc. So I guess it depends on how critical the data you put in MFS is, or as I read some time ago:

"<Dr_Memory> or to put it another way: your kitten photos do not need the same high-availability system infrastructure as Citibank's transaction databases :) <topaz> I CAN HAS FIEV NIENS?" :)

> > I have a sense that this question is due to confusing drives and
> > chunkservers. MFS only replicates across chunkservers. So, if you have
> > multiple drives attached to a single chunkserver, MFS won't replicate
> > inside of the chunkserver.
> >
> > This brings up my question - can one safely run multiple chunkserver
> > processes on a single machine?
> >
> > max

Cheers, -- Ricardo J. Barberis Senior SysAdmin / ITI Dattatec.com :: Soluciones de Web Hosting Tu Hosting hecho Simple! |
From: Laurent W. <lw...@hy...> - 2011-06-29 14:13:54
|
On Wed, 29 Jun 2011 01:33:35 +0000 (GMT) Zachary A Wagner <zw...@ii...> wrote:
> Hello, I am fairly new to Linux and extremely new to MooseFS. I am trying to set up a testing network of 5 systems for my professor: 1 Master, 1 Metalogger, 2 Chunkservers (each with 2 mounted partitions), and 1 client.
> Everything seemed to go well (no errors, all the mfs machines started without error) until I neared the end of Client setup in your step-by-step guide. After I entered the "df -h | grep mfs" command, I received data for the
> master but not for the chunkservers' partitions (which are named mfschunks11, mfschunks12, mfschunks21, and mfschunks22). I was wondering if this was normal? If not, do you know anything that may be wrong or that
> might help me?

It is perfectly normal: on the client you only get information for the MFS volume, and the chunkservers' own disk space information is not exposed to users. If you need information about disk usage on a chunkserver, run df -h on that chunkserver itself. HTH, -- Laurent Wandrebeck HYGEOS, Earth Observation Department / Observation de la Terre Euratechnologies 165 Avenue de Bretagne 59000 Lille, France tel: +33 3 20 08 24 98 http://www.hygeos.com GPG fingerprint/Empreinte GPG: F5CA 37A4 6D03 A90C 7A1D 2A62 54E6 EF2C D17C F64C |
From: Laurent W. <lw...@hy...> - 2011-06-29 13:44:05
|
On Wed, 29 Jun 2011 16:24:37 +0300 Stas Oskin <sta...@gm...> wrote:
> Hi.
>
> What is the best way to retire a chunkserver?
>
> Is there a command that can do this? Or can I just stop mfschunkserver and
> let MFS resync the chunks to other chunkservers in the cluster?

The best thing is to add a * in front of each mount point used by MFS on the chunkserver. Restart the CS and let the replication do its job, then retire the CS. That way, you'll never be in a situation where the goal is lower than it should be. HTH, -- Laurent Wandrebeck HYGEOS, Earth Observation Department / Observation de la Terre Euratechnologies 165 Avenue de Bretagne 59000 Lille, France tel: +33 3 20 08 24 98 http://www.hygeos.com GPG fingerprint/Empreinte GPG: F5CA 37A4 6D03 A90C 7A1D 2A62 54E6 EF2C D17C F64C |
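For concreteness, a sketch of the marking Laurent describes. The mount points a chunkserver serves are listed in its mfshdd.cfg; the file location and paths below are examples, so check them against your own setup:

    # /etc/mfshdd.cfg on the chunkserver being retired
    # A leading '*' marks the disk for removal; the master then replicates
    # its chunks to other chunkservers before you shut this one down.
    */mnt/mfschunks1
    */mnt/mfschunks2

After editing, restart the chunkserver process and watch the CGI interface until no chunks remain undergoal, then stop the chunkserver for good.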
From: Alexander A. <akh...@ri...> - 2011-06-29 13:41:08
|
Hi Stas! Please see this: http://www.moosefs.org/moosefs-faq.html#add_remove wbr Alexander ====================================================== Hi. What is the best way to retire a chunkserver? Is there a command that can do this? Or can I just stop mfschunkserver and let MFS resync the chunks to other chunkservers in the cluster? Thanks. |
From: Stas O. <sta...@gm...> - 2011-06-29 13:24:57
|
Hi. What is the best way to retire a chunkserver? Is there a command that can do this? Or can I just stop mfschunkserver and let MFS resync the chunks to other chunkservers in the cluster? Thanks. |
From: jyc <mai...@gm...> - 2011-06-29 13:02:39
|
I reply to myself. I ran a memtest on the server. The BIOS had ECC enabled, but the memory was not ECC-capable. memtest reported an error once, which explains the bad CRC. My conclusion: don't test MooseFS on hardware that is too old :-)

jyc wrote:
> Hi everyone,
>
> I have a cluster with 4 chunkservers running MooseFS, and on one server I have this problem:
> the filesystem check info had reported no problems for a long time, but now I get one chunk
> with a goal of one and zero valid copies.
>
> In the log I get:
>
> Jun 28 14:53:39 read_block_from_chunk:
> file:/opt/ba1d3/mfs/2B/chunk_000000000000962B_00000001.mfs - crc error
>
> but the file was written five days before:
>
> mfs mfs 65M jun 23 03:04
> /opt/ba1d3/mfs/2B/chunk_000000000000962B_00000001.mfs
>
> Can you explain why this chunk was correct for five days and now it seems that it is not?
>
> By the way, which process does the CRC check? Is it the mfschunkserver that checks it
> (I think it must be)? And where is the "correct" CRC stored? In the mfsmaster process/file?
>
> (On this server I have 4 HDDs, all with XFS filesystems. I changed the motherboard;
> the system is still the same after changing the motherboard. Kernel 2.6.26-2-686.)
>
> Any clue? |
From: youngcow <you...@gm...> - 2011-06-29 12:27:07
|
I think you can use RAID technology to protect against drive failures on one chunkserver.

> I have a sense that this question is due to confusing drives and chunkservers. MFS only replicates across chunkservers. So, if you have multiple drives attached to a single chunkserver, MFS won't replicate inside of the chunkserver.
>
> This brings up my question - can one safely run multiple chunkserver processes on a single machine?
>
> max
>
> On Jun 14, 2011, at 4:26 PM, Laurent Wandrebeck wrote:
>
>> On Mon, 13 Jun 2011 17:20:39 -0700
>> Howie Chen <ho...@he...> wrote:
>>
>>> Dear MFS users,
>> Hello,
>>>
>>> How come my moosefs cannot replicate data automatically? Where to turn on this option?
>> Either you're joking, or you forgot to set goal >1 (mfssetgoal), or you
>> didn't wait long enough once the goal was set? It can take a couple
>> minutes.
>> Oh, and if you have a single chunkserver, a goal >1 isn't taken care
>> of, because it has absolutely no use.
>> Hope it helps,
>> --
>> Laurent Wandrebeck
>> HYGEOS, Earth Observation Department / Observation de la Terre
>> Euratechnologies
>> 165 Avenue de Bretagne
>> 59000 Lille, France
>> tel: +33 3 20 08 24 98
>> http://www.hygeos.com
>> GPG fingerprint/Empreinte GPG: F5CA 37A4 6D03 A90C 7A1D 2A62 54E6 EF2C
>> D17C F64C |
From: Laurent W. <lw...@hy...> - 2011-06-29 08:31:01
|
On Wed, 29 Jun 2011 16:07:49 +0800 Max Cantor <mxc...@gm...> wrote:
> I have a sense that this question is due to confusing drives and chunkservers. MFS only replicates across chunkservers. So, if you have multiple drives attached to a single chunkserver, MFS won't replicate inside of the chunkserver.

Agreed.

> This brings up my question - can one safely run multiple chunkserver processes on a single machine?
>
> max

Provided you take care of the port numbers and the partitions used by the different CS processes, yes. But you lose fault tolerance: a failing box will take down several CSes, which is pretty bad. You definitely don't want to do that :) HTH, -- Laurent Wandrebeck HYGEOS, Earth Observation Department / Observation de la Terre Euratechnologies 165 Avenue de Bretagne 59000 Lille, France tel: +33 3 20 08 24 98 http://www.hygeos.com GPG fingerprint/Empreinte GPG: F5CA 37A4 6D03 A90C 7A1D 2A62 54E6 EF2C D17C F64C |
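If you did want a second chunkserver process on one box despite the fault-tolerance warning above, the instances have to be kept apart as Laurent says. A rough sketch of the second instance's config; the option names should be verified against your mfschunkserver.cfg man page, and every value here is an assumption:

    # /etc/mfschunkserver2.cfg -- second instance on the same host
    CSSERV_LISTEN_PORT = 9423              # the first instance keeps the default 9422
    DATA_PATH = /var/lib/mfs-cs2           # separate lock/state directory
    HDD_CONF_FILENAME = /etc/mfshdd2.cfg   # separate, non-overlapping disk list

The second instance would then be started with something like mfschunkserver -c /etc/mfschunkserver2.cfg so it does not pick up the default config.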
From: Max C. <mxc...@gm...> - 2011-06-29 08:08:01
|
I have a sense that this question is due to confusing drives and chunkservers. MFS only replicates across chunkservers. So, if you have multiple drives attached to a single chunkserver, MFS won't replicate inside of the chunkserver.

This brings up my question - can one safely run multiple chunkserver processes on a single machine?

max

On Jun 14, 2011, at 4:26 PM, Laurent Wandrebeck wrote:
> On Mon, 13 Jun 2011 17:20:39 -0700
> Howie Chen <ho...@he...> wrote:
>
>> Dear MFS users,
> Hello,
>>
>> How come my moosefs cannot replicate data automatically? Where to turn on this option?
> Either you're joking, or you forgot to set goal >1 (mfssetgoal), or you
> didn't wait long enough once the goal was set? It can take a couple
> minutes.
> Oh, and if you have a single chunkserver, a goal >1 isn't taken care
> of, because it has absolutely no use.
> Hope it helps,
> --
> Laurent Wandrebeck
> HYGEOS, Earth Observation Department / Observation de la Terre
> Euratechnologies
> 165 Avenue de Bretagne
> 59000 Lille, France
> tel: +33 3 20 08 24 98
> http://www.hygeos.com
> GPG fingerprint/Empreinte GPG: F5CA 37A4 6D03 A90C 7A1D 2A62 54E6 EF2C
> D17C F64C |
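A concrete illustration of the goal setting Laurent refers to, run from a client mount; the mount point, path, and goal value are examples only:

    # require 2 copies of everything under this directory (recursively)
    mfssetgoal -r 2 /mnt/mfs/data
    # check what is currently set
    mfsgetgoal /mnt/mfs/data

With a single chunkserver the extra copies cannot be placed anywhere, which is why a goal above 1 has no effect in that case.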
From: Robert S. <rsa...@ne...> - 2011-06-29 03:01:17
|
I have been seeing crashes of mfsmaster every few days. Generally it happens within a few minutes of the hour. For example 8:02 or 7:02. It has happened on June the 9th, 16th and 26th either around 7 AM or 8 AM. I am using mfs 1.6.20 on Centos 5.6 64-bit. Currently there are only 2 chunk servers and 2 loggers. The one chunk server is also the master. Both chunk servers also mount the volume locally and serve the content through Apache and DNS based load balancing. The volumes are also mounted by other machines which may re-export it using samba or use it locally. Any ideas of what I can do to troubleshoot/prevent this? Some of the logs before the crash: Jun 26 08:02:37 master mfsmaster[12670]: connection with client(ip:127.0.0.1) has been closed by peer Jun 26 08:02:39 master mfsmaster[12670]: connection with client(ip:xxx.xxx.x.14) has been closed by peer Jun 26 08:02:43 master mfsmaster[12670]: connection with client(ip:xxx.xxx.x.139) has been closed by peer Jun 26 08:02:56 master mfsmaster[12670]: connection with client(ip:xxx.xxx.x.15) has been closed by peer Jun 26 08:02:56 master mfsmaster[12670]: got chunk status, but don't want it Jun 26 08:02:56 master mfsmaster[12670]: connection with CS(xxx.xxx.x.55) has been closed by peer Jun 26 08:02:56 master mfsmaster[12670]: chunkserver disconnected - ip: xxx.xxx.x.55, port: 9422, usedspace: 21515155550208 (20037.55 GiB), totalspace: 25999924264960 (24214.32 GiB) Jun 26 08:03:02 master mfsmaster[12670]: connection with ML(xxx.xxx.x.139) has been closed by peer Jun 26 08:03:02 master mfsmaster[12670]: connection with ML(xxx.xxx.x.14) has been closed by peer Jun 26 08:03:02 master mfsmaster[12670]: connection with client(ip:xxx.xxx.x.139) has been closed by peer Jun 26 08:03:02 master mfsmaster[12670]: connection with ML(xxx.xxx.x.139) has been closed by peer Jun 26 08:03:02 master mfsmaster[12670]: connection with client(ip:xxx.xxx.x.15) has been closed by peer Jun 26 08:03:02 master mfsmaster[12670]: chunkserver register begin (packet version: 5) - ip: xxx.xxx.x.55, port: 9422 Jun 26 08:03:02 master mfsmaster[12670]: connection with ML(xxx.xxx.x.14) has been closed by peer Jun 26 08:03:02 master mfsmaster[12670]: connection with client(ip:xxx.xxx.x.14) has been closed by peer Jun 26 08:03:02 master mfsmaster[12670]: connection with client(ip:xxx.xxx.x.139) has been closed by peer Jun 26 08:03:37 master mfsmaster[12670]: connection with client(ip:xxx.xxx.x.14) has been closed by peer Jun 26 08:03:37 master mfsmaster[12670]: connection with CS(xxx.xxx.x.55) has been closed by peer Jun 26 08:03:37 master mfsmaster[12670]: chunkserver disconnected - ip: xxx.xxx.x.55, port: 9422, usedspace: 0 (0.00 GiB), totalspace: 0 (0.00 GiB) Jun 26 08:03:39 master mfsmaster[12670]: connection with client(ip:xxx.xxx.x.15) has been closed by peer Jun 26 08:03:40 master mfsmaster[12670]: connection with client(ip:xxx.xxx.x.139) has been closed by peer Jun 26 08:03:40 master mfsmaster[12670]: connection with client(ip:xxx.xxx.x.14) has been closed by peer Jun 26 08:03:40 master mfsmaster[12670]: connection with client(ip:xxx.xxx.x.15) has been closed by peer Robert |
From: Robert S. <rsa...@ne...> - 2011-06-29 02:30:32
|
I have been moving data from existing non-distributed file systems onto a MooseFS file system. I am using 1.6.20 on CentOS 5.6. While moving the data I have also transferred some of the normal read-only traffic load to use the data already moved onto the MFS volume. What I can see is that whenever there is any significant read traffic, write traffic slows down to a crawl. When I look at the server charts generated by mfscgiserv for any of the chunk servers, read and write traffic seem to alternate. Write traffic does not stop completely, but seems to slow down to < 10 kB per second under high read traffic conditions. When the read traffic decreases, the write traffic will increase to normal levels. Is this a known problem? Is there something I can do to ensure that write traffic is not starved by read traffic? Robert |
From: Zachary A W. <zw...@ii...> - 2011-06-29 01:46:15
|
Hello, I am fairly new to Linux and extremely new to MooseFS. I am trying to set up a testing network of 5 systems for my professor: 1 Master, 1 Metalogger, 2 Chunkservers (each with 2 mounted partitions), and 1 client. Everything seemed to go well (no errors, all the mfs machines started without error) until I neared the end of Client setup in your step-by-step guide. After I entered the "df -h | grep mfs" command, I received data for the master but not for the chunkservers' partitions (which are named mfschunks11, mfschunks12, mfschunks21, and mfschunks22). I was wondering if this was normal? If not, do you know anything that may be wrong or that might help me? Thank you, Zach |
From: jyc <mai...@gm...> - 2011-06-28 14:57:57
|
Hi everyone, I have a cluster with 4 chunkservers running MooseFS, and on one server I have this problem: the filesystem check info had reported no problems for a long time, but now I get one chunk with a goal of one and zero valid copies. In the log I get:

Jun 28 14:53:39 read_block_from_chunk: file:/opt/ba1d3/mfs/2B/chunk_000000000000962B_00000001.mfs - crc error

but the file was written five days before:

mfs mfs 65M jun 23 03:04 /opt/ba1d3/mfs/2B/chunk_000000000000962B_00000001.mfs

Can you explain why this chunk was correct for five days and now it seems that it is not? By the way, which process does the CRC check? Is it the mfschunkserver that checks it (I think it must be)? And where is the "correct" CRC stored? In the mfsmaster process/file? (On this server I have 4 HDDs, all with XFS filesystems. I changed the motherboard; the system is still the same after changing the motherboard. Kernel 2.6.26-2-686.) Any clue? |
From: Ólafur Ó. <osv...@ne...> - 2011-06-28 11:14:57
|
Hi, I agree it seems to have no effect on anything other than mfsexports.cfg reloading. We are running Xen VM images on the MFS partition and I'm not sure how they will handle the mfsmaster restarting; I will have to set up a test environment and try it.

Reading the man page for mfsmaster.cfg, I see the comments for CHUNKS_LOOP_TIME and CHUNKS_DEL_LIMIT, and my understanding is that with the default values the maximum number of chunks to delete in one loop (300 seconds) is 100. It does not say whether that is per chunkserver or for the whole system, but each server here was at around and over 5,000 chunk deletions per minute, and with 10 servers that's over 50k chunk deletions per minute for the whole system.

CHUNKS_LOOP_TIME - Chunks loop frequency in seconds (default is 300)
CHUNKS_DEL_LIMIT - Maximum number of chunks to delete in one loop (default is 100)
CHUNKS_WRITE_REP_LIMIT - Maximum number of chunks to replicate to one chunkserver in one loop (default is 1)
CHUNKS_READ_REP_LIMIT - Maximum number of chunks to replicate from one chunkserver in one loop (default is 5)

Deleting 100 and only replicating 1, that's quite a difference. Still puzzled by this. /Oli

On 28.6.2011, at 10:54, rxknhe wrote:

I am not sure if SIGHUP is going to work. It works for a few things like re-reading the mfsexports.cfg file, but we also noticed that it won't work when changing parameters in mfsmaster.cfg, and one has to restart the master server process. Although we found that restarting the master server process is normally safe, I agree that the better way would be to handle it via SIGHUP, as it is a little scary to restart the master in a production system.

2011/6/28 Ólafur Ósvaldsson <osv...@ne...>

Hi, When it was at its peak I tried changing that value down to 10 and ultimately to 1 and sending the mfsmaster a SIGHUP, but nothing changed. For us, restarting the master is not an option, since all the systems were still getting service, although very slowly. /Oli

On 27.6.2011, at 21:52, rxknhe wrote:

On the master, maybe try tweaking this option in the mfsmaster.cfg file and restarting the master server process: # CHUNKS_DEL_LIMIT = 100 - uncomment it and try lower values.

2011/6/27 Ólafur Ósvaldsson <osv...@ne...>

Hi, Our system had just over 4.5 million chunks until one week ago, when a good deal of the data was deleted (on purpose). Today the trash time expired and it seems that our master is deleting all the unused chunks, which is quite normal, except that each of the 10 servers is doing up to 5.5k chunk deletions per minute, all systems accessing the MFS partitions are very slow, and that started at the same time as the chunk deletions. Is there any way to tune MFS so that chunk deletions don't have such an impact on the system? Can we somehow control how many chunks are deleted at once, set a maximum rate per minute, or maybe have it somehow linked to chunkserver/master load? The network links are not saturated and none of the servers seem to be loaded at all. The disk IO on the master even went down when this started, as did the CPU usage of the server, but the memory usage has gone up about 30%. Could it be that the master process is not handling all this at once? /Oli

-- Ólafur Osvaldsson System Administrator Nethonnun ehf. e-mail: osv...@ne... phone: +354 517 3400 |
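For reference, the throttling discussed above would be expressed in mfsmaster.cfg roughly as follows. The values are illustrative, and as rxknhe notes a SIGHUP may not pick these particular options up on 1.6.x, so a master restart may be required for them to take effect:

    # /etc/mfsmaster.cfg -- slow down trash purging
    CHUNKS_LOOP_TIME = 300    # chunk loop frequency in seconds (default 300)
    CHUNKS_DEL_LIMIT = 10     # maximum chunk deletions per loop (default 100)

Whether the deletion limit applies per chunkserver or system-wide is exactly the ambiguity Ólafur raises above, so any chosen value should be validated against the deletion rates shown in the CGI interface.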
From: Ólafur Ó. <osv...@ne...> - 2011-06-28 08:27:54
|
Hi, When it was at its peak I tried changing that value down to 10 and ultimately to 1 and sending the mfsmaster a SIGHUP, but nothing changed. For us, restarting the master is not an option, since all the systems were still getting service, although very slowly. /Oli

On 27.6.2011, at 21:52, rxknhe wrote:

On the master, maybe try tweaking this option in the mfsmaster.cfg file and restarting the master server process: # CHUNKS_DEL_LIMIT = 100 - uncomment it and try lower values.

2011/6/27 Ólafur Ósvaldsson <osv...@ne...>

Hi, Our system had just over 4.5 million chunks until one week ago, when a good deal of the data was deleted (on purpose). Today the trash time expired and it seems that our master is deleting all the unused chunks, which is quite normal, except that each of the 10 servers is doing up to 5.5k chunk deletions per minute, all systems accessing the MFS partitions are very slow, and that started at the same time as the chunk deletions. Is there any way to tune MFS so that chunk deletions don't have such an impact on the system? Can we somehow control how many chunks are deleted at once, set a maximum rate per minute, or maybe have it somehow linked to chunkserver/master load? The network links are not saturated and none of the servers seem to be loaded at all. The disk IO on the master even went down when this started, as did the CPU usage of the server, but the memory usage has gone up about 30%. Could it be that the master process is not handling all this at once? /Oli

-- Ólafur Osvaldsson System Administrator Nethonnun ehf. e-mail: osv...@ne... phone: +354 517 3400 |
From: Stas O. <sta...@gm...> - 2011-06-27 23:23:48
|
More on this - every time running mount -a, I see there are 2 processes: 14630 ? S<sl 0:00 mfsmount /mnt/mfs -o rw,mfsmaster=master1 14801 pts/0 S<l 0:00 mfsmount /mnt/mfs -o rw,mfsmaster=master1 Then after some time, the 2nd process stops - probably due to below error. Any idea why mfsmount tries to re-mount itself on every mount -a request? Thanks. On Tue, Jun 28, 2011 at 2:10 AM, Stas Oskin <sta...@gm...> wrote: > This appears when mounting manually: > > fuse: mountpoint is not empty > fuse: if you are sure this is safe, use the 'nonempty' mount option > error in fuse_mount > > But the mounting still succeeds, and I can browse files on cluster. > > Any idea if this related? > > > On Mon, Jun 27, 2011 at 10:48 PM, Stas Oskin <sta...@gm...> wrote: > >> This is the backtrace: >> >> #0 0x0000003cd8a30265 in raise () from /lib64/libc.so.6 >> #1 0x0000003cd8a31d10 in abort () from /lib64/libc.so.6 >> #2 0x0000003cd8a6a84b in __libc_message () from /lib64/libc.so.6 >> #3 0x0000003cd8a7230f in _int_free () from /lib64/libc.so.6 >> #4 0x0000003cd8a7276b in free () from /lib64/libc.so.6 >> #5 0x000000000040eaba in fuse_reply_entry () >> #6 0x00000000004121c9 in fuse_reply_entry () >> #7 0x0000000000412580 in fuse_reply_entry () >> #8 0x0000003cd8a1d994 in __libc_start_main () from /lib64/libc.so.6 >> #9 0x0000000000402519 in fuse_reply_entry () >> #10 0x00007fff48cb59d8 in ?? () >> #11 0x0000000000000000 in ?? () >> (gdb) >> >> On Mon, Jun 27, 2011 at 10:41 PM, Stas Oskin <sta...@gm...>wrote: >> >>> Hi. >>> >>> We found an issue where mfsmount, when mounted via the autofs script, is >>> segfaults. >>> >>> 1) Is this a known issue? >>> >>> 2) What are other alternatives of mounting mfsmount? >>> >>> Thanks! >>> >> >> > |
From: Stas O. <sta...@gm...> - 2011-06-27 23:10:26
|
This appears when mounting manually: fuse: mountpoint is not empty fuse: if you are sure this is safe, use the 'nonempty' mount option error in fuse_mount But the mounting still succeeds, and I can browse files on cluster. Any idea if this related? On Mon, Jun 27, 2011 at 10:48 PM, Stas Oskin <sta...@gm...> wrote: > This is the backtrace: > > #0 0x0000003cd8a30265 in raise () from /lib64/libc.so.6 > #1 0x0000003cd8a31d10 in abort () from /lib64/libc.so.6 > #2 0x0000003cd8a6a84b in __libc_message () from /lib64/libc.so.6 > #3 0x0000003cd8a7230f in _int_free () from /lib64/libc.so.6 > #4 0x0000003cd8a7276b in free () from /lib64/libc.so.6 > #5 0x000000000040eaba in fuse_reply_entry () > #6 0x00000000004121c9 in fuse_reply_entry () > #7 0x0000000000412580 in fuse_reply_entry () > #8 0x0000003cd8a1d994 in __libc_start_main () from /lib64/libc.so.6 > #9 0x0000000000402519 in fuse_reply_entry () > #10 0x00007fff48cb59d8 in ?? () > #11 0x0000000000000000 in ?? () > (gdb) > > On Mon, Jun 27, 2011 at 10:41 PM, Stas Oskin <sta...@gm...> wrote: > >> Hi. >> >> We found an issue where mfsmount, when mounted via the autofs script, is >> segfaults. >> >> 1) Is this a known issue? >> >> 2) What are other alternatives of mounting mfsmount? >> >> Thanks! >> > > |
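If the leftover files in the mount point are known to be stale, the FUSE warning above can be silenced by passing the nonempty option through mfsmount. A sketch, with the master name and mount point taken from the post; use it only if the existing directory contents really can be ignored:

    # mount over a non-empty directory; the old contents become hidden, not deleted
    mfsmount /mnt/mfs -o rw,nonempty,mfsmaster=master1

This only addresses the warning itself; the duplicate mfsmount processes spawned by mount -a and the segfault reported below are separate issues.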
From: Stas O. <sta...@gm...> - 2011-06-27 19:48:45
|
This is the backtrace: #0 0x0000003cd8a30265 in raise () from /lib64/libc.so.6 #1 0x0000003cd8a31d10 in abort () from /lib64/libc.so.6 #2 0x0000003cd8a6a84b in __libc_message () from /lib64/libc.so.6 #3 0x0000003cd8a7230f in _int_free () from /lib64/libc.so.6 #4 0x0000003cd8a7276b in free () from /lib64/libc.so.6 #5 0x000000000040eaba in fuse_reply_entry () #6 0x00000000004121c9 in fuse_reply_entry () #7 0x0000000000412580 in fuse_reply_entry () #8 0x0000003cd8a1d994 in __libc_start_main () from /lib64/libc.so.6 #9 0x0000000000402519 in fuse_reply_entry () #10 0x00007fff48cb59d8 in ?? () #11 0x0000000000000000 in ?? () (gdb) On Mon, Jun 27, 2011 at 10:41 PM, Stas Oskin <sta...@gm...> wrote: > Hi. > > We found an issue where mfsmount, when mounted via the autofs script, is > segfaults. > > 1) Is this a known issue? > > 2) What are other alternatives of mounting mfsmount? > > Thanks! > |
From: Stas O. <sta...@gm...> - 2011-06-27 19:41:52
|
Hi. We found an issue where mfsmount, when mounted via the autofs script, segfaults. 1) Is this a known issue? 2) What are the alternatives for mounting mfsmount? Thanks! |