From: Wilson, S. M <st...@pu...> - 2018-10-19 12:47:38
|
?Thanks for the suggestion! I had thought about that also but a lot of our files need to be accessed simultaneously from different clients and, as you mentioned, this approach doesn't support that. Steve ________________________________ From: Alexander AKHOBADZE <ale...@op...> Sent: Friday, October 19, 2018 1:55 AM To: Wilson, Steven M; MooseFS-Users Subject: RE: Performance suggestions for millions of small files Hi Steve! You know... I'v made a conclution not store millions of small files on MooseFS ;--) In such case I use a big file stored on MooseFS as container, format it as EXT4 or XFS and then mount it on client side. Yes. I know that in such case small files are not clustered anymore ... it's a pity but here you are. Wbr Alexander (Anri) Akhobadze, ale...@op... System administrator, DATAVISION NN From: Wilson, Steven M [mailto:st...@pu...] Sent: Thursday, October 18, 2018 11:47 PM To: MooseFS-Users Subject: [MooseFS-Users] Performance suggestions for millions of small files Hi, We have ten different MooseFS installations in our research group and one, in particular, is struggling with poor I/O performance. This installation currently has 315 million files occupying 170TB of disk space (goal = 2). If anyone else has a similar installation, I would like to hear what you have done to maintain performance at a reasonable level. Here are some metrics to give a basic idea of the performance characteristics. I'll include in parentheses the range of measurements from other MFS installations with far fewer files for comparison. * tar xf linux-4.9-rc3.tar: 1185 secs (220 - 296 secs) * smallfile test, create MB/s: 0.8 (2.3 - 4.8) <== Ouch! * smallfile test, read MB/s: 10.7 (12.8 - 15.4) * smallfile test, append MB/s: 6.1 (3.0 - 7.7) It looks file creation is where I'm losing most of my performance compared to the other installations. My master server has a Xeon E5-1630v3 3.7GHz CPU with 256GB of DDR4 2133MHz memory. I tried several mfsmount options but the only one that showed any significant improvement was the mfsfsyncmintime option ("mfsfsyncmintime=5"). As to be expected, the improvement gained was during the write/append operation. Here are the results using the same tests as above: * tar xf linux-4.9-rc3.tar: 683 secs * smallfile test, create MB/s: 1.2 * smallfile test, read MB/s: 11.7 * smallfile test, append MB/s: 11.4 <== Dramatic improvement over 6.1 MB/s The smallfile benchmark test I used is ?from https://github.com/distributed-system-analysis/smallfile. Thanks for any suggestions you might have! Regards, Steve |