From: Elliot F. <efi...@gm...> - 2011-08-10 15:56:44
|
On Tue, Aug 9, 2011 at 6:46 PM, Robert Sandilands <rsa...@ne...> wrote: > Increasing the swap space fixed the fork() issue. It seems that you have to > ensure that memory available is always double the memory needed by > mfsmaster. None of the swap space was used over the last 24 hours. > > This did solve the extreme comb-like behavior of mfsmaster. It still does > not resolve its sensitivity to load on the server. I am still seeing > timeouts on the chunkservers and mounts on the hour due to the high CPU and > I/O load when the meta data is dumped to disk. It did however decrease > significantly. > > An example from the logs: > > Aug 9 04:03:38 http-lb-1 mfsmount[13288]: master: tcp recv error: ETIMEDOUT > (Operation timed out) (1) > Aug 9 04:03:39 http-lb-1 mfsmount[13288]: master: register error (read > header: ETIMEDOUT (Operation timed out)) > Aug 9 04:03:41 http-lb-1 mfsmount[13288]: registered to master Are you using this server as a combination mfsmaster/chunkserver/mfsclient? If so, is the metadata being written to a spindle(s) that are separate from what the chunkserver is using? How is this box laid out? Elliot |