From: Jun C. P. <jun...@gm...> - 2011-01-13 23:27:58
Hi,

I'm wondering how I can confirm that recovery from a failure (e.g., missing chunks due to a node failure) has completed.

Consider a slightly more involved scenario, assuming the goal (setgoal) is 3 and there are three chunkservers:

1. A chunkserver goes down for 5 minutes.
2. During those 5 minutes, many existing files are updated and new files are created, so the relevant chunks are written only to the two chunkservers that are still alive.
3. After 5 minutes, the failed chunkserver comes back.

I assume the metadata server will then recognize that some chunks are missing and that some existing chunks have received new updates. Using mfsfileinfo and mfscheckfile, I could see that the metadata server gradually repairs the inconsistent state by creating the missing chunks. However, I could not tell whether the new updates to the existing chunks were also synchronized.

Is there a specific way or command to check that the whole recovery process, including the new updates to chunks, has really finished?

Thanks,
-Jun