From: Natalia T. <nt...@ce...> - 2006-07-07 11:45:11
Thanks Michael, I'll experiment with indexing the job this way.

About the indexing process: I'm testing how it works (Heritrix + Hadoop + NutchWax + Wera) with our web, and I'm running it in standalone mode with one crawled job (about 7 ARC files, 700 MB). I want to start a Hadoop cluster, but I don't know how many slaves to set up or what hardware it requires. I'm looking for information about benchmarks and indexing performance to learn more about the hardware needed, but I can't find anything.

Thanks,
Natalia