|
From: <kau...@cs...> - 2005-11-11 12:35:07
|
On 11/11/2005, "stack" <st...@ar...> wrote: > The below should be fixed by upgrade to nutchwax 0.4.1, if you haven't > already. >=20 > St.Ack Yes I noticed that errors in my archive were of the same type as reported later by other people. After installing nutchwax 0.4.1 the archive looks better now, thanks very much. I still have some isolated cases where a file is inside archive but wera shows 'not found'. Here are some examples of problem urls Perhaps '&' right after '?' is too much http://www.helsinki2005.fi/index.php?&Lang=3Deng http://www.helsinki2005.fi/index.php?&Name=3Dxteams http://www.helsinki2005.fi/index.php?&Name=3Dtickets http://www.noc.fi/mp/db/tiedotteet/foo/IMG?num=3D18157&FIELD=3Dkuva0_kk&R=3D8= 97830 http://www.noc.fi/taustasivut/artikkeliarkisto/?num=3D17075&JKNUM=3D17075 http://www.slu.fi/mp/db/tiedotteet/foo/IMG?num=3D29512&FIELD=3Dkuva0_pieni&R= =3D064184 Other urls with '&' work fine, but these with 'Name=3Dsomething&' do not. http://www.helsinki2005.fi/index.php?Name=3Dnewsitem&item=3D322 http://www.helsinki2005.fi/index.php?Name=3Dnewsitem&item=3D405 http://www.helsinki2005.fi/index.php?Name=3Dtickets&lang=3Deng When I make in wera a query url:http://www.helsinki2005.fi/index.php?Name=3Dtickets&lang=3Deng it reports 102 hits, the first one being http://www.helsinki2005.fi/index.php?Name=3Dtickets_1 but wera only wants to display hits 1-10 and 11-16. For some reason all images with a '%' character in url still refuse to come out. This could apply to html file urls as well if there were any in the archive. http://www.helsinki2005.fi/files/pics/1079364264_mascot%20medium.gif I'm not sure which part of nutchwax&wera combination causes it. Kaisa |