From: Reini U. <ru...@x-...> - 2004-12-08 14:17:53
|
Rui Carmo schrieb: > Besides yesterday's fix, I spent quite some time figuring out why I > still had another "hit" on the search pages. At first I thought it was > something inside the search code, but when I started logging IP > addresses it became obvious. > > It turned out that AdSense will trigger an _immediate_ hit from the > google bot if I display ads on my *Search pages, which prompted me to > include this little snippet right at the start of index.php (to waste as > little resources as possible): > > > define( "DUMB_BOTS", > '/(JPluck|Mediapartners|ia_archiver|googlebot|msnbot|Crawl)/i' ); > > if( preg_match( DUMB_BOTS, $_SERVER['HTTP_USER_AGENT'] ) ) { > if( preg_match( '/\?(s|action|version)=/', $_SERVER['REQUEST_URI'] ) ) { > header( "HTTP/1.1 404 File Not Found" ); > echo( "<H1>404 File Not Found</H1>" ); > exit; > } > } That's not a good idea! Google's Ad Sense checks where the ads really appear on the page, and calculates the rank (= money!) from this info. If you reject the checker you will get no profit from AdSense at all. rejected the main googlebot is also not a good idea. The googlebot is a good thing. You just have to prepare for being "slashdotted" to death once in a while. Some referrer check or referrer throttling. -- Reini Urban http://xarch.tu-graz.ac.at/home/rurban/ |