From: Simon W. <es...@ou...> - 2003-02-13 15:59:18
|
On Thu, 2003-02-13 at 15:26, Wizard wrote: > > I *think* that fr...@ao... should have been blocked by *@*.aol.* but > > it doesn't seem to match on all fields. > > If '.bl' is a valid tld (like '.co') then I'll add it to the $valid_tlds > variable and it should parse correctly. I wasn't aware of it. If it > conflicts with a country code, then I will have a problem. .co is not a valid tld, either generic or country code. See http://www.din.de/gremien/nas/nabd/iso3166ma/codlstp1/db_en.html .bl is not a tld in this example, uk is. It happens that the British Library for reasons dating back to antiquity, sit outside the generally accepted standard of .(co|ac|org).uk aol.police.uk is really going to mess you up :) I still believe that you cannot parse the ORGANISATION out of the domain part of the email address without more semantic information than you have available. You cannot expect to have *.aol.* block just websites belonging to AOL/Time Warner. You might work on something that allows *.aol.*.uk or *.aol.*.* to match in the third level domain. In fact, I'm doing this right now. I just need to test it... Simon. |