From: Jeff R. <py...@fi...> - 2005-09-20 22:15:24
Hi,

Just thought I would share a small patch that deals with a number of single-use email addresses that the existing regex in sqlgrey does not recognize. These are addresses of the form bounce-return-12310123981 and the like. The patch tries to mask the parts that appear to be unique, so the database doesn't fill up with addresses that will never be used again.

I somewhat arbitrarily decided that if the name part of an address contains a delimiter such as "-", "_", or "." along with a string of 12 or more alphanumeric characters, then those characters should be masked. That may cause some addresses to be masked when they should not be, and others to be missed when they should be masked. I don't believe the result will be tragic in either case, and the threshold can be adjusted to your liking.

It might not work as well for other folks, but it catches the major patterns I see. I am sure there are others that I didn't catch simply because they don't come up frequently in my email mix.

Jeff
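
For readers who want to experiment with the heuristic described above, here is a minimal Python sketch. It is not the patch itself (sqlgrey is written in Perl, and the patch is not reproduced in this message); the function names, the "#" mask character, and the reading of the rule as "any delimiter-separated piece of 12 or more alphanumerics" are all assumptions made for illustration.

```python
import re

# Assumptions (not taken from the actual patch):
# - a "unique" piece is a run of MIN_RUN or more alphanumerics that is
#   separated from the rest of the name by "-", "_" or "."
# - each such piece is replaced by a single MASK character
MIN_RUN = 12
MASK = "#"
DELIMITERS = re.compile(r"([-_.])")


def mask_local_part(local: str) -> str:
    """Mask long alphanumeric pieces in the part before the '@'."""
    if not DELIMITERS.search(local):
        return local  # no delimiter at all: leave the name alone
    # The capturing group keeps the delimiters in the split result.
    pieces = DELIMITERS.split(local)
    masked = [
        MASK if len(p) >= MIN_RUN and p.isalnum() else p
        for p in pieces
    ]
    return "".join(masked)


def mask_address(address: str) -> str:
    """Apply the masking to a full address; the domain is left untouched."""
    local, at, domain = address.rpartition("@")
    if not at:
        return address  # not an address of the form name@domain
    return mask_local_part(local) + "@" + domain
```

With these assumptions, mask_address("bounce-return-123101239810@example.com") gives "bounce-return-#@example.com", while a short name such as john.smith@example.com passes through unchanged. An ordinary name that happens to contain a piece of 12 or more letters would be masked as well, which is the kind of false positive mentioned above; raising MIN_RUN trades that against missed single-use addresses.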