From: Lionel B. <lio...@bo...> - 2005-04-24 20:05:40
Michel Bouissou wrote the following on 23.04.2005 20:30 :

>I also find values such as:
>
>bounce-#-5541b8d889bbaf2de05c02b4a0af204f93174a98-#
>
>in the "sender_name" column of from_awl, for some VERP messages that are
>received on a regular basis.
>
>I believe such addresses should have been collapsed into:
>
>bounce-#-#-#
>
>

I believed the same too :-/

>...so there may be a bug in the substitution regexp for such cases.
>
>

BTW, I just found out that we were matching "SRS" in normalize_sender, but it is called after lowercasing the whole string... I've never seen an SRS email, but I think the current code can't match one, since the pattern is uppercase. I lowercased the corresponding regexp in my tree.

>The original corresponding "true" sender localpart for this example was (case
>respected):
>
>bounce-439452-5541B8D889BBAF2DE05C02B4A0AF204F93174A98-439250@...
>
>

I'm adding a while loop that calls the regexp until the string doesn't change anymore; this should strip each hex sequence one by one. I'm also looking at the s/orig/dest/g syntax to find the problem (it might come from the variable usage in dest: $1#$2).

>(Note that the 2 parts that have been correctly substituted were [0-9]+, where
>the part that wasn't substituted was Hex, and the original address was
>uppercase Hex, where SQLgrey stored it in from_awl as lowercase Hex...)
>
>
>

SQLgrey always lowercases the e-mail addresses.

Lionel.
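
For illustration, here is a minimal sketch of the "repeat until the string stops changing" idea. It is written in Python rather than SQLgrey's Perl, and the pattern below is hypothetical, not SQLgrey's actual regexp: it only assumes that a delimiter is captured on each side of the sequence, as the `$1#$2` replacement suggests. Under that assumption, a single global pass can skip a sequence whose leading delimiter was already consumed by the previous match, which is one possible cause of the value seen in from_awl; the thread does not confirm it. The function names are invented for the example.

```python
import re

# Hypothetical pattern, NOT SQLgrey's actual regexp: a run of 4 or more
# hex/decimal characters with a delimiter (or string boundary) captured
# on each side, so the replacement can put the delimiters back.
TOKEN = re.compile(r"(^|[-_.=])[0-9a-f]{4,}([-_.=]|$)")


def collapse_once(localpart: str) -> str:
    """One global substitution pass, comparable to s/.../$1#$2/g."""
    return TOKEN.sub(r"\1#\2", localpart)


def collapse_until_stable(localpart: str) -> str:
    """Repeat the pass until the string no longer changes."""
    previous = None
    while previous != localpart:
        previous = localpart
        localpart = collapse_once(localpart)
    return localpart


# Lowercase first, since SQLgrey always lowercases addresses.
sender = "bounce-439452-5541B8D889BBAF2DE05C02B4A0AF204F93174A98-439250".lower()

single = collapse_once(sender)          # the middle sequence survives
stable = collapse_until_stable(sender)  # every sequence is collapsed

print(single)
print(stable)
```

In this sketch the first match consumes the "-" that follows 439452, so the hex sequence right after it has no leading delimiter left to match in the same pass; the second pass picks it up. A lookahead for the trailing delimiter would avoid the need for the loop, but that is a different fix from the one described above.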