From: Michel B. <mi...@bo...> - 2005-06-07 14:47:56
|
Hi there, I've noticed that there are situations where the connect table isn't cleaned up as much as it should at normal processing time, leaving its thorough cleanup to the cleanup task, keeping entries in connect longer than necessary, and resulting in messages being logged as "spam:" where they weren't spam (and were actually successfully retried and accepted by SQLgrey). First case sample: ============= Let's assume our group_domain_level = 3 for the sample from_awl contains: 123.231.12 joe bob.com 123.231.12 bill bob.com connect contains: 123.231.12 alice bob.com mi...@my... (Message #1) 123.231.12 sue bob.com pe...@my... (Message #2) Now let's suppose Message #1 comes back : 1/ "123.231.12 bob.com" moves to domain.awl 2/ from_awl gets cleaned of corresponding entries 3/ "Message #1" entry gets deleted from connect 4/ "Message #2" entry is NOT deleted from connect, although it matches the new entry in domain_awl 5/ When "Message #2" comes back, it is immediately accepted via domain_awl, and thus is NOT cleaned from connect. 6/ Cleanup of "Message #2" from connect will be done 24 hrs later by the cleanup tasks, and it will log it as "spam", where the message was actually represented, and accepted. 2nd case sample ============= (VERP-style, I saw something like this for real...) Let's suppose connect got a number of messages just after subscribing a VERP-style mailing-list: #1: gentoo+bounces-1119-me=mydom.net | gentoo.org | 140.105.134 | me...@my... #2: gentoo+bounces-1120-me=mydom.net | gentoo.org | 140.105.134 | me...@my... #3: gentoo+bounces-1121-me=mydom.net | gentoo.org | 140.105.134 | me...@my... #4: gentoo+bounces-1122-me=mydom.net | gentoo.org | 140.105.134 | me...@my... Now suppose Message #1 comes back : 1/ it gets added to from_awl in its de-VERP'd (here de-plussed) form: gentoo | gentoo.org | 140.105.134 2/ Entry #1 gets deleted from connect 3/ Entries #2-4 are NOT deleted from connect (although they match the entry just added to from_awl) 4/ When Messages #2-4 come back, they are immediately accepted via from_awl, and thus are NOT cleaned from connect. 6/ Cleanup of Messages #2-4 from connect will be done 24 hrs later by the cleanup tasks, and it will log them as "spam", where the messages were actually represented, and accepted. Hmmmmm.... Comments ? -- Michel Bouissou <mi...@bo...> OpenPGP ID 0xDDE8AC6E |