On 06/01, Benny Pedersen wrote: > Den 2012-06-01 03:54, firstname.lastname@example.org skrev:
> from 2007 ?
Yes, that data goes back to 2007. And the last tick on the right is July
2012, which the data lines haven't reached yet. > how does others make spamcorpus with mta rejecting spam from rbl &
> clamav milter ?
I actually stopped blocking on RBLs at my MTA to get better data for this.
"Real" spam sent to real users and manually verified is better than what
can easily be harvested in very large quantities with spamtraps. I'd say
the spam that gets through blocking on RBLs to real users is also much
better than what you can get from a spam trap.
What matters the most is the stuff that's hard to catch. And I fear
that is currently massively under-represented. It doesn't need to be
all spam that was sent to you in order to be useful for re-scoring.
I think when I started I was automatically rejecting on RBLs and
automatically rejecting everything over some spamassassin threshold, and
the people on this list at the time had no objection to me contributing the
spam that still got through. That's the most important stuff.
Spam traps are obviously very useful, but I'd say they're far from
representative of the spam end users ever see getting through their spam
"Let's just say that if complete and utter chaos was lightning, then
he'd be the sort to stand on a hilltop in a thunderstorm wearing wet
copper armour and shouting 'All gods are bastards'." - The Color of Magic http://www.ChaosReigns.com