spamassassin-dev September 2011 archive
Main Archive Page > Month Archives  > spamassassin-dev archives
spamassassin-dev: High ham rate in darxus corpora for URIBL_WS_S

High ham rate in darxus corpora for URIBL_WS_SURBL Re: ham scores

From: <darxus_at_nospam>
Date: Mon Sep 19 2011 - 17:01:14 GMT
To: Mark <mack@surbl.org>, dev@spamassassin.apache.org

On 09/17, Mark wrote:
> I noticed that the ham rate of WS in the darxus corpus is rather high.
>
> ruleqa.spamassassin.org/20110916-r1171450-n/URIBL_WS_SURBL/detail
>
> Could you check what is causing this please?

That shows 39 of 10523 hams hitting RUBLE_WS_SURBL.

1 of them was my fault, an email from a spam related mailing list
mentioning a web site that hit this rule. I've removed it from my
corpora.

The other 38 were notifications from livejournal.com, nothing spam
related, from 2011-08-02 to 2011-08-11. It looks like you just had
livejournal.com listed as a spammer for those 10 days. Those emails
are not hitting this rule now.

It looks like mass-check has the "reuse" flag set for this rule, so
mass-checks are accurately reflecting the accuracy of this rule at the
time the emails were received, which I believe is appropriate.

Thanks for keeping on top of our accuracy. It would be nice if this kind
of thing were done more.

-- "He who dies with the most toys... still dies." - No Fear http://www.ChaosReigns.com