[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Recent spam increase

Mike McCarty wrote:
[about the penny-stock image spams]
Yes, I get several a day myself. The actual "text" of the message is
often actually an image, while the body of the message is randomly
selected sentences or words from a collection which would make a
Bayesian filter delete most of my e-mail.

I delete them by hand, to prevent messing up my other filter which
is working reasonably well ...(snip)

I use a statistical filter (aka "bayesian") that seems to do a reasonable job. I block most of the spam I get directly by other means, but I submit some of the stock-peddling image spams I get through mailing-lists to the filter. Since I began running the filter I've received 30453 non-spam messages, caught 882 spams and had 36 false positives, and it has been ages since the last false positive. This is while submitting 202 mails for training puposes., that were missed by the filter. Like I said, the spams I want to catch is just from a couple of mailinglists that have been particuliarly badly hit, hence the low spam ratio.

The filter I use is known as dspam <http://dspam.nuclearelephant.com/>. I believe you might be pleasantly surprised by the other implementations aswell. The filter has more than the message text to work with. Everything from the MIME headers to the message headers contribute in characterizing the mail, so even though the text is random dspam will catch the spam.

Håkon Alstadheim

Reply to: