[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Favorite anti-spam tool



on Thu, 01 May 2003 01:48:59PM +0100, Colin Watson insinuated:
> On Wed, Apr 30, 2003 at 09:21:20PM -0400, Nori Heikkinen wrote:
> > Cool, I just upgraded to 2.53, and it seems better.  Can someone
> > explain to me how a Bayesian filter would work in a static
> > context?  Does it train itself based on the threshold you provide?
> > My impression (though this could be wrong, as I've only ever used
> > spamassassin) is that on other commonly-used Bayesian spamfilters,
> > you have to manually train it on x number of emails before it
> > learns what you consider spam and what you don't.  How does
> > spamassassin -- which is procmail-based -- train itself?
> 
> You can train it by hand using sa-learn. If you don't, it
> auto-learns based on its own scores: anything below
> auto_learn_threshold_nonspam (default -2.0) gets auto-learned as
> ham, and anything above auto_learn_threshold_spam (default 15.0)
> gets auto-learned as spam. You see 'autolearn=ham' or
> 'autolearn=spam' in the headers when this happens, so you can
> correct it with sa-learn if need be.

cool -- i just bound a mutt macro to '|/usr/bin/sa-learn --file
--ham', and put in a cron job to sa-learn my spamfolder as such every
morning.  that should train it pretty well, and without too much
hassle!

thanks!

</nori>

-- 
    .~.      nori @ sccs.swarthmore.edu
    /V\  http://www.sccs.swarthmore.edu/~nori/jnl/
   // \\          @ maenad.net
  /(   )\       www.maenad.net
   ^`~'^
            get my (*new*) key here:
   http://www.maenad.net/geek/gpg/7ede5499.asc
      (please *remove* old key 11e031f1!)

Attachment: pgp3KIlCpYTx8.pgp
Description: PGP signature


Reply to: