on Thu, 01 May 2003 01:48:59PM +0100, Colin Watson insinuated:
> On Wed, Apr 30, 2003 at 09:21:20PM -0400, Nori Heikkinen wrote:
> > Cool, I just upgraded to 2.53, and it seems better.  Can someone
> > explain to me how a Bayesian filter would work in a static
> > context?  Does it train itself based on the threshold you provide?
> > My impression (though this could be wrong, as I've only ever used
> > spamassassin) is that on other commonly-used Bayesian spamfilters,
> > you have to manually train it on x number of emails before it
> > learns what you consider spam and what you don't.  How does
> > spamassassin -- which is procmail-based -- train itself?
>
> You can train it by hand using sa-learn.  If you don't, it
> auto-learns based on its own scores: anything below
> auto_learn_threshold_nonspam (default -2.0) gets auto-learned as
> ham, and anything above auto_learn_threshold_spam (default 15.0)
> gets auto-learned as spam.  You see 'autolearn=ham' or
> 'autolearn=spam' in the headers when this happens, so you can
> correct it with sa-learn if need be.

cool -- i just bound a mutt macro to '|/usr/bin/sa-learn --file
--ham', and put in a cron job to sa-learn my spamfolder as such every
morning.  that should train it pretty well, and without too much
hassle!

thanks!

</nori>

-- 
    .~.      nori @ sccs.swarthmore.edu
    /V\  http://www.sccs.swarthmore.edu/~nori/jnl/
   // \\          @ maenad.net
  /(   )\       www.maenad.net
   ^`~'^
get my (*new*) key here:
   http://www.maenad.net/geek/gpg/7ede5499.asc
(please *remove* old key 11e031f1!)
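
for reference, the two-threshold rule quoted above can be sketched roughly as follows. this is only an illustration of the rule as Colin describes it, not spamassassin's actual code: the function name autolearn_decision is made up, the defaults are the ones quoted in his reply, and the real filter may apply further conditions before it learns a message.

```python
# Hypothetical sketch of the auto-learn rule described in the quoted reply.
# This is NOT SpamAssassin's implementation; names here are illustrative.

# Defaults as quoted in the reply (both are configurable settings).
AUTO_LEARN_THRESHOLD_NONSPAM = -2.0
AUTO_LEARN_THRESHOLD_SPAM = 15.0


def autolearn_decision(score,
                       nonspam_threshold=AUTO_LEARN_THRESHOLD_NONSPAM,
                       spam_threshold=AUTO_LEARN_THRESHOLD_SPAM):
    """Return 'ham', 'spam', or None (not auto-learned) for a score."""
    if score < nonspam_threshold:
        # Scored well below the ham threshold: learn as ham.
        return "ham"
    if score > spam_threshold:
        # Scored well above the spam threshold: learn as spam.
        return "spam"
    # Anything in between is left alone; train it by hand if needed.
    return None


if __name__ == "__main__":
    for s in (-3.5, 4.2, 20.0):
        print(s, autolearn_decision(s))
```

the wide gap between the two thresholds is the point: only messages the other rules are already very sure about get fed to the bayes database, and everything in the middle is left for manual training with sa-learn.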