Re: Bayes filter at ISPs

To: Adam ENDRODI <borso@vekoll.saturnus.vein.hu>
Cc: debian-isp@lists.debian.org
Subject: Re: Bayes filter at ISPs
From: Lance Levsen <lance@catprint.ca>
Date: Thu, 19 Feb 2004 10:29:17 -0600
Message-id: <[🔎] 1077208157.7181.20.camel@dante.catprint.ca>
Reply-to: lance@catprint.ca
In-reply-to: <[🔎] 20040219120920.GB27423@ildicow.saturnus.vein.hu>
References: <[🔎] 20040219120920.GB27423@ildicow.saturnus.vein.hu>

On Thu, 2004-02-19 at 06:09, Adam ENDRODI wrote:

> I suppose many of you use Bayesian spamfilters at the ISP level.
> I'd like to ask how do you teach it to separate ham and spam
> correctly?  In particular, how do I select a representative set
> of ham and spam?  Is it a good idea to deploy bogofilter for an
> entire organization at all?

This will only help if you're users have login capabilities, but I use a
cron that calls, I don't know if this is doable w/out login shells for
the users.

for i in `ls /home/`;do  user=$(echo ${i} | awk -F/ '{print $1}'); su -
${user} -- sa-learn --spam /home/${user}/mail/spam; done;

Obviously this is for spamassassin, but there must be a learning
capability with bogofilter. It ensures that the user just has to throw
their spam in ~/mail/spam and it updates their bayes db's. Then a
standard .procmailrc in /etc/skel and all the users home dirs to check
for headers.

I find this is better then a global bayesian filter because with all of
the users, the Bayesian filter tends to useless. I do use SA w/out
bayesian filters at the top level though.

> thanks,
> adam

Cheers,
lance

-- 
Lance Levsen, Catprint Computing
Linux Systems and programming
gpg --keyserver wwwkeys.pgp.net --recv-keys 0xF2DA79C8

Attachment: signature.asc
Description: This is a digitally signed message part

Reply to:

References:
- Bayes filter at ISPs
  - From: Adam ENDRODI <borso@vekoll.saturnus.vein.hu>

Prev by Date: Re: Bayes filter at ISPs
Next by Date: Re: Bayes filter at ISPs
Previous by thread: Re: Bayes filter at ISPs
Next by thread: Re: Bayes filter at ISPs
Index(es):
- Date
- Thread