Re: baysian filtering

To: "Monique Y. Mudama" <spam@bounceswoosh.org>
Cc: debian-user@lists.debian.org
Subject: Re: baysian filtering
From: Frédéric Dreier <frederic.dreier@uptime.ch>
Date: Wed, 05 May 2004 15:45:39 +0200
Message-id: <[🔎] 4098F003.60804@uptime.ch>
In-reply-to: <[🔎] slrnc9hq2j.hg7.spam@home.bounceswoosh.org>
References: <[🔎] 20040505064920.GA20958@comcast.net> <[🔎] 20040505065431.GB20958@comcast.net> <[🔎] slrnc9hq2j.hg7.spam@home.bounceswoosh.org>

Monique Y. Mudama wrote:

On 2004-05-05, William Ballard penned:

On Tue, May 04, 2004 at 11:49:20PM -0700, William Ballard wrote:

filter the spam?  and do Bayesian filtering myself on mail I get from
the lists?  I think I read that so far only Bayesian filtering works

and


Speaking of Baysian.  Lately I've been getting a lot of spam (to me
directly, not to the list) with a long-winded joke in the body.  The
email has a spammy subject and a few bizarre keywords in it, but mostly
it's this long-winded joke, with well-formed grammar, proper spelling,
etc.  It's even plain text; no HTML involved.  I've even caught myself
reading a few of them, then feeling rather dirty.

Anyway, I dutifully pipe them through sa-learn, but I worry.  If these
spams look so much like regular mail, won't I just end up tainting my
baysian library by teaching sa-learn with them?  I mean, eventually,
won't my baysian scheme be unable to distinguish between spam and ham?

Yes, it could be. But actualy BN does not look at an email the same wayas we do.. It willmostly trigger on the 'bizarre keyword' that appears in spams and not inregular emails.


Don't worry. BN used to do less mistakes than human :-)

Regards,

dreier.

Thoughts?

Reply to:

References:
- Massive increase of spam on debian-*@l.d.o
  - From: William Ballard <40414.nospam@comcast.net>
- Re: Massive increase of spam on debian-*@l.d.o
  - From: William Ballard <40414.nospam@comcast.net>
- baysian filtering (was: Re: Massive increase of spam on debian-*@l.d.o)
  - From: "Monique Y. Mudama" <spam@bounceswoosh.org>

Prev by Date: Re: Problems after upgrading kernel from 2.4 to 2.6
Next by Date: Re: baysian filtering (was: Re: Massive increase of spam on debian-*@l.d.o)
Previous by thread: Re: baysian filtering (was: Re: Massive increase of spam on debian-*@l.d.o)
Next by thread: Re: baysian filtering (was: Re: Massive increase of spam on debian-*@l.d.o)
Index(es):
- Date
- Thread