Re: Promoting your website with bulk-email

To: debian-curiosa <debian-curiosa@lists.debian.org>
Subject: Re: Promoting your website with bulk-email
From: Benoît SIBAUD <benoit.sibaud@rd.francetelecom.com>
Date: Thu, 03 Jul 2003 09:13:47 +0200
Message-id: <[🔎] 3F03D7AB.9000603@rd.francetelecom.com>
In-reply-to: <[🔎] 20030702145808.GA19716@chihiro.cern.ch>
References: <20021014084516.7B88A1F60F@murphy.debian.org> <20021014171148.3C5016824@debby.onfirenet> <20021015052546.GA496@ysabell.wh.vaih> <20021028112017.GB30767@chiara.elte.hu> <20021028165801.GA2725@eiv.com> <1035825017.13654.40.camel@bohr> <20021029100203.GC8165@chiara.elte.hu> <20021029183506.GA26055@sirena.org.uk> <[🔎] 20030702145808.GA19716@chihiro.cern.ch>

Hi,

Another possibility: in my experience bogofilter seems to work
better when it has seen very much more non-spam than spam
e-mail.  As I recall your data set was about evenly split
between the two.

My own data about Bogofilter (part of a French text available athttp://oumph.free.fr/textes/penibles_du_net.html#pourriel , written inJune):


* initial learning

my email boxes for 21 month:
40839 ham
 2231 spam (5%)
-----
43060 mails (more than 300 MiB)

17.6 MiB goodlist.db (ham database)
 3.0 MiB spamlist.db (spam database)

* 22 days later:

 559 new spams (19%)
       90 false negative
      469 detected spam

  49 virus/worm/trojan (1,7%) [*]

2327 ham (79%)
        0 false positive
     2327 detected ham
----
2935 mails

19.0 MiB goodlist.db
 3.9 MiB spamlist.db

procmail+bogofilter looks good: good success rate and no (or few) falsepositives.


[*] mainly a rule to detect PE executables in attachments

--
Benoît Sibaud

Reply to:

References:
- Re: Promoting your website with bulk-email
  - From: KELEMEN Peter <fuji@debian.org>

Prev by Date: Do you know that guy?
Next by Date: Re: Do you know that guy?
Previous by thread: Re: Promoting your website with bulk-email
Next by thread: RE: Promoting your website with bulk-email
Index(es):
- Date
- Thread