Karsten M. Self wrote:
SA has an "autolearn" feature, where mail scoring above 6, and below 0.1, will be "autolearned" as spam and ham. That is, the Baysian classifier will train on these mails.
However these only are what SA would have caught already without the Bayesian score. It discards that when autolearning to prevent a self-spiralling corruption of its database. On any given pass through d-u, d-d, d-m or d-k I can get 50-60+% messages which were not learned by SA. That's a large amount to discard.
Does he need to feed every message to SA? If he has autolearning turned on, no. Should he feed samples in regularly? Yes.
-- Steve C. Lamb | I'm your priest, I'm your shrink, I'm your PGP Key: 8B6E99C5 | main connection to the switchboard of souls. -------------------------------+---------------------------------------------
Attachment:
pgput3qwtmaoF.pgp
Description: PGP signature