[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: reject non-english mail as spam?



on Tue, Jan 13, 2004 at 01:03:30AM -0700, Lucas Albers (albersl@cs.montana.edu) wrote:
> I keep getting spam on the list that is completelly foreign.
> 
> SA scores it as this in regards the foreign langauge component:
> 1.5 BODY_8BITS BODY: Body includes 8 consecutive 8-bit characters
>  2.8 UNWANTED_LANGUAGE_BODY BODY: Message written in an undesired language
>  3.2 CHARSET_FARAWAY BODY: Character set indicates a foreign language
>  3.2 CHARSET_FARAWAY_HEADER A foreign language charset used in headers
>  2.5 MIME_CHARSET_FARAWAY   MIME character set indicates foreign language
> 
> 
> Can we tune the sa rules for this list to reject completelly
> non-english email?  Or can it be assumed that people will be posting
> non-english email to this list.  Not that I can read them.  At the
> very least can we add in the SA english component at perhap these
> score levesl?


There's a local procmail rule I use to catch what passes by SA.

    http://linuxmafia.com/~karsten/Download/chinese-charset


Works pretty well.  Tunable by proportion of characters in a mail are
outside a standard Western / English charset.


Peace.

-- 
Karsten M. Self <kmself@ix.netcom.com>        http://kmself.home.netcom.com/
 What Part of "Gestalt" don't you understand?
  Backgrounder on the Caldera/SCO vs. IBM and Linux dispute.
      http://sco.iwethey.org/

Attachment: signature.asc
Description: Digital signature


Reply to: