on Tue, Jan 13, 2004 at 01:03:30AM -0700, Lucas Albers (albersl@cs.montana.edu) wrote:
> I keep getting spam on the list that is completelly foreign.
>
> SA scores it as this in regards the foreign langauge component:
> 1.5 BODY_8BITS BODY: Body includes 8 consecutive 8-bit characters
> 2.8 UNWANTED_LANGUAGE_BODY BODY: Message written in an undesired language
> 3.2 CHARSET_FARAWAY BODY: Character set indicates a foreign language
> 3.2 CHARSET_FARAWAY_HEADER A foreign language charset used in headers
> 2.5 MIME_CHARSET_FARAWAY MIME character set indicates foreign language
>
>
> Can we tune the sa rules for this list to reject completelly
> non-english email? Or can it be assumed that people will be posting
> non-english email to this list. Not that I can read them. At the
> very least can we add in the SA english component at perhap these
> score levesl?
There's a local procmail rule I use to catch what passes by SA.
http://linuxmafia.com/~karsten/Download/chinese-charset
Works pretty well. Tunable by proportion of characters in a mail are
outside a standard Western / English charset.
Peace.
--
Karsten M. Self <kmself@ix.netcom.com> http://kmself.home.netcom.com/
What Part of "Gestalt" don't you understand?
Backgrounder on the Caldera/SCO vs. IBM and Linux dispute.
http://sco.iwethey.org/
Attachment:
signature.asc
Description: Digital signature