
Re: Spam cleaning in list archives: one last effort needed



Christian Perrier wrote:
> If you're interested in statistics, you can look [2] to learn that
> "our" list is by far the one that got most cleaning.

Here's a little overview of the lists with the most removed posts, 
including the ratio of deleted spam over reviewed posts.

As you can see, we have a _very_ high ratio, so our scanning process has 
been very effective. Other lists have had much more ham reported as spam, 
which shows how important the review stage is.

A few other lists have a similarly high ratio, but none of them comes 
anywhere near the volume that we've processed.

debian-boot 4733 (96%)
debian-www 4290 (84%)
debian-devel 2907 (69%)
debian-user 2217 (25%)
debian-user-german 1744 (10%)
debian-project 1028 (73%)
debian-user-spanish 1024 (13%)
debian-qa 766 (66%)
debian-newmaint 680 (85%)
debian-release 625 (75%)
debian-x 501 (71%)
debian-vote 470 (79%)
debian-chinese-gb 470 (33%)
debian-java 463 (69%)
debian-mentors 442 (29%)
debian-user-portuguese 424 (30%)
debian-apache 412 (16%)
debian-legal 362 (69%)
debian-laptop 323 (57%)
cdwrite 296 (27%)

Our ratios of deleted spam over nominated posts and over considered posts 
are also quite high (50% and 86% respectively) when compared to other lists.
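
For anyone who wants to play with the numbers: assuming the percentage in 
the list above is simply removed posts divided by reviewed posts, the 
number of reviewed posts can be estimated back from the two columns. This 
is only a rough sketch; the percentages are rounded, so the results are 
approximate, and the names used (removed, estimate_reviewed) are just 
illustrative.

```python
# Figures copied from the overview above:
# list name -> (removed posts, deleted spam over reviewed posts in %)
removed = {
    "debian-boot": (4733, 96),
    "debian-www": (4290, 84),
    "debian-devel": (2907, 69),
    "debian-user": (2217, 25),
}


def estimate_reviewed(removed_count, ratio_percent):
    """Approximate the reviewed posts, assuming ratio = removed / reviewed."""
    return round(removed_count * 100 / ratio_percent)


# Estimated number of reviewed posts per list (approximate, see above).
estimates = {
    name: estimate_reviewed(count, pct) for name, (count, pct) in removed.items()
}

for name, reviewed in estimates.items():
    print(f"{name:15s} ~{reviewed} reviewed")
```

A low percentage combined with a high removed count (debian-user, for 
example) means a lot of posts had to be reviewed for each spam actually 
deleted.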

Cheers,
FJP

