Danny Sauer(piclist
KILLspamdannysauer.com) is reported to have said:
{Quote hidden}> Russell wrote regarding '[OT] Filtering new style of SPAM' on Fri, Nov 11 at 01:27:
> > I recently mentioned html based spam which is getting through my ISP's
> > otherwise excellent filters.
> > I have a simple method of trapping it.
> >
> > They code the words in html strings so that the text appears in
> > vertical columns spread amongst n strings. There is a space between
> > each letter. This is rebuilt on reception.
>
> Speaking of spam filtering:
>
> I've switched from SpamAssassin to DSpam recently, and it works
> somehat like that - building up a list of tokens likely to be in spam
> and ham. It's sort of like a Bayesian filter, but it's more
> intelligent than most implementations I'm familiar with (it's not a
> 100% traditional byesian filter). For example, since this message
> came from a mailing list that I don't typically get spam from, this
> messages was marked as having a 1 in 110 probability of being spam -
> which means that there's a good chance it's spam, but it's probably
> not. This is in contrast to the other poster whose Bayesian filter
> improperly marked it as spam. :) Over the last week and a half, I've
> run about 7500 messages through my dspam install and it's had a
> 99.818% accuracy rate classifying spam and ham. Granted, I trained it
> with 18K spams and 28K hams from my personal mail archive (doesn't
> everyone keep 6 months worth of spam?), but I've heard reports of
> accuracy in the 80% range out of box. It can be set up as a proxy,
> but might be a little difficult to run on a windows box without cygwin
> - so it's not for everyone. The benefits are considerable, in that
> it's easy to keep up-to-date with "modern" spam and it's pretty
> scalable. The mail server that runs on handles something like
> 5000-8000 messages/day and it's just a 133MHz AMD 5x86 with 48MB RAM.
> The system never sees load averages over 10%, and stays below 1% most
> of the time (it was usually in the 10-15% range running SA). :)
>
I used to use spamassassin but found, in my mind, a better solution.
Rather then download the spam to have SA scan it, I switched to
mailfilter. Using my 'spam library', it took a few days to get it to
drop 99% of the spam I was receiving. My logs showed >100 spam
messages per day back then. I now have to tweek once a month or so
and average 1-2 spam msgs/week. Life in now much easier.
<http://mailfilter.sourceforge.net>
and a new one that has even more filtering options
<http://murx.sourceforge.net>
Wayne
--
Pascal Users:
To show respect for the 313th anniversary (tomorrow) of the
death of Blaise Pascal, your programs will be run at half speed.
_______________________________________________________