Fighting Spam with Spamassassin
Posted in Spam, Networking, Email on August 14, 2008 at 8:08 am
Well, after many years with zero anti-spam technologies (and manual deletion of ~ 200 items a day) I decided it was time to move my mail host and implement anti-spam technologies.
Now I already have a home SAMBA server, running Debian, which also acts as a mini desktop. I decided to use this as my mail volume isn’t huge… I get ~20 valid emails a day, ~200-500 spams depending on the day of week really.
SpamAssassin looked to be the premier anti-spam solution out there for Linux, and I selected a Debian EXIM integration. Took a while to learn exim, but I’m now mostly impressed with the configuration. I’ve used dovecot as a IMAP server. All these are the standard Debian stable packages……
Basic procedure for me was I installed the packages - then I followed this guide and got a basic system up and running… and moved a “test” domain name to point inbound SMTP at the box so I could then fully test all the options and tune the anti-spam.
Tricks the above guide missed:
Using CPAN (perl -m CPAN -e shell) to install Net::DNS. Without this vital step Spamassassin missed out on ALL DNS tests, which are quite good for scoring.
Bayesian filtering.
- Set this up to use a system wide database, in a folder you control with world read/write access. The default isn’t right.
- You may wish to increase the default size of the bayes database. I increased mine 10 times.
- It seems to require 200 spams and 200 non-spams to be learnt before its operational - at first I did not realise this. I fed Bayes a folder of 2000 spams, and let it read my (already filtered of spam) archive of personal mails as non-spam (3400 items). This trained the spam filter quite well.. I used a variation of this script
- If you run sa-learn with -D for debug it does tend to show faults in your SA config.
- Increasing score of BAYES_99 for me at least results in better results.
- I’ve set up learn as spam folders in my mailfile, which is learnt and deleted every 6 hours (ie mails making it through SA I drag to this folder).
Setting SpamAssassin up is NOT easy, and requires a lot of tinkering to get runnign as you want (hence my playing with a test domain). Once complete however, its an brilliant system in my opinion at least.
Now its up and running, only 4 spams have hit my mailbox (though I’m still storing all spam - aim is to not store very high scoring spams in future, and only store “uncertain” results. Though right now, with ~5000 spams not hitting my mailbox I’m a happy bunny.
SpamAssassin is also available as a windows version I believe. For Exchange users with nothing else it may be worth a look.
Wifi in coffee shops - is it time to die?
Posted in ADSL, 3, Wi-Fi, Networking, Mobile Phone on February 21, 2008 at 9:38 pm
I friend I met up with last week does not have Broadband at home via DSL or Cable. Nor do they use Wifi. They actually use a Vodafone 3G Datacard, providing 3Gig of data a month for a monthly fee.
All the UK carriers now I believe are doing some form of 3G datacard - but finally with decent usage limits (1 Gig being a minimal usable level per month) - and all for varying fees. The above example, in my friend, pays
ADSL and why I am happy a neighbor is moving.
Posted in ADSL, Networking on February 4, 2008 at 9:39 am
I’ve had ADSL since around 2003 - initially with a 512K service, then 2Mbit, then MAX upon release, all via Andrews and Arnold (who incidentially provide a fine service or I would have moved ages ago).
Anyhow, this post is about my experience over the past few years since I moved into my new build house in
Archives
- July 2009
- June 2009
- April 2009
- March 2009
- February 2009
- January 2009
- December 2008
- November 2008
- October 2008
- September 2008
- August 2008
- July 2008
- June 2008
- May 2008
- April 2008
- March 2008
- February 2008
- January 2008
- December 2007
- November 2007
- October 2007
- September 2007
- August 2007
- April 2007
- March 2007
- February 2007
- January 2007
- October 2006
- September 2006
- August 2006
Most commented posts
Highest Rated Blog Posts
- Debian & APT - Why I love it (100%)
- PicardTagger - most useful mp3 tool ever? (100%)
- Nokia Comes with Music - doomed to fail? (100%)
- The death of the British High Street (100%)
- Fighting Spam with Spamassassin (100%)
- iPhone 2.1 Upgrade - Genius! (100%)
- ADSL and why I am happy a neighbor is moving. (80%)
- Homebuilt NAS - one week on (80%)
- Second Life - a big waste of time? (75%)
- Day 4 of me.com/iPhone, my mini-review (73.4%)


