octo's Bayesian mail filter - obmf

I know there are a lot of spam filters out there, and most of the Bayesian ones will work better than this one. There are also lots of people who get a lot more spam than I do. But my interest in pattern matching and recognition was reason enough for me to try to code one myself.

Features

obmf is written in Perl, chosen for rapid development, the ability to run (almost?) everywhere Perl does, great string handling and personal preference. If you don't like Perl because you have seen a lot of bad examples, let me assure you that I have taken care to keep the code readable, well documented and easy to understand. It does not use any Perl modules that don't come with the Perl distribution, so you don't have to mess with your setup.

So the real feature is probably how easy obmf is to customize: anyone with some basic knowledge of Perl should be able to make it do whatever (s)he wants. Other features are that obmf ignores non-text parts of a mail, understands multipart messages and saves each mail's Message-ID so that no mail is examined twice.

Sample configurations for mutt and procmail are also included. Anyone using other systems is welcome to send in the config for his/her favorite mail program.
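To give an idea of what the Message-ID bookkeeping mentioned above could look like, here is a minimal sketch using only core Perl. It is not taken from obmf's source: the names extract_message_id, first_sighting and %seen_ids are made up for illustration. The in-memory hash is also an assumption; a real filter would have to persist the ids between runs. Mails without a Message-ID are simply treated as new here, which is a design choice of this sketch.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not obmf's actual code): return the Message-ID
# found in the header block of a raw mail, or nothing if there is none.
sub extract_message_id
{
    my ($raw_mail) = @_;

    # The header block ends at the first empty line; ignore the body.
    my ($headers) = split (/\r?\n\r?\n/, $raw_mail, 2);
    return unless defined $headers;

    # Unfold continuation lines (a line break followed by whitespace).
    $headers =~ s/\r?\n[ \t]+/ /g;

    return $1 if $headers =~ m/^Message-ID:\s*(<[^>]+>)/mi;
    return;
}

# Assumption: ids are kept in memory only. A real filter would store
# them on disk so they survive between runs.
my %seen_ids;

# True the first time an id is offered, false on every later call.
# Mails without an id cannot be deduplicated and always count as new.
sub first_sighting
{
    my ($id) = @_;
    return 1 unless defined $id;
    return !$seen_ids{$id}++;
}

# Usage: the same mail is offered twice, the second time it is skipped.
my $mail = join ("\n",
    'From: sender@example.org',
    'Message-ID:',
    ' <1234@example.org>',
    'Subject: test',
    '',
    'body text',
    '');

for my $attempt (1, 2)
{
    my $id = extract_message_id ($mail);
    print first_sighting ($id) ? "examine\n" : "skip\n";
}
```

Only the header block is searched, so a "Message-ID:" line quoted inside the body of a mail does not confuse the lookup.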

Download

Just download one of the files linked below, extract it and read the README.

Useful links

Interesting papers

Other, similar programs

Written by Florian octo Forster