Spamassassin + FuzzyOCR (not really a debian way)

What do we need:

  • Debian Sarge
  • Spamassassin (CPAN up-to-date)
  • luck :)

Getting started

upgrade your Spamassassin (non debian / CPAN way)

 perl -MCPAN -e shell
 cpan> install Mail:Spamassassin

keep your finger crossed

get some graphic manipulation packages from debian-sarge-backports

 # apt-get install -t sarge-backports install gocr libungif-bin \
   libimage-exif-perl libimage-exiftool-perl libstring-approx-perl imagemagick netpbm

The Fuzzy magic

 # cd /usr/src/
 # wget http://users.own-hero.net/~decoder/fuzzyocr/fuzzyocr-2.3b.tar.gz
 # tar xzvf fuzzyocr-2.3b.tar.gz
 # wget http://www200.pair.com/mecham/spam/fuzzyocr-23b-hashdb-poison.patch
 # cd FuzzyOcr-2.3b
 # patch FuzzyOcr.pm < ../fuzzyocr-23b-hashdb-poison.patch
 # cp FuzzyOcr.pm /usr/share/perl5/Mail/SpamAssassin/Plugin/
 # cp FuzzyOcr.cf /etc/spamassassin/
 # cp FuzzyOcr.words.sample /etc/spamassassin/FuzzyOcr.words

Configuration

 # echo "loadplugin FuzzyOcr /usr/share/perl5/Mail/SpamAssassin/Plugin/FuzzyOcr.pm"\
   >> /etc/spamassassin/v310.pre
 # sed -i "s/^loadplugin FuzzyOcr FuzzyOcr.pm/#loadplugin FuzzyOcr FuzzyOcr.pm/"\
   /etc/spamassassin/FuzzyOcr.cf
 # sed -i "s/^#focr_base_score\ 4/focr_base_score\ 2/" /etc/spamassassin/FuzzyOcr.cf
  

For SpamAssassin less than 3.1.4:

  # sed -i "s/^focr_pre314 0.0/focr_pre314 1.0/" /etc/spamassassin/FuzzyOcr.cf

Fingers still crossed

Verify Spamassassin config:

  # spamassassin --lint

See also: Fighting image spam on our Debian spamfilter with FuzzyOcr and ImageInfo plugins

spamassassin/fuzzyocr.txt · Last modified: 2009/05/25 00:35 (external edit)
CC Attribution-Share Alike 4.0 International
Driven by DokuWiki Recent changes RSS feed Valid CSS Valid XHTML 1.0 ipv6 ready