Koozali.org: home of the SME Server

Image base Spam emails

mercyh
« Reply #15 on: November 27, 2006, 10:30:26 PM »
Hi,

I have been watching this over on the contribs forum. Do you feel like this is ready for production?

Over the weekend, with SpamAssassin RBLs enabled and Bayes trained for 6 months, I received 8 image-based spam messages in my inbox. Another 12 hit my junkmail folder with scores between 5 and 10. I have about 60 users who don't even know what SpamAssassin is, so I would like to be sure it is working before I implement it.


Thanks for all your work.

Royce

gregswallow
« Reply #16 on: November 27, 2006, 10:57:55 PM »
> Do you feel like this is ready for production?

If it was, it would be in the smeupdates or smeupdates-testing repository.

It will be ready sooner if people test it and report bugs. Upstream now has a bug tracker as well, at http://fuzzyocr.org.

cpuffalt
« Reply #17 on: November 29, 2006, 07:10:13 AM »
John...


> Quote from: "mrjhb3"
>
> Corey,
>
> How has this been working?
>
> John


Well, it hasn't blown up on me. I did a quick grep through my junkmail folder and it scored a hit on about 5% of the emails there. Of course, that doesn't mean SpamAssassin wouldn't have caught many of those without FuzzyOCR. It seems to have helped with the epidemic, though it hasn't cured the problem, as a number of image-based spams are still getting through.
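The quick check described above could be scripted roughly as follows. This is only a minimal sketch, not the command that was actually run: it assumes each message carries SpamAssassin's `X-Spam-Status` header with a `tests=` list, and that FuzzyOCR's rule names begin with `FUZZY_OCR` (an assumption; set `rule_prefix` to whatever your installation reports). The function names are hypothetical.

```python
import re
from email.message import Message
from typing import Iterable, Set


def rules_hit(status_header: str) -> Set[str]:
    """Return the rule names listed after tests= in an X-Spam-Status value."""
    # Long headers are folded across lines; rejoin the comma-separated list.
    unfolded = re.sub(r",\s+", ",", str(status_header or ""))
    match = re.search(r"tests=(\S*)", unfolded)
    if not match:
        return set()
    return {name for name in match.group(1).split(",") if name}


def hit_fraction(messages: Iterable[Message], rule_prefix: str = "FUZZY_OCR") -> float:
    """Fraction of messages whose rule list includes a rule starting with rule_prefix."""
    total = 0
    hits = 0
    for msg in messages:
        total += 1
        names = rules_hit(msg.get("X-Spam-Status", ""))
        if any(name.startswith(rule_prefix) for name in names):
            hits += 1
    return hits / total if total else 0.0
```

Any iterable of parsed messages works, for example a standard-library `mailbox.Maildir` object opened on the junkmail folder (the path depends on the server). As noted above, a hit only shows that the rule fired, not that the message would otherwise have been missed.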

Keep in mind, though, that I'm just using this on my own personal server, so the email volumes aren't very high. I imagine FuzzyOCR would add considerable load to a busy site.

I also have some issues with the current design of FuzzyOCR. It relies on a fixed list of keywords. It's too bad it doesn't (or can't?) leverage the existing Bayesian scoring built into SpamAssassin so that it could be more adaptive.
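To make the limitation concrete, here is a toy sketch of matching OCR output against a fixed word list. It is not FuzzyOCR's code: the word list, the threshold, and the use of Python's `difflib` for approximate matching are all assumptions chosen for illustration.

```python
from difflib import SequenceMatcher
from typing import Iterable, Set

# Hypothetical fixed word list; a real list would be longer and site-specific.
KEYWORDS = ["viagra", "stock", "pharmacy"]


def keyword_hits(ocr_text: str,
                 keywords: Iterable[str] = KEYWORDS,
                 threshold: float = 0.8) -> Set[str]:
    """Return the keywords that approximately match some token of ocr_text."""
    hits = set()
    for token in ocr_text.lower().split():
        for keyword in keywords:
            # ratio() is 1.0 for identical strings and lower for partial matches.
            if SequenceMatcher(None, token, keyword).ratio() >= threshold:
                hits.add(keyword)
    return hits
```

An obfuscated spelling such as "v1agra" still matches "viagra", but text that uses words outside the list (say, "hot penny shares") scores nothing until someone edits the list by hand, which is where a trained classifier would adapt on its own.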

Anyhow, it's not a silver bullet, but if you're running a low-volume site or have a server with lots of headroom, it may be worth installing... at least until a better solution appears.

Corey