<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments on: Meta-Blog</title>
	<atom:link href="http://www.andrewferrier.com/blog/2006/10/30/meta-blog/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.andrewferrier.com/blog/2006/10/30/meta-blog/</link>
	<description>Economics; Travel; Film; and Technology.</description>
	<pubDate>Tue, 06 Jan 2009 03:22:41 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Andrew Ferrier&#8217;s Blog &#187; Blog Archive &#187; Spam and OCR</title>
		<link>http://www.andrewferrier.com/blog/2006/10/30/meta-blog/comment-page-1/#comment-3975</link>
		<dc:creator>Andrew Ferrier&#8217;s Blog &#187; Blog Archive &#187; Spam and OCR</dc:creator>
		<pubDate>Fri, 10 Nov 2006 14:13:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.new-destiny.co.uk/andrew/blog/2006/10/30/meta-blog/#comment-3975</guid>
		<description>[...] As I&#8217;ve mentioned before, I have a huge spam problem on my personal e-mail account (~4,000/week) - due to a combination of bad luck and some foolish naivety at a few points - and so I have a fairly highly-tuned SpamAssassin installation running at home, with plenty of custom rules and plugins. I&#8217;ve seen a rising amount of image spam on it, so I decided to give FuzzyOcr, a plugin for SpamAssassin, a try. So far, the results are pretty impressive. FuzzyOcr uses the open-source gocr program as the engine, and ties it in with SpamAssassin and some logic. The OCR is fairly CPU-intensive, so unlike most SpamAssassin plugins, it only kicks in if the message is otherwise going to be below a certain scoring threshold. So far it has roughly halved the volume of spam that slips through into my inbox (previously ~40-50/day), which is a welcome improvement. [...]</description>
		<content:encoded><![CDATA[<p>[...] As I&#8217;ve mentioned before, I have a huge spam problem on my personal e-mail account (~4,000/week) - due to a combination of bad luck and some foolish naivety at a few points - and so I have a fairly highly-tuned SpamAssassin installation running at home, with plenty of custom rules and plugins. I&#8217;ve seen a rising amount of image spam on it, so I decided to give FuzzyOcr, a plugin for SpamAssassin, a try. So far, the results are pretty impressive. FuzzyOcr uses the open-source gocr program as the engine, and ties it in with SpamAssassin and some logic. The OCR is fairly CPU-intensive, so unlike most SpamAssassin plugins, it only kicks in if the message is otherwise going to be below a certain scoring threshold. So far it has roughly halved the volume of spam that slips through into my inbox (previously ~40-50/day), which is a welcome improvement. [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>
