Look ma! Tables!

Monday, March 23, 2009 @ 11:03 PM Author: Péter Gyöngyösi

Sorry for the rarity of updates. My head (and days) have been so full of the last pre-release issues lately that I can’t find the time to post about anything worthy. That includes, for example, the epic fight with the JVM, which fails mysteriously every now and then if it cannot find /proc where it expects it, and fails just the same (but for a totally different reason, and on each and every run) if it can. (Kudos to our tireless tester, who kept trying out different versions of Java until he found one that produced a never-before-seen error message, which yielded one single Google hit that brought us to the solution.)

However, this is something worth mentioning. I mean, we’ve seen all kinds of spam that desperately tries to get past the ever-growing wall of Bayesian filters. Deliberate misspellings, images that get filtered, whole paragraphs of bogus sentences: we’ve seen them all. But using a colored HTML table to get the name of the Blue Pill onto our screens: wow, that’s clever.

This is what landed in my mailbox today:

And here’s the HTML that Blogger seems to mess up badly:

Do we have to start writing OCR for HTML tables?
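
For what it’s worth, the first step of such an “OCR” would not be hard. Here is a minimal sketch, assuming the spammer sets the colors with plain bgcolor attributes on the td cells; the real message may well use inline styles instead, and nested tables are not handled. The class name, the function name and the sample table are all made up for illustration.

```python
from html.parser import HTMLParser


class CellColorGrid(HTMLParser):
    """Collect the bgcolor of every <td> into a list of rows."""

    def __init__(self):
        super().__init__()
        self.rows = []

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag == "td" and self.rows:
            # Assumption: color comes from bgcolor; a missing attribute means background.
            self.rows[-1].append((dict(attrs).get("bgcolor") or "").lower())


def table_to_bitmap(html, background=("", "#ffffff", "white")):
    """Render the table as text: '#' for a colored cell, '.' otherwise."""
    parser = CellColorGrid()
    parser.feed(html)
    return "\n".join(
        "".join("." if color in background else "#" for color in row)
        for row in parser.rows
    )


# Hypothetical sample: a 3x3 table whose blue cells draw the letter "T".
sample = (
    "<table>"
    "<tr><td bgcolor='#0000FF'></td><td bgcolor='#0000FF'></td><td bgcolor='#0000FF'></td></tr>"
    "<tr><td></td><td bgcolor='#0000FF'></td><td></td></tr>"
    "<tr><td></td><td bgcolor='#0000FF'></td><td></td></tr>"
    "</table>"
)
print(table_to_bitmap(sample))
```

Turning the resulting grid of dots and hashes back into letters is the part that would actually deserve the name OCR, and that is left out here.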