IBM dissects the DNA of spam

Feng shui and genetics fight junk mail

IBM is applying ideas developed in sequencing DNA molecules to the detection of spam. Spammers have taken to inserting streams of gobbledegook or deliberately misspelling words in their spam messages in order to throw off anti-spam filters that rely on Bayesian statistical analysis alone.

In response, IBM is developing more sophisticated anti-spam filters. Boffins at Big Blue hit on the idea that programs used to look for recurring patterns in DNA sequences could be applied to find recurring phrases that often feature in junk mail missives. IBM developed a program called Chung-Kwei (named after a feng-shui talisman that protects homeowners against evil spirits) and trained it to spot repeated patterns in spam messages. It then fed a series of legitimate messages through the program in order to eliminate any patterns that were common to both spam and 'ham' (legitimate) messages.
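The train-then-prune idea described above can be illustrated with a minimal Python sketch. It is not IBM's implementation: the article does not say how Chung-Kwei discovers its patterns, so fixed-length character n-grams are swapped in as a simple stand-in, and the function names, the `n` and `min_support` parameters, and the sample messages are all hypothetical.

```python
from collections import Counter


def ngrams(text, n=6):
    """Return the set of character n-grams in a lowercased message."""
    text = text.lower()
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def learn_patterns(spam, ham, n=6, min_support=2):
    """Keep n-grams that recur across spam and never appear in ham.

    Assumption: a pattern "recurs" if it occurs in at least
    min_support distinct spam messages.
    """
    counts = Counter()
    for msg in spam:
        counts.update(ngrams(msg, n))  # each message counts once per n-gram

    ham_grams = set()
    for msg in ham:
        ham_grams |= ngrams(msg, n)

    # Drop anything shared with legitimate mail, as the article describes.
    return {g for g, c in counts.items()
            if c >= min_support and g not in ham_grams}


def spam_score(message, patterns, n=6):
    """Count how many learned spam-only patterns occur in the message."""
    return len(ngrams(message, n) & patterns)


# Made-up training data, for illustration only.
spam = ["Buy cheap pills now", "cheap pills for you today"]
ham = ["Meeting moved to noon today", "Lunch for you and me"]

patterns = learn_patterns(spam, ham)
suspect = spam_score("cheap pills here", patterns)
clean = spam_score("See you at the meeting", patterns)
```

A message would then be flagged when its score crosses some threshold. Choosing that threshold, and handling the deliberate misspellings spammers use, is where a real filter would need far more care than this sketch takes.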

New Scientist reports that the approach detects nearly 97 per cent of spam messages and has a far lower rate of false positives than conventional techniques (less than one in 1,000). IBM is using the filtering techniques, alongside a variety of other approaches, in developing an anti-spam product called SpamGuru. SpamGuru is shipping as a technology preview in Lotus Workplace 2.0, the next version of IBM's messaging and collaboration application. ®

Related stories

ISPs gang up on spammer-run websites
Spamming for Dummies
Spam poetry: transcending the junk mail paradigm
IBM brings Instant Messaging to Lotus Notes
Lotus Domino goes spam busting
