Content filtering - Spam

From Wikipedia, the free encyclopedia

Content filters act either on the content (the information contained in the mail body) or on the mail headers (such as "Subject:") to classify, accept, or reject a mail. As the name implies, the filter removes unwanted text or messages, saving the receiver time. Anti-virus methods can usually be classified as content filters too since, put simply, they scan either the binary attachments of a mail or its HTML contents.
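As a rough illustration of a filter that inspects both the headers and the body, the following Python sketch parses a raw message with the standard library `email` package and returns "reject" or "accept". The two patterns and the function name `classify` are hypothetical and chosen only for this example; a real filter would use a far larger, maintained rule set.

```python
import re
from email import message_from_string

# Hypothetical rules for illustration only; not taken from any real filter.
SUBJECT_PATTERN = re.compile(r"free money|act now", re.IGNORECASE)
BODY_PATTERN = re.compile(r"click here to claim", re.IGNORECASE)


def classify(raw_mail):
    """Return "reject" if the Subject header or the body matches a rule."""
    msg = message_from_string(raw_mail)

    # Header check: look only at the "Subject:" header.
    subject = msg.get("Subject", "")
    if SUBJECT_PATTERN.search(subject):
        return "reject"

    # Body check: only simple, non-multipart text bodies are handled here.
    payload = msg.get_payload()
    body = payload if isinstance(payload, str) else ""
    if BODY_PATTERN.search(body):
        return "reject"

    return "accept"
```

For example, `classify("Subject: Meeting notes\n\nSee you at 10.")` returns "accept", because neither the header rule nor the body rule matches.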

Common Filters

Common content filters are:

  • Bayesian
  • Attachment
  • Mail header
  • Mailing List
  • HTML anomalies
  • Language
  • Heuristic
  • Regular Expression
  • Phrases
  • Proximity
  • URL
  • Content-encoding
  • Char-set
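
The list above only names the filter types. As a hedged sketch of how three of them (Phrases, URL, and Char-set) might look in practice, the Python example below runs each check against a parsed message and reports which ones matched. The blocked phrases, the blocked host, the allowed character sets, and all function names are assumptions made for this illustration, and the interpretation of each filter type is one plausible reading rather than a definition.

```python
import re
from email import message_from_string

# All values below are hypothetical examples.
BLOCKED_PHRASES = ("lowest price", "no prescription")
BLOCKED_HOSTS = {"spam.example"}
ALLOWED_CHARSETS = {"us-ascii", "utf-8", "iso-8859-1"}

# Captures the host part of http/https URLs.
URL_RE = re.compile(r"https?://([^/\s:]+)", re.IGNORECASE)


def body_text(msg):
    """Return the body of a simple, non-multipart message, else ""."""
    payload = msg.get_payload()
    return payload if isinstance(payload, str) else ""


def phrase_filter(msg):
    """Match if the body contains any blocked phrase (case-insensitive)."""
    text = body_text(msg).lower()
    return any(phrase in text for phrase in BLOCKED_PHRASES)


def url_filter(msg):
    """Match if the body links to a blocked host."""
    hosts = {host.lower() for host in URL_RE.findall(body_text(msg))}
    return bool(hosts & BLOCKED_HOSTS)


def charset_filter(msg):
    """Match if the declared character set is not on the allowed list."""
    charset = msg.get_content_charset("us-ascii")
    return charset not in ALLOWED_CHARSETS


FILTERS = {
    "Phrases": phrase_filter,
    "URL": url_filter,
    "Char-set": charset_filter,
}


def matching_filters(raw_mail):
    """Return the names of all filters that match the raw message."""
    msg = message_from_string(raw_mail)
    return [name for name, check in FILTERS.items() if check(msg)]
```

Keeping each filter as a separate function that returns a boolean makes it straightforward to add further types from the list, or to weight the results instead of treating any match as decisive.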