MEME suite

The MEME suite is a collection of tools for the discovery and analysis of sequence motifs. It is hosted at http://meme-suite.org/.

Motif Discovery

MEME

MEME or Multiple EM for Motif Elicitation is a tool for discovering motifs in a group of related DNA or protein sequences. MEME takes as input a group of DNA or protein sequences and outputs as many motifs as requested up to a user-specified statistical confidence threshold. MEME uses statistical modeling techniques to automatically choose the best width, number of occurrences, and description for each motif.[1]

GLAM2

GLAM2 or Gapped local alignment of motifs is a tool for discovering gapped motifs in a group of DNA or protein sequences. Unlike MEME, GLAM2 does not try to find several different motifs in a single run. Instead, it performs replicates: it searches for the best possible motif multiple times. [2]

DREME

DREME or Discriminative Regular Expression Motif Elicitation is a tool for discovering motifs in large collections of sequences. DREME is computationally efficient and is therefore suitable for motif discovery on large data sets derived from ChIP-seq (Chromatin immunoprecipitation followed by sequencing) experiments. In the interest of computational efficiency, DREME finds only motifs that can be expressed in the IUPAC alphabet, which contains the standard DNA alphabet ACGT as well as eleven 'wildcard' characters (for example, R indicates either A or G). [1]
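To make the IUPAC alphabet concrete, the following minimal sketch translates an IUPAC motif into a regular expression and counts the sequences that contain a match. It is only an illustration of the alphabet, not DREME's search procedure; the function names iupac_to_regex and count_matching_sequences and the example motif are hypothetical.

```python
import re

# IUPAC nucleotide codes: the four bases plus eleven ambiguity ("wildcard") codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def iupac_to_regex(motif):
    """Translate an IUPAC motif string into a regular expression (hypothetical helper)."""
    parts = []
    for symbol in motif.upper():
        bases = IUPAC[symbol]
        # A single base stays literal; an ambiguity code becomes a character class.
        parts.append(bases if len(bases) == 1 else "[" + bases + "]")
    return "".join(parts)


def count_matching_sequences(motif, sequences):
    """Count the sequences that contain at least one match to the motif."""
    pattern = re.compile(iupac_to_regex(motif))
    return sum(1 for seq in sequences if pattern.search(seq.upper()))


if __name__ == "__main__":
    print(iupac_to_regex("RGATAA"))
    print(count_matching_sequences("RGATAA", ["TTAGATAACC", "CCGGATAAGG", "TTTTTTTT"]))
```

A discriminative method would compare such counts between a positive and a control sequence set; that comparison is omitted here.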

MEME-ChIP

MEME-ChIP is a tool for discovering motifs in data sets derived from ChIP-seq (Chromatin immunoprecipitation followed by sequencing) experiments. [3]

Motif Search

FIMO

FIMO or Find Individual Motif Occurrences is a tool for finding instances of motifs in a sequence database. FIMO searches the database for the provided motifs, and reports a q-value for each match. [4]
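The general idea of scanning a sequence for motif occurrences can be sketched as follows. This is a simplified illustration under stated assumptions, not FIMO's implementation: the motif matrix MOTIF is made up, the background is assumed uniform, only the forward strand is scanned, sequences are assumed to contain only A, C, G and T, and the conversion of scores to p-values and q-values is left out.

```python
import math

# Hypothetical 4-column motif: per-position base probabilities (illustrative values only).
MOTIF = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.1, "T": 0.7},
]
BACKGROUND = 0.25  # assumed uniform background frequency for every base


def log_odds(window):
    """Sum of per-position log2(motif probability / background probability)."""
    return sum(math.log2(col[base] / BACKGROUND) for col, base in zip(MOTIF, window))


def scan(sequence, threshold):
    """Return (start, score) for every window scoring at least the threshold."""
    width = len(MOTIF)
    hits = []
    for start in range(len(sequence) - width + 1):
        score = log_odds(sequence[start:start + width])
        if score >= threshold:
            hits.append((start, score))
    return hits


if __name__ == "__main__":
    for start, score in scan("TTAGCTTT", threshold=4.0):
        print(start, round(score, 2))
```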

GLAM2SCAN

GLAM2SCAN is a tool for finding occurrences of a GLAM2 motif in a sequence database. [5]

MAST

MAST or Motif Alignment & Search Tool is a tool for searching biological sequence databases for sequences that contain an occurrence of each motif in a given set of motifs. MAST scores the matches and reports p-values for four types of events.

Motif Enrichment Analysis

SpaMo

SpaMo or Spaced Motif Analysis Tool is a tool for inferring interactions between transcription factors. SpaMo takes a set of sequences (typically sequences surrounding ChIP-seq peaks), a motif represented in these sequences, and a database of known motifs. SpaMo searches the database for instances of database motifs enriched in sites neighboring the given motif. These enrichments suggest physical interaction between the factors that bind each motif. [6]
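The core bookkeeping behind this kind of analysis, tallying how far secondary motif sites lie from the primary motif site, can be sketched as below. This is an assumption-laden illustration rather than SpaMo's method: the input layout (one primary site per sequence, a list of secondary sites per sequence) and the function name spacing_counts are hypothetical, and no significance test is performed.

```python
from collections import Counter


def spacing_counts(primary_sites, secondary_sites):
    """Tally signed distances from the primary site to each secondary site.

    primary_sites:   dict mapping sequence id -> position of the primary motif site
    secondary_sites: dict mapping sequence id -> list of secondary motif site positions
    """
    counts = Counter()
    for seq_id, primary_pos in primary_sites.items():
        for secondary_pos in secondary_sites.get(seq_id, []):
            counts[secondary_pos - primary_pos] += 1
    return counts


if __name__ == "__main__":
    primary = {"s1": 50, "s2": 40, "s3": 60}
    secondary = {"s1": [62], "s2": [52, 10], "s3": [72]}
    # A spacing that recurs across many sequences is a candidate for enrichment.
    print(spacing_counts(primary, secondary).most_common(1))
```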

CentriMo

CentriMo or Central Motif Enrichment Analysis is a tool for inferring direct DNA binding from ChIP-seq data. CentriMo is based on the observation that the positional distribution of binding sites matching the direct-binding motif tends to be unimodal, well centered and maximal in the precise center of the ChIP-seq peak regions. CentriMo takes a set of sequences and plots the occurrence of motifs relative to the ChIP-seq peak. Motifs that occur exclusively at the peak provide good evidence of direct binding, while motifs that do not occur in a consistent position relative to the peak may not bind directly. [7]
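The notion of central enrichment can be illustrated with a small sketch that measures what fraction of motif sites fall inside a window around the sequence center. It assumes all sequences have the same length and are centered on the ChIP-seq peak; the function name central_fraction and its parameters are hypothetical, and the statistical test that a real analysis needs is not included.

```python
def central_fraction(site_positions, seq_length, motif_width, window):
    """Fraction of motif sites whose midpoint lies within a central window.

    site_positions: start positions of motif sites, one sequence coordinate each
    seq_length:     common length of the peak-centered sequences (assumption)
    motif_width:    width of the motif, used to locate each site's midpoint
    window:         total width of the central region
    """
    if not site_positions:
        raise ValueError("no motif sites given")
    center = seq_length / 2
    offsets = [pos + motif_width / 2 - center for pos in site_positions]
    inside = sum(1 for off in offsets if abs(off) <= window / 2)
    return inside / len(offsets)


if __name__ == "__main__":
    # Three of the five sites sit close to the center of 100 bp sequences.
    print(central_fraction([46, 48, 45, 10, 90], seq_length=100, motif_width=8, window=20))
```

A fraction well above what a uniform distribution of sites would give hints at central enrichment.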

Motif cluster search

MCAST

MCAST or Motif Cluster Alignment and Search Tool is a tool for searching a sequence database for statistically significant clusters of non-overlapping occurrences of a set of motifs. Such clusters may represent regulatory modules.
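What a cluster of non-overlapping motif occurrences looks like can be sketched with a simple greedy grouping. This is a swapped-in illustration, not MCAST's algorithm: it does not score clusters or estimate their significance, and the function name cluster_hits and the max_gap parameter are hypothetical.

```python
def cluster_hits(hits, max_gap):
    """Group motif hits into clusters of non-overlapping occurrences.

    hits:    list of (start, end) half-open intervals on one sequence
    max_gap: largest allowed distance between consecutive hits in a cluster
    """
    clusters = []
    last_end = None
    for start, end in sorted(hits):
        if last_end is not None and start < last_end:
            continue  # overlaps the previously kept hit, so skip it
        if clusters and start - last_end <= max_gap:
            clusters[-1].append((start, end))
        else:
            clusters.append([(start, end)])
        last_end = end
    return clusters


if __name__ == "__main__":
    hits = [(0, 8), (5, 13), (20, 28), (200, 208), (215, 223)]
    for cluster in cluster_hits(hits, max_gap=50):
        print(cluster)
```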

Motif comparison

Tomtom

Tomtom is a tool for comparing a DNA motif to a database of known motifs. Tomtom searches the database for motifs that are statistically significantly similar to the query motif. It is useful for determining whether a discovered motif is novel or is a variation of a known motif.
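The idea of comparing two motifs column by column can be sketched as follows. This is a minimal illustration under assumptions, not the tool's actual procedure: it uses Euclidean distance between aligned columns as a simple stand-in measure, considers only ungapped alignments in which the query fits entirely inside the target, and computes no significance. The function names column_distance and best_alignment are hypothetical.

```python
import math


def column_distance(col_a, col_b):
    """Euclidean distance between two columns of base probabilities."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(col_a, col_b)))


def best_alignment(query, target):
    """Slide the query along the target and return (offset, mean column distance).

    Motifs are lists of columns; each column lists probabilities for A, C, G, T.
    Returns None when the query is longer than the target.
    """
    best = None
    for offset in range(len(target) - len(query) + 1):
        total = sum(column_distance(q, target[offset + i]) for i, q in enumerate(query))
        mean = total / len(query)
        if best is None or mean < best[1]:
            best = (offset, mean)
    return best


if __name__ == "__main__":
    uniform = [0.25, 0.25, 0.25, 0.25]
    query = [[1, 0, 0, 0], [0, 1, 0, 0]]
    target = [uniform, [1, 0, 0, 0], [0, 1, 0, 0], uniform]
    print(best_alignment(query, target))
```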

Motif function analysis

GOMO

GOMO or Gene Ontology for MOtifs is a tool for identifying possible roles for DNA binding motifs. It does so by comparing the genes that the motif occurs upstream of against a Gene Ontology database. If the motif occurs upstream of genes related to a particular function (for example, lactose digestion) statistically significantly more often than expected, this suggests that the transcription factor that binds the motif may regulate that function (for example, by promoting transcription of proteins that digest lactose).

References

  1. Timothy L. Bailey, "DREME: Motif discovery in transcription factor ChIP-seq data", Bioinformatics, 27(12):1653-1659, 2011.
  2. MC Frith, NFW Saunders, B Kobe, TL Bailey, "Discovering sequence motifs with arbitrary insertions and deletions", PLoS Computational Biology, 4(5):e1000071, 2008.
  3. Philip Machanick and Timothy L. Bailey, "MEME-ChIP: motif analysis of large DNA datasets", Bioinformatics, 27(12):1696-1697, 2011.
  4. Charles E. Grant, Timothy L. Bailey, and William Stafford Noble, "FIMO: Scanning for occurrences of a given motif", Bioinformatics, 27(7):1017-1018, 2011.
  5. MC Frith, NFW Saunders, B Kobe, TL Bailey, "Discovering sequence motifs with arbitrary insertions and deletions", PLoS Computational Biology, 4(5):e1000071, 2008.
  6. T Whitington, MC Frith, J Johnson, TL Bailey, "Inferring transcription factor complexes from ChIP-seq data", Nucleic Acids Research, 39(15):e98, 2011.
  7. TL Bailey, P Machanick, "Inferring direct DNA binding from ChIP-seq", Nucleic Acids Research, 40(17):e128, 2012.