Sequence logo

From Wikipedia, the free encyclopedia

A sequence logo in bioinformatics is a graphical representation of the sequence conservation of nucleotides (in a strand of DNA/RNA) or amino acids (in protein sequences).[1]

To create a sequence logo, related DNA, RNA or protein sequences, or DNA sequences that share conserved binding sites, are first aligned so that the most conserved parts line up. The logo is then built from this multiple sequence alignment. It shows how well residues are conserved at each position: the fewer distinct residues occur at a position, the better the conservation there and the taller the letters are drawn. Different residues at the same position are scaled according to their frequency. Sequence logos can be used to represent conserved DNA binding sites where transcription factors bind.
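The scaling described above can be illustrated with a short Python sketch. It assumes the common information-content formulation, in which the total height of a column is the maximum possible entropy minus the observed Shannon entropy (in bits), and each letter's height is its frequency times that total. The function name `logo_heights`, the example sequences, and the omission of any small-sample correction are assumptions made for illustration; this is not the implementation of any particular logo tool.

```python
import math
from collections import Counter


def logo_heights(sequences, alphabet="ACGT"):
    """Return one dict per alignment column, mapping letter -> height in bits.

    Illustrative sketch: assumes gap-free, equal-length aligned sequences
    that use only letters from `alphabet`, and applies no small-sample
    correction.
    """
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("all aligned sequences must have the same length")

    n = len(sequences)
    max_bits = math.log2(len(alphabet))  # 2 bits for DNA/RNA

    columns = []
    for i in range(length):
        counts = Counter(s[i] for s in sequences)
        freqs = {a: counts[a] / n for a in alphabet if counts[a] > 0}
        # Shannon entropy of the observed residues at this position
        entropy = -sum(f * math.log2(f) for f in freqs.values())
        # Conservation: high when few distinct residues are observed
        info = max_bits - entropy
        # Each letter is scaled by its frequency within the column
        columns.append({a: f * info for a, f in freqs.items()})
    return columns


if __name__ == "__main__":
    # Hypothetical aligned sequences, for illustration only
    aligned = ["ATGC", "ATGA", "ATGG", "ACGT"]
    for position, heights in enumerate(logo_heights(aligned), start=1):
        print(position, {a: round(h, 3) for a, h in heights.items()})
```

In this sketch a fully conserved DNA column yields a single letter two bits tall, while a column in which all four bases are equally frequent has zero height, matching the description that better conservation produces taller letters.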

A sequence logo showing the most conserved bases around the initiation codon from all human mRNAs

References

  1. Schneider TD, Stephens RM (1990). "Sequence Logos: A New Way to Display Consensus Sequences". Nucleic Acids Res 18 (20): 6097–6100. PMID 2172928.
