ChIP-on-chip

Contents

ChIP-on-chip (also known as ChIP-chip) is a technique that combines chromatin immunoprecipitation ("ChIP") with microarray technology ("chip"). Like regular ChIP, ChIP-on-chip is used to investigate interactions between proteins and DNA in vivo. Specifically, it allows the identification of the cistrome, sum of binding sites, for DNA-binding proteins on a genome-wide basis.[1] Whole-genome analysis can be performed to determine the locations of binding sites for almost any protein of interest.[1] As the name of the technique suggests, such proteins are generally those operating in the context of chromatin. The most prominent representatives of this class are transcription factors, replication-related proteins, like ORC, histones, their variants, and histone modifications. The goal of ChIP-on-chip is to localize protein binding sites that may help identify functional elements in the genome. For example, in the case of a transcription factor as a protein of interest, one can determine its transcription factor binding sites throughout the genome. Other proteins allow the identification of promoter regions, enhancers, repressors and silencing elements, insulators, boundary elements, and sequences that control DNA replication.[2] If histones are subject of interest, it is believed that the distribution of modifications and their localizations may offer new insights into the mechanisms of regulation. One of the long-term goals ChIP-on-chip was designed for is to establish a catalogue of (selected) organisms that lists all protein-DNA interactions under various physiological conditions. This knowledge would ultimately help in the understanding of the machinery behind gene regulation, cell proliferation, and disease progression. Hence, ChIP-on-chip offers not only huge potential to complement our knowledge about the orchestration of the genome on the nucleotide level, but also on higher levels of information and regulation as it is propagated by research on epigenetics.

Technological platforms

The technical platforms to conduct ChIP-on-chip experiments are DNA microarrays, or "chips". They can be classified and distinguished according to various characteristics:

Besides the actual microarray, other hard- and software equipment is necessary to run ChIP-on-chip experiments. It is generally the case that one company’s microarrays can not be analyzed by another company’s processing hardware. Hence, buying an array requires also buying the associated workflow equipment. The most important elements are, among others, hybridization ovens, chip scanners, and software packages for subsequent numerical analysis of the raw data.

Workflow of a ChIP-on-chip experiment

Starting with a biological question, a ChIP-on-chip experiment can be divided into three major steps: The first is to set up and design the experiment by selecting the appropriate array and probe type. Second, the actual experiment is performed in the wet-lab. Last, during the dry-lab portion of the cycle, gathered data are analyzed to either answer the initial question or lead to new questions so that the cycle can start again.

Wet-lab portion of the workflow

Dry-lab portion of the workflow

  1. During the first step, the captured fluorescence signals from the array are normalized, using control signals derived from the same or a second chip. Such control signals tell which probes on the array were hybridized correctly and which bound nonspecifically.
  2. In the second step, numerical and statistical tests are applied to control data and IP fraction data to identify POI-enriched regions along the genome. The following three methods are used widely: Median percentile rank, Single-array error, and Sliding-window. These methods generally differ in a way how low-intensity signals are handled, how much background noise is accepted, and which trait for the data is emphasized during the computation. In the recent past, the sliding-window approach seems to be favored and is often described as most powerful.
  3. In the third step, these regions are analyzed further. If, for example, the POI was a transcription factor, such regions would represent its binding sites. Subsequent analysis then may want to infer nucleotide motifs and other patterns to allow functional annotation of the genome.

Strengths and Weaknesses

Using tiled arrays, ChIP-on-chip allows for high resolution of genome-wide maps. These maps can determine the binding sites of many DNA-binding proteins like transcription factors and also chromatin modifications.

Although ChIP-on-chip can be a powerful technique in the area of genomics, it is very expensive. Most published studies using ChIP-on-chip repeat their experiments at least three times to ensure biologically meaningful maps. The cost of the DNA microarrays is often a limiting factor to whether a laboratory should proceed with a ChIP-on-chip experiment. Another limitation is the size of DNA fragments that can be achieved. Most ChIP-on-chip protocols utilize sonication as a method of breaking up DNA into small pieces. However, sonication is limited to a minimal fragment size of 200 bp. For higher resolution maps, this limitation should be overcome to achieve smaller fragments, preferably to single nucleosome resolution. As mentioned previously, the statistical analysis of the huge amount of data generated from arrays is a challenge and normalization procedures should aim to minimize artifacts and determine what is really biologically significant. So far, application to mammalian genomes has been a major limitation, for example, due to a significant percentage of the genome that is occupied by repeats. However, as ChIP-on-chip technology advances, high resolution whole mammalian genome maps should become achievable.

Antibodies used for ChIP-on-chip can be an important limiting factor. ChIP-on-chip requires highly specific antibodies that must recognize its epitope in free solution and also under fixed conditions. If it is demonstrated to successfully immunoprecipitate cross-linked chromatin, it is termed "ChIP-grade". Companies that provide ChIP-grade antibodies include Abcam, Cell Signaling Technology, Santa Cruz, and Upstate. To overcome the problem of specificity, the protein of interest can be fused to a tag like FLAG or HA that are recognized by antibodies. An alternative to ChIP-on-chip that does not require antibodies is DamID.

Also available are antibodies against a specific histone modification like H3 tri methyl K4. As mentioned before, the combination of these antibodies and ChIP-on-chip has become extremely powerful in determining whole genome analysis of histone modification patterns and will contribute tremendously to our understanding of the histone code and epigenetics.

A study demonstrating the non-specific nature of DNA binding proteins has been published in PLoS Biology. This indicates that alternate confirmation of functional relevancy is a necessary step in any ChIP-chip experiment.[5]

History

A first ChIP-on-chip experiment was performed in 1999 to analyze the distribution of cohesin along budding yeast chromosome III.[6] Although the genome was not completely represented, the protocol in this study remains equivalent as those used in later studies. The ChIP-on-chip technique using all of the ORFs of the genome (that nevertheless remains incomplete, missing intergenic regions) was then applied successfully in three papers published in 2000 and 2001.[7][8][9] The authors identified binding sites for individual transcription factors in the budding yeast Saccharomyces cerevisiae. In 2002, Richard Young’s group [10] determined the genome-wide positions of 106 transcription factors using a c-Myc tagging system in yeast. Other applications for ChIP-on-chip include DNA replication, recombination, and chromatin structure. Since then, ChIP-on-chip has become a powerful tool in determining genome-wide maps of histone modifications and many more transcription factors. ChIP-on-chip in mammalian systems has been difficult due to the large and repetitive genomes. Thus, many studies in mammalian cells have focused on select promoter regions that are predicted to bind transcription factors and have not analyzed the entire genome. However, whole mammalian genome arrays have recently become commercially available from companies like Nimblegen. In the future, as ChIP-on-chip arrays become more and more advanced, high resolution whole genome maps of DNA-binding proteins and chromatin components for mammals will be analyzed in more detail.

Analysis and Software

[1] CoCAS: a free Analysis software for Agilent ChIP-on-Chip experiments

[2] rMAT: R implementation from MAT program to normalize and analyze tiling arrays and ChIP-chip data.

Software Reference

[3] Touati Benoukraf , Pierre Cauchy , Romain Fenouil , Adrien Jeanniard , Frederic Koch , Sébastien Jaeger , Denis Thieffry , Jean Imbert , Jean-Christophe Andrau , Salvatore Spicuglia , and Pierre Ferrier , CoCAS: a ChIP-on-chip analysis suite, Bioinformatics Advance Access published on April 1, 2009, DOI 10.1093/bioinformatics/btp075, Bioinformatics 25: 954-955.

[4] W. Evan Johnson, Wei Li, Clifford A. Meyer, Raphael Gottardo, Jason S. Carroll, Myles Brown, and X. Shirley Liu. Model-based analysis of tiling-arrays for ChIP-chip. Proc Natl Acad Sci U S A. 2006 Aug 15;103(33):12457-62. Epub 2006 Aug 8.

Alternatives

Chip-Sequencing is a recently developed technology that still uses chromatin immunoprecipitation to crosslink the proteins of interest to the DNA but then instead of using a micro-array, it uses the more accurate, higher throughput method of sequencing to localize interaction points.

DamID is an alternative method that does not require antibodies.

External links

References

  1. ^ a b Aparicio, Oscar; Aparicio O, Geisberg JV, Struhl K (2004). "Chromatin immunoprecipitation for determining the association of proteins with specific genomic sequences in vivo". Current Protocols in Cell Biology (University of Southern California, Los Angeles, California, USA.: John Wiley & Sons, Inc.) Chapter 17 (2004): Unit 17.7. doi:10.1002/0471143030.cb1707s23. ISBN 0471143030. ISSN 1934-2616. PMID 18228445. 
  2. ^ a b M.J. Buck, J.D. Lieb, ChIP-chip: considerations for the design, analysis, and application of genome-wide chromatin immunoprecipitation experiments, Genomics 83 (2004) 349-360.
  3. ^ Royce TE, Rozowsky JS, Bertone P, Samanta M, Stolc V, Weissman S, Snyder M, Gerstein M. Related Articles. Issues in the analysis of oligonucleotide tiling microarrays for transcript mapping. Trends Genet. 2005 Aug;21(8):466-75. Review.
  4. ^ Kartha, R. Spectrophotometric Quantification of Nano- and Standard-Volume Samples, (2008, October 7), American Biotechnology Laboratory, http://www.iscpubs.com/Media/PublishingTitles/b0608kar.pdf
  5. ^ "Transcription Factors Bind Thousands of Active and Inactive Regions in the Drosophila Blastoderm". http://biology.plosjournals.org/perlserv/?request=get-document&doi=10.1371/journal.pbio.0060027. 
  6. ^ Blat Y, Kleckner N., Cohesins bind to preferential sites along yeast chromosome III, with differential regulation along arms versus the centric region, Cell (1999) Jul 23;98(2):249-59.
  7. ^ J.D. Lieb, X. Liu, D. Botstein, P.O. Brown, Promoter-specific binding of Rap1 revealed by genome-wide maps of protein-DNA association, Nat. Genet. 28 (2001) 327-334.
  8. ^ B. Ren, F. Robert, J.J. Wyrick, O. Aparicio, E.G. Jennings, I. Simon, J. Zeitlinger, J. Schreiber, N. Nannett, E. Kanin, T.L. Volkert, C.J. Wilson, S.R. Bell, R.A. Young, Genome-wide location and function of DNA binding proteins, Science 290 (2000) 2306-2309.
  9. ^ V.R. Iyer, C.E. Horak, C.S. Scafe, D. Botstein, M. Snyder, P.O. Brown, Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF, Nature 409 (2001) 533-538.
  10. ^ T.I. Lee, N.J. Rinaldi, F. Robert, D.T. Odom, Z. Bar-Joseph, G.K. Gerber, N.M. Hannett, C.T. Harbison, C.M. Thompson, I. Simon, J. Zeitlinger, E.G. Jennings, H.L. Murray, D.B. Gordon, B. Ren, J.J. Wyrick, J.B Tagne, T.L. Volkert, E. Fraenkel, D.K. Gifford, R.A Young, Transcriptional regulatory networks in Saccharomyces cerevisiae, Science 298 (2002) 799-804.