Transposable element

A bacterial DNA transposon

A transposable element (TE or transposon) is a DNA sequence that can change its position within the genome, sometimes creating or reversing mutations and altering the cell's genome size. Transposition often results in duplication of the TE. Barbara McClintock's discovery of these jumping genes earned her a Nobel prize in 1983.[1]

TEs make up a large fraction of the C-value of eukaryotic cells. They are generally considered non-coding DNA, although it has been shown that TEs are important in genome function and evolution.[2] In Oxytricha, which has a unique genetic system, they play a critical role in development.[3] They are also very useful to researchers as a means to alter DNA inside a living organism.

Discovery

Barbara McClintock discovered the first TEs in maize, Zea mays, at the Cold Spring Harbor Laboratory. McClintock was experimenting with maize plants that had broken chromosomes.[4] In the winter of 1944–1945 McClintock planted corn kernels that were self-pollinated, meaning that the flowers were pollinated by the silk of their own plant.[4] These kernels came from a long line of plants that had been self-pollinated, causing broken arms on the end of their ninth chromosome.[4] As the maize plants began to grow, McClintock noted unusual color patterns on the leaves.[4] For example, one leaf had two albino patches of almost identical size, located side by side on the leaf.[4] McClintock hypothesized that during cell division certain cells lost genetic material, while others gained what they had lost.[5] However, when comparing the chromosomes of the current generation of plants and their parent generation, she found certain parts of the chromosomes had switched positions on the chromosome.[5] She disproved the popular genetic theory of the time that genes were fixed in their position on a chromosome. McClintock found that genes could not only move, but they could also be turned on or off due to certain environmental conditions or during different stages of cell development.[5] McClintock also showed that gene mutations could be reversed.[6] McClintock presented her report on her findings in 1951, and published an article on her discoveries in Genetics in November 1953 entitled, ″Induction of Instability at Selected Loci in Maize.″[7] Her work would be largely dismissed and ignored until the late 1960s-1970s when it would be rediscovered after TEs were found in bacteria.[8] She was awarded a Nobel Prize in Medicine or Physiology in 1983 for her discovery of TEs, more than thirty years after her research and initial discovery.[9]

TEs are more common than usually thought. Approximately 90% of maize genome is made up of TEs, and 50% in the human genome.[10]

Classification

Transposable elements (TEs) represent one of several types of mobile genetic elements. TEs are assigned to one of two classes according to their mechanism of transposition, which can be described as either copy and paste (class I TEs) or cut and paste (class II TEs).[11]

Class I (retrotransposons)

Main article: retrotransposon

Class I TEs are copied in two stages: first they are transcribed from DNA to RNA, and the RNA produced is then reverse transcribed to DNA. This copied DNA is then inserted at a new position into the genome. The reverse transcription step is catalyzed by a reverse transcriptase, which is often encoded by the TE itself. The characteristics of retrotransposons are similar to retroviruses, such as HIV.

Retrotransposons are commonly grouped into three main orders:

Retroviruses can also be considered TEs. For example, after entering a host cell and conversion of the retroviral RNA into DNA, the newly produced retroviral DNA is integrated into the genome of the host cell. These integrated DNAs represent a provirus of the retrovirus. The provirus is a specialized form of eukaryotic retrotransposon, which can produce RNA intermediates that may leave the host cell and infect other cells. The transposition cycle of retroviruses has similarities to that of prokaryotic TEs, suggesting a distant relationship between these two TEs types.

Class II (DNA transposons)

The cut-and-paste transposition mechanism of class II TEs does not involve an RNA intermediate. The transpositions are catalyzed by several transposase enzymes. Some transposases non-specifically bind to any target site in DNA, whereas others bind to specific DNA sequence targets. The transposase makes a staggered cut at the target site resulting in single-strand 5' or 3' DNA overhangs (sticky ends). This step cuts out the DNA transposon, which is then ligated into a new target site; this process involves activity of a DNA polymerase that fills in gaps and of a DNA ligase that closes the sugar-phosphate backbone. This results in duplication of the target site. The insertion sites of DNA transposons may be identified by short direct repeats (created by the staggered cut in the target DNA and filling in by DNA polymerase) followed by a series of inverted repeats important for the TE excision by transposase. Cut-and-paste TEs may be duplicated if their transposition takes place during S phase of the cell cycle when a donor site has already been replicated, but a target site has not yet been replicated. Such duplications at the target site can result in gene duplication, which plays an important role in evolution.[12]:284 Not all DNA transposons transpose through the cut-and-paste mechanism. In some cases, a replicative transposition is observed in which a transposon replicates itself to a new target site (e.g. Helitron (biology)).

Class II TEs make less than 2% of the human genome, making the rest Class I.[13]

Autonomous and non-autonomous TEs

Transposition can be classified as either "autonomous" or "non-autonomous" in both Class I and Class II TEs. Autonomous TEs can move by themselves while non-autonomous TEs require the presence of another TE to move. This is often because non-autonomous TEs lack transposase (for class II) or reverse transcriptase (for class I).

Activator element (Ac) is an example of an autonomous TE, and dissociation element (Ds) is an example of non-autonomous TE. Without Ac, Ds is not able to transpose.

Examples

In disease

TEs are mutagens. They can damage the genome of their host cell in different ways:[25]

Diseases that are often caused by TEs include hemophilia A and B, severe combined immunodeficiency, porphyria, predisposition to cancer, and Duchenne muscular dystrophy.[26][27] LINE1 (L1) TEs that land on the human Factor VIII caused haemophilia[28] and insertion of L1 into APC gene caused colon cancer and this confirms that the TEs play an important role for disease development.[29]

Additionally, many TEs contain promoters which drive transcription of their own transposase. These promoters can cause aberrant expression of linked genes, causing disease or mutant phenotypes.

Silencing of TEs[30]

If an organism is composed of mostly TEs, doesn't it affect their genetics? Surprisingly, in most cases TEs are silenced through epigenetics mechanism like methylation, chromatin remodeling and piRNAs. So no phenotypic effects nor the movement of TEs are occurring as in the wild type plant TEs are silenced. Certain mutated plants were found to have defects in the methylation related enzymes (methyl transferase) that cause the transcription of TEs thus affect the phenotype.[31]

Hypothesis suggest that about only 100 LINE1 related sequenced are actually active out of 17% of its exisence in the human genome. In human cells, silencing of LINE1 sequence are triggered by RNAi mechanism. Surprisingly, the RNAi sequenced are derived from the 5' untranslated region (UTR) of the LINE1 long terminal repeats itself. Supposedly the 5' LINE1 UTR codes for sense promoter for LINE1 transcription also encodes antisense promoter for the miRNA that become the substrate for the siRNA production. An inhibition of RNAi silencing mechanism at this region showed an increase in LINE1 transcript as the study showed.[32]

Rate of transposition, induction and defense

One study estimated the rate of transposition of a particular retrotransposon, the Ty1 element in Saccharomyces cerevisiae. Using several assumptions, the rate of successful transposition event per single Ty1 element came out to be about once every few months to once every few years.[33]

Cells defend against the proliferation of TEs in a number of ways. These include piRNAs and siRNAs[34] which silence TEs after they have been transcribed.

Some TEs contain heat-shock like promoters and their rate of transposition increases if the cell is subjected to stress,[35] thus increasing the mutation rate under these conditions, which might be beneficial to the cell.

Evolution

The scientific community is still exploring the evolution of TEs and their effect on genome evolution.

TEs are found in most life forms. They may have originated in the last universal common ancestor, arisen independently multiple times, or arisen once and then spread to other kingdoms by horizontal gene transfer.[36] While some TEs confer benefits on their hosts, most are regarded as selfish DNA parasites. In this way, they are similar to viruses. Various viruses and TEs also share features in their genome structures and biochemical abilities, leading to speculation that they share a common ancestor.

Because excessive TE activity can damage exons, many organisms have developed mechanisms to inhibit their activity. Bacteria may undergo high rates of gene deletion as part of a mechanism to remove TEs and viruses from their genomes while eukaryotic organisms use RNA interference (RNAi) to inhibit TE activity. Nevertheless, some TEs generated large families often associated with speciation events.

Evolution often deactivates DNA transposons, leaving them as introns (inactive gene sequences). In vertebrate animal cells nearly all >100,000 DNA transposons per genome have genes that encode inactive transposase polypeptides.[37] In humans, all Tc1-like transposons are inactive. The first synthetic transposon designed for use in vertebrate cells, the Sleeping Beauty transposon system, is a Tc1/mariner-like transposon. It exists in the human genome as an intron and was activated through reconstruction.[38]

Interspersed Repeats within genomes are created by transposition events accumulating over evolutionary time. Because interspersed repeats block gene conversion, they protect novel gene sequences from being overwritten by similar gene sequences and thereby facilitate the development of new genes.

TEs may have been co-opted by the vertebrate immune system as a means of producing antibody diversity. The V(D)J recombination system operates by a mechanism similar to that of some TEs.

TEs contain many type of genes, including those conferring antibiotic resistance and ability to transpose to conjugative plasmids. Some TEs also contain integrons (genetic elements that can capture and express genes from other sources). These contain integrase, which can integrate gene cassettes. There are over 40 antibiotic resistance genes identified on cassettes, as well as virulence genes.

Transposons do not always excise their elements precisely, sometimes removing up the adjacent base pairs. This phenomenon is called exon shuffling. Shuffling two unrelated exons create a novel gene product or, more likely, an intron.[39]

Applications

The first TE was discovered in the plant maize (Zea mays, corn species), and is named dissociator (Ds). Likewise, the first TE to be molecularly isolated was from a plant (Snapdragon). Appropriately, TEs have been an especially useful tool in plant molecular biology. Researchers use them as a means of mutagenesis. In this context, a TE jumps into a gene and produces a mutation. The presence of such a TE provides a straightforward means of identifying the mutant allele, relative to chemical mutagenesis methods.

Sometimes the insertion of a TE into a gene can disrupt that gene's function in a reversible manner, in a process called insertional mutagenesis; transposase-mediated excision of the DNA transposon restores gene function. This produces plants in which neighboring cells have different genotypes. This feature allows researchers to distinguish between genes that must be present inside of a cell in order to function (cell-autonomous) and genes that produce observable effects in cells other than those where the gene is expressed.

TEs are also a widely used tool for mutagenesis of most experimentally tractable organisms. The Sleeping Beauty transposon system has been used extensively as an insertional tag for identifying cancer genes [40]

The Tc1/mariner-class of TEs Sleeping Beauty transposon system, awarded as the Molecule of the Year 2009[41] is active in mammalian cells and are being investigated for use in human gene therapy.[42][43][44]

De novo repeat identification

De novo repeat identification is an initial scan of sequence data that seeks to find the repetitive regions of the genome, and to classify these repeats. Many computer programs exist to perform de novo repeat identification, all operating under the same general principles.[45] As short tandem repeats are generally 1–6 base pairs in length and are often consecutive, their identification is relatively simple.[46] Dispersed repetitive elements, on the other hand, are more challenging to identify, due to the fact that they are longer and have often acquired mutations. However, it is important to identify these repeats as they are often found to be transposable elements (TEs).[45]

De novo identification of transposons involves three steps: 1) find all repeats within the genome, 2) build a consensus of each family of sequences, and 3) classify these repeats. There are three groups of algorithms for the first step. One group is referred to as the k-mer approach, where a k-mer is a sequence of length k. In this approach, the genome is scanned for overrepresented k-mers; that is, k-mers that occur more often than is likely based on probability alone. The length k is determined by the type of transposon being searched for. The k-mer approach also allows mismatches, the number of which is determined by the analyst. Some k-mer approach programs use the k-mer as a base, and extend both ends of each repeated k-mer until there is no more similarity between them, indicating the ends of the repeats.[45] Another group of algorithms employs a method called sequence self-comparison. Sequence self-comparison programs use databases such as AB-BLAST to conduct an initial sequence alignment. As these programs find groups of elements that partially overlap, they are useful for finding highly diverged transposons, or transposons with only a small region copied into other parts of the genome.[47] Another group of algorithms follows the periodicity approach. These algorithms perform a Fourier transformation on the sequence data, identifying periodicities, regions that are repeated periodically, and are able to use peaks in the resultant spectrum to find candidate repetitive elements. This method works best for tandem repeats, but can be used for dispersed repeats as well. However, it is a slow process, making it an unlikely choice for genome scale analysis.[45]

The second step of de novo repeat identification involves building a consensus of each family of sequences. A consensus sequence is a sequence that is created based on the repeats that comprise a TE family. A base pair in a consensus is the one that occurred most often in the sequences being compared to make the consensus. For example, in a family of 50 repeats where 42 have a T base pair in the same position, the consensus sequence would have a T at this position as well, as the base pair is representative of the family as a whole at that particular position, and is most likely the base pair found in the family’s ancestor at that position.[45] Once a consensus sequence has been made for each family, it is then possible to move on to further analysis, such as TE classification and genome masking in order to quantify the overall TE content of the genome.

See also

Notes

References

  1. McClintock, Barbara (June 1950). "The origin and behavior of mutable loci in maize". Proc Natl Acad Sci U S A. 36 (6): 344–55. Bibcode:1950PNAS...36..344M. doi:10.1073/pnas.36.6.344. PMC 1063197. PMID 15430309.
  2. Bucher E, Reinders J, Mirouze M. (Nov 2012). "Epigenetic control of transposon transcription and mobility in Arabidopsis". Current Opinion in Plant Biology 15 (5): 503–10. doi:10.1016/j.pbi.2012.08.006. PMID 22940592.
  3. "'Junk' DNA Has Important Role, Researchers Find". Science Daily. 21 May 2009.
  4. 4.0 4.1 4.2 4.3 4.4 McGrayne, Sharon Bertsch (1998). Nobel Prize Women in Science: Their Lives, Struggles, and Momentous Discoveries (2nd ed.). Carol Publishing. p. 165. ISBN 978-0-9702256-0-3.
  5. 5.0 5.1 5.2 McGrayne 1998, p. 166
  6. McGrayne 1998, p. 167
  7. McClintock B (November 1953). "Induction of Instability at Selected Loci in Maize". Genetics 38 (6): 579–99. PMC 1209627. PMID 17247459.
  8. Des Jardins, Julie (2010). The Madame Curie Complex: The Hidden History of Women in Science. Feminist Press at CUNY. p. 246. ISBN 978-1-55861-655-4.
  9. Fedoroff, Nina; Botstein, David, eds. (1 January 1992). The Dynamic Genome: Barbara McClintock's Ideas in the Century of Genetics. Cold Spring Harbor Laboratory Press. p. 2. ISBN 978-0-87969-422-7.
  10. SanMiguel P, Tikhonov A, Jin YK et al. (November 1996). "Nested retrotransposons in the intergenic regions of the maize genome". Science 274 (5288): 765–8. doi:10.1126/science.274.5288.765. PMID 8864112.
  11. Wicker, T et al. (December 2007). "A unified classification system for eukaryotic transposable elements". Nature Reviews Genetics 8 (12): 973–82. doi:10.1038/nrg2165. PMID 17984973.
  12. Madigan M, Martinko J (editors) (2006). Brock Biology of Microorganisms (11th ed.). Prentice Hall. ISBN 0-13-144329-1.
  13. Kazazian HH Jr, Moran JV (May 1998). "The impact of L1 retrotransposons on the human genome". Nature Genetics 19 (1): 19–24. doi:10.1038/ng0598-19. PMID 9590283.
  14. Schnable et al. (November 2009). "The B73 maize genome: complexity, diversity, and dynamics". Science 326 (5956): 1112–1115. Bibcode:2009Sci...326.1112S. doi:10.1126/science.1178534. PMID 19965430.
  15. Spradling AC, Rubin GM (October 1982). "Transposition of cloned P elements into Drosophila germ line chromosomes". Science 218 (4570): 341–347. Bibcode:1982Sci...218..341S. doi:10.1126/science.6289435. PMID 6289435.
  16. Rubin GM, Spradling AC (October 1982). "Genetic transformation of Drosophila with transposable element vectors". Science 218 (4570): 348–353. Bibcode:1982Sci...218..348R. doi:10.1126/science.6289436. PMID 6289436.
  17. Cesari F (15 October 2007). "Milestones in Nature: Milestone 9: Transformers, Elements in Disguise". Nature. doi:10.1038/nrg2254.
  18. Kazazian HH, Moran JV (May 1998). "The impact of L1 retrotransposons on the human genome". Nat. Genet. 19 (1): 19–24. doi:10.1038/ng0598-19. PMID 9590283.
  19. Jacobson JW, Medhora MM, Hartl DL (November 1986). "Molecular structure of a somatically unstable transposable element in Drosophila". Proc. Natl. Acad. Sci. U.S.A. 83 (22): 8684–8. doi:10.1073/pnas.83.22.8684. PMC 386995. PMID 3022302.
  20. Lohe AR, Moriyama EN, Lidholm DA, Hartl DL (January 1995). "Horizontal transmission, vertical inactivation, and stochastic loss of mariner-like transposable elements". Mol. Biol. Evol. 12 (1): 62–72. doi:10.1093/oxfordjournals.molbev.a040191. PMID 7877497.
  21. Lampe DJ, Witherspoon DJ, Soto-Adames FN, Robertson HM (April 2003). "Recent horizontal transfer of mellifera subfamily mariner transposons into insect lineages representing four different orders shows that selection acts only during horizontal transfer". Mol. Biol. Evol. 20 (4): 554–62. doi:10.1093/molbev/msg069. PMID 12654937.
  22. Mandal PK, Kazazian HH (October 2008). "SnapShot: Vertebrate transposons". Cell 135 (1): 192–192.e1. doi:10.1016/j.cell.2008.09.028. PMID 18854165.
  23. Carlton JM, Hirt RP, Silva JC et al. (January 2007). "Draft genome sequence of the sexually transmitted pathogen Trichomonas vaginalis". Science 315 (5809): 207–12. doi:10.1126/science.1132894. PMC 2080659. PMID 17218520.
  24. Kim JM, Vanguri S, Boeke JD, Gabriel A, Voytas DF (May 1998). "Transposable elements and genome organization: a comprehensive survey of retrotransposons revealed by the complete Saccharomyces cerevisiae genome sequence". Genome Res. 8 (5): 464–78. doi:10.1101/gr.8.5.464. PMID 9582191.
  25. Belancio VP, Hedges DJ, Deininger P (March 2008). "Mammalian non-LTR retrotransposons: for better or worse, in sickness and in health". Genome Res. 18 (3): 343–58. doi:10.1101/gr.5558208. PMID 18256243.
  26. Kazazian H.H., Goodier J.L. (2002). "LINE drive: retrotransposition and genome instability". Cell 110 (3): 277–80. doi:10.1016/S0092-8674(02)00868-1. PMID 12176313.
  27. Kapitonov V.V., Pavlicek, A., Jurka, J. (2006). "Anthology of Human Repetitive DNA". Encyclopedia of Molecular Cell Biology and Molecular Medicine. doi:10.1002/3527600906.mcb.200300166.
  28. Kazazian, HH Jr et al. (1988). "Haemophilia A resulting from de novo insertion of L1 sequences represents a novel mechanism for mutation in man". Nature 332 (6160): 164–6. doi:10.1038/332164a0. PMID 2831458.
  29. Miki, Y. et al. (Feb 1992). "Disruption of the APC gene by a retrotransposal insertion of L1 sequence in colon cancer". Cancer Research 52 (3): 643–5. PMID 1310068.
  30. Pray, L (2008). "Transposons: The jumping genes". Nature Education 1 (1): 204.
  31. Miura A, Yonebayashi S, Watanabe K, Toyama T, Shimada H, Kakutani T (May 2001). "Mobilization of transposons by a mutation abolishing full DNA methylation in Arabidopsis". Nature 411 (6834): 212–4. doi:10.1038/35075612. PMID 11346800.
  32. Yang N, Kazazian HH (September 2006). "L1 retrotransposition is suppressed by endogenously encoded small interfering RNAs in human cultured cells". Nat. Struct. Mol. Biol. 13 (9): 763–71. doi:10.1038/nsmb1141. PMID 16936727.
  33. Paquin CE, Williamson VM (5 October 1984). "Temperature Effects on the Rate of Ty Transposition". Science 226 (4670): 53–55. Bibcode:1984Sci...226...53P. doi:10.1126/science.226.4670.53. PMID 17815421.
  34. Wei-Jen Chung,Katsutomo Okamura,Raquel Martin, Eric C. Lai (3 June 2008). "Endogenous RNA Interference Provides a Somatic Defense against Drosophila Transposons". Current Biology 18 (11): 795–802. doi:10.1016/j.cub.2008.05.006. PMC 2812477. PMID 18501606.
  35. Dennis J. Strand, John F. McDonald (1985). "Copia is transcriptionally responsive to environmental stress". Nucleic Acids Research 13 (12): 4401–4410. doi:10.1093/nar/13.12.4401. PMC 321795. PMID 2409535.
  36. Kidwell, M.G. (1992). "Horizontal transfer of P elements and other short inverted repeat transposons". Genetica 86 (1): 275–286. doi:10.1007/BF00133726. PMID 1334912.
  37. Plasterk R.H.A., Izsvák Z., Ivics Z. (1999). "Resident aliens: the Tc1/mariner superfamily of transposable elements". Trends Genet 15 (8): 326–32. doi:10.1016/S0168-9525(99)01777-1. PMID 10431195.
  38. Ivics Z., Hackett P.B., Plasterk R.H., Izsvak Z. (1997). "Molecular reconstruction of Sleeping Beauty, a Tc1-like transposon from fish, and its transposition in human cells". Cell 91 (4): 501–10. doi:10.1016/S0092-8674(00)80436-5. PMID 9390559.
  39. Moran JV, DeBerardinis RJ, Kazazian HH (March 1999). "Exon shuffling by L1 retrotransposition". Science 283 (5407): 1530–4. doi:10.1126/science.283.5407.1530. PMID 10066175.
  40. Carlson C.M., Largaespada D.A. (2005). "Insertional mutagenesis in mice: new perspectives and tools". Nature Reviews Genetics 6 (7): 568–80. doi:10.1038/nrg1638. PMID 15995698.
  41. Luft FC (May 2010). "Sleeping Beauty jumps to new heights". Mol. Med 88 (7): 641–643. doi:10.1007/s00109-010-0626-1. PMID 20467721.
  42. Ivics Z, Izsvák Z (October 2006). "Transposons for gene therapy!". Curr Gene Ther 6 (5): 593–607. doi:10.2174/156652306778520647. PMID 17073604.
  43. Wilson MH, Coates CJ, George AL (January 2007). "PiggyBac transposon-mediated gene transfer in human cells". Mol. Ther. 15 (1): 139–145. doi:10.1038/sj.mt.6300028. PMID 17164785.
  44. Hackett P.B., Largaespada D.A., Cooper L.J.N. (2010). "A transposon and transposase system for human application". Mol. Ther. 18 (4): 674–83. doi:10.1038/mt.2010.2. PMC 2862530. PMID 20104209.
  45. 45.0 45.1 45.2 45.3 45.4 Makałowski W, Pande A, Gotea V, Makałowska I (2012). "Transposable elements and their identification". Methods Mol. Biol. 855: 337–59. doi:10.1007/978-1-61779-582-4_12. PMID 22407715.
  46. Saha S, Bridges S, Magbanua ZV, Peterson DG (2008). "Computational Approaches and Tools Used in Identification of Dispersed Repetitive DNA Sequences". Tropical Plant Biol. 1: 85–96. doi:10.1007/s12042-007-9007-5.
  47. Saha S, Bridges S, Magbanua ZV, Peterson DG (April 2008). "Empirical comparison of ab initio repeat finding programs". Nucleic Acids Res. 36 (7): 2284–94. doi:10.1093/nar/gkn064. PMC 2367713. PMID 18287116.

External links