Eukaryotic chromosome fine structure
From Wikipedia, the free encyclopedia
Eukaryotic chromosome fine structure refers to the structure of the sequences that make up eukaryotic chromosomes. Some sequences belong to more than one class, so the classes listed here are not mutually exclusive.
Chromosomal characteristics
Some sequences are required for a properly functioning chromosome:
- Centromere: Used during cell division as the attachment point for the spindle fibers.
- Telomere: Used to maintain chromosomal integrity by capping off the ends of the linear chromosomes. This region is a microsatellite, but its function is more specific than a simple tandem repeat.
Structural sequences
Other sequences are used in replication, or help maintain the physical structure of the chromosome during interphase.
- Ori, or Origin: Origins of replication, the sites where DNA replication begins.
- MAR: Matrix attachment regions, where the DNA attaches to the nuclear matrix.
Protein-coding genes
Regions of the genome with protein-coding genes include several elements:
- Enhancer regions (normally up to a few thousand base pairs upstream of the transcription start site)
- Promoter regions (normally less than a couple of hundred base pairs upstream of the transcription start site) include elements such as the TATA and CAAT boxes, GC elements, etc.
- Exons are the parts of the transcript that will eventually be transported to the cytoplasm for translation. For a gene with alternative splicing, an exon is a portion of the transcript that could be translated, given the correct splicing conditions. The exons can be divided into three parts:
  - The coding region is the portion of the mRNA that will eventually be translated.
  - The upstream untranslated region (5' UTR) can serve several functions, including mRNA transport and initiation of translation (it includes portions of the Kozak sequence). It is never translated into protein, except as a result of certain mutations.
  - The 3' region downstream from the stop codon is separated into two parts:
    - The 3' UTR is never translated, but contributes to mRNA stability. It is also the attachment site for the poly-A tail, which is used in the initiation of translation and also seems to affect the long-term stability (aging) of the mRNA.
    - An unnamed region after the polyadenylation site, but before the actual site of transcription termination, is cleaved off during processing of the transcript, and so does not become part of the 3' UTR. Its function, if any, is unknown.
- Introns are intervening sequences between the exons that are never translated. Some sequences inside introns function as miRNA, and there are even some cases of small genes residing completely within the intron of a large gene. For some genes (such as the antibody genes), internal control regions are found inside introns. These situations, however, are treated as exceptions.
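The layout described above (exons joined after introns are removed, then a 5' UTR, a coding region and a 3' UTR) can be illustrated with a minimal Python sketch. The function names `splice` and `split_mrna`, the toy sequences, and the simplifying rule that translation starts at the first ATG are assumptions made for illustration only; real start-site selection depends on sequence context such as the Kozak sequence, and real transcripts are far longer.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def splice(pre_mrna, exons):
    """Join the exons of a pre-mRNA, dropping the introns between them.

    `exons` is a list of (start, end) pairs, 0-based, end exclusive.
    """
    return "".join(pre_mrna[start:end] for start, end in exons)

def split_mrna(mrna):
    """Split a spliced mRNA (written with DNA letters) into its three parts.

    Simplifying assumption: translation starts at the first ATG.
    The stop codon is kept as the last codon of the coding region.
    Returns None if no start codon or no in-frame stop codon is found.
    """
    start = mrna.find("ATG")
    if start == -1:
        return None
    # Walk codon by codon in the reading frame set by the start codon.
    for i in range(start + 3, len(mrna) - 2, 3):
        if mrna[i:i + 3] in STOP_CODONS:
            end = i + 3
            return {
                "utr5": mrna[:start],
                "cds": mrna[start:end],
                "utr3": mrna[end:],
            }
    return None

# Toy gene: two exons separated by one intron (all sequences are made up).
exon1 = "GGCACCATGG"
intron = "GTAAGTTTTCAG"
exon2 = "CTTGACCAATAAA"
pre_mrna = exon1 + intron + exon2
exon_coords = [
    (0, len(exon1)),
    (len(exon1) + len(intron), len(pre_mrna)),
]

mrna = splice(pre_mrna, exon_coords)
parts = split_mrna(mrna)
print(parts)
# {'utr5': 'GGCACC', 'cds': 'ATGGCTTGA', 'utr3': 'CCAATAAA'}
```

In this toy example the coding region spans both exons, which shows that the boundaries between exons and the boundaries between UTRs and coding region are independent of each other.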
Genes that are used as RNA
Many regions of the DNA are transcribed with RNA as the functional form:
- rRNA: Ribosomal RNAs are components of the ribosome.
- tRNA: Transfer RNAs are used in translation, bringing amino acids to the ribosome.
- snRNA: Small nuclear RNAs are used in spliceosomes to help process pre-mRNA.
- gRNA: Guide RNAs are used in RNA editing.
- miRNA: Micro RNAs are small RNAs (roughly 21 to 24 nucleotides) that are used in gene silencing.
- snoRNA: Small nucleolar RNAs help process ribosomal RNA and assemble the ribosome.
Other RNAs are transcribed and not translated, but their functions have not yet been discovered.
Repeated sequences
Repeated sequences are of two basic types: sequences that are repeated in tandem in one area, and sequences whose copies are interspersed throughout the genome.
Satellites
Satellites are sequences that are repeated in tandem in one area. Depending on the length of the repeat unit, they are classified as either:
- Minisatellite: Short repeat units, typically tens of nucleotides long.
- Microsatellite: Very short repeat units, typically one to six nucleotides long. Some trinucleotide repeats are found in coding regions (see Trinucleotide repeat disorder), but most microsatellites are found in noncoding regions. Whether they have any specific function is unknown. They are used as molecular markers and in DNA fingerprinting.
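As an illustration of what "repeated in tandem" means in practice, the following minimal Python sketch scans a sequence for runs of a short unit repeated back to back. The function name `find_microsatellites` and the thresholds (units of at most six nucleotides, at least four copies) are assumptions chosen for this example, not a standard definition; dedicated repeat-finding tools handle imperfect repeats, which this sketch does not.

```python
import re

def find_microsatellites(seq, max_unit=6, min_copies=4):
    """Find perfect tandem repeats of a short unit in a DNA sequence.

    Returns a list of (start, unit, copies) tuples, where `start` is the
    0-based position of the first copy. The thresholds are illustrative.
    """
    # Group 1 captures the shortest possible unit (lazy quantifier);
    # the backreference \1 then requires further adjacent copies of it.
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_unit, min_copies - 1)
    )
    hits = []
    for match in pattern.finditer(seq):
        unit = match.group(1)
        copies = len(match.group(0)) // len(unit)
        hits.append((match.start(), unit, copies))
    return hits

# A CAG trinucleotide repeat flanked by unrelated bases (made-up sequence).
print(find_microsatellites("TTCAGCAGCAGCAGCAGTT"))
# [(2, 'CAG', 5)]
```

The same function applied to four copies of the hexanucleotide TTAGGG reports a single repeat with unit `TTAGGG`, which is why a telomere repeat can be described as a microsatellite in purely structural terms.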
Interspersed sequences
Interspersed sequences are non-tandem repeats: the copies are scattered across the genome rather than lying next to one another. They can be classified based on the length of the repeat as:
- SINE: Short interspersed nuclear elements. The repeats are normally a few hundred base pairs in length. These sequences constitute about 13% of the human genome,[1] with the specific Alu sequence accounting for 5%.
- LINE: Long interspersed nuclear elements. The repeats are normally several thousand base pairs in length. These sequences constitute about 21% of the human genome.[1]
Both of these types are classified as retrotransposons.
Retrotransposons
Retrotransposons are sequences in the DNA that result from the retrotransposition of RNA, in which an RNA is copied back into DNA and inserted into the genome. LINEs and SINEs are examples in which the sequences are repeats, but non-repeated sequences can also be retrotransposons.
Other sequences
Typical eukaryotic chromosomes contain much more DNA than is classified in the categories above. This DNA may serve as spacing or have other, as-yet-unknown functions, or it may simply consist of random sequences of no consequence.