C1orf131

Uncharacterized protein C1orf131 is a protein that in humans is encoded by the gene C1orf131 or chromosome 1 open reading frame 131.

Gene

In humans C1orf131 is located on the minus strand of chromosome 1 and on the cytogenetic band 1q42.2 along with 193 other genes.[1] Notably, the gene upstream of C1orf131 is GNPAT, and the gene downstream of C1orf131 is TRIM 67. When this gene is transcribed in humans, it most often forms an mRNA of 1458 base pairs long which codes for seven exons. There are at least seven alternatively spliced mRNAs ranging from 129 base pairs to 1458 base pairs.[2]

Protein

The protein coded by this gene is 293 amino acids long, weighs 32.6 kDa, has an isoelectric point of 10.3,[3] and a charge of 23.0.[4] It has a poly(A) RNA binding site identified,[4] and it also has a domain of unknown function (DUF) 4602. Some post-translational modifications have been identified. Residues S73[5] , S78[5] , S86,[5][6] T163,[7] S207,[6] and S279,[5][8] have been proven to be phosphorylated, and residue K185 is acetylated.[9]

Some interesting characteristics of this protein is that compared to the rest of the human proteome it is rich in lysines, charged amino acids,and basic charged amino acids.[10] The secondary structure of this protein primarily consists of alpha helices and coils with a small percentage of strands.[11] This protein has been shown to be an intrinsically disordered protein in addition it has areas of low complexity and turns or loops.[12] This protein has also been identified to have a coiled coil and nuclear localization sequence.

There are 58 noted polymorphisms. The most common is from leucine to valine at position 28. Despite these 58 possible polymorphisms, none of them have been linked to a disease yet.[13]

It has been shown to interact with ubiquitin[14] through affinity capture followed by mass spectrometry and APP (amyloid beta (A4) precursor protein)[15] through reconstituted complex.

History

There have been a couple of pseudogenes of the gene found in orangutan, mouse lemur, and sloth. There is no paralog found in humans; however, 151 homologues exist as far back as invertebrates. This gene has a domain of unknown function that is 31.5% conserved between the human homolog and Bactrocera cucurbitae (melon fly). It is a comparably fast evolving gene in comparison to Cytochrome C, a slow evolving gene, and Fibrinogen gamma chain, a fast evolving gene. Additionally, are no paralogs or pseudogenes found in the human genome of this gene.

References

  1. http://www.genscript.com/cgi-bin/orf/browse.pl?species=9606&type=locus&chromosome=1&locus=1q42
  2. http://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/av.cgi?db=human&term=c1orf131&submit=Go
  3. http://www.phosphosite.org/proteinAction.do?id=20627&showAllSites=true
  4. 4.0 4.1 "Uniprot Gene: C1orf131".
  5. 5.0 5.1 5.2 5.3 Olsen JV, Vermeulen M, Santamaria A, Kumar C, Miller ML, Jensen LJ, Gnad F, Cox J, Jensen TS, Nigg EA, Brunak S, Mann M. (January 2010). "Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis.". Science Signalling 3 (104). doi:10.1126/scisignal.2000475. Retrieved April 26, 2015.
  6. 6.0 6.1 Wang B, Malik R, Nigg EA, Körner R. (December 2008). "Evaluation of the low-specificity protease elastase for large-scale phosphoproteome analysis.". Analytical Chemistry 80 (24): 9526–9533. doi:10.1021/ac801708p.10.1126/scisignal.2000475. Retrieved April 26, 2015.
  7. Matsuoka S, Ballif BA, Smogorzewska A, McDonald ER 3rd, Hurov KE, Luo J, Bakalarski CE, Zhao Z, Solimini N, Lerenthal Y, Shiloh Y, Gygi SP, Elledge SJ. (May 2007). "ATM and ATR substrate analysis reveals extensive protein networks responsive to DNA damage.". Science 316 (5828): 1160–1166. PMID 17525332. Retrieved April 26, 2015.
  8. Dephoure N, Zhou C, Villén J, Beausoleil SA, Bakalarski CE, Elledge SJ, Gygi SP. (August 2008). "Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis.". Proceedings of the National Academy of Sciences of the United States of America 105 (31). doi:10.1073/pnas.0805139105. PMC 2504835. Retrieved April 26, 2015.
  9. Choudhary C, Kumar C, Gnad F, Nielsen ML, Rehman M, Walther TC, Olsen JV, Mann M. (August 2010). "Lysine acetylation targets protein complexes and co-regulates major cellular functions.". Science 325 (5942). doi:10.1126/science.1175371. PMID 19608861. Retrieved April 26, 2015.
  10. Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (March 1992). "Methods and algorithms for statistical analysis of protein sequences.". Proceedings of the National Academy of Sciences of the United States of America 89 (6). doi:10.1073/pnas.89.6.2002. PMID 1549558. Retrieved April 26, 2015.
  11. Garnier J, Gibrat J-F, Robson B (1996). "GOR secondary structure prediction method version IV". Methods in Enzymology 266: 540–553. Retrieved April 26, 2015.
  12. http://pfam.xfam.org/protein/Q8NDD1
  13. http://bioinformatics.oxfordjournals.org/content/27/18/2494.full
  14. Stes E, Laga M, Walton A, Samyn N, Timmerman E, De Smet I, Goormachtig S, Gevaert K (June 2014). "A COFRADIC Protocol To Study Protein Ubiquitination.". J Proteome Res. (3 ed.) 13: 3107–3113. PMID 24816145. Retrieved April 26, 2015.
  15. Olah J, Vincze O, Virok D, Simon D, Bozso Z, Tokesi N, Horvath I, Hlavanda E, Kovacs J, Magyar A, Szucs M, Orosz F, Penke B, Ovadi J (September 2011). "Interactions of pathological hallmark proteins: tubulin polymerization promoting protein/p25, beta-amyloid, and alpha-synuclein.". J Biol. Chem. (39 ed.) 286: 34088–34100. PMID 21832049. Retrieved April 26, 2015.