Protein Information Resource

The Protein Information Resource (PIR), located at Georgetown University Medical Center (GUMC), is an integrated public bioinformatics resource to support genomic and proteomic research, and scientific studies^[1]^[2]^[3]^[4]^[5]^[6]^[7]

History of PIR

PIR was established in 1984 by the National Biomedical Research Foundation (NBRF) as a resource to assist researchers and costumers in the identification and interpretation of protein sequence information. Prior to that, the NBRF compiled the first comprehensive collection of macromolecular sequences in the Atlas of Protein Sequence and Structure, published from 1964-1974 under the editorship of Margaret Dayhoff. Dr. Dayhoff and her research group pioneered in the development of computer methods for the comparison of protein sequences, for the detection of distantly related sequences and duplications within sequences, and for the inference of evolutionary histories from alignments of protein sequences..

Dr. Winona Barker and Dr. Robert Ledley assumed leadership of the project after the untimely death of Dr. Dayhoff in 1983. In 1999, Dr. Cathy H. Wu joined NBRF, and later on GUMC, to head the bioinformatics efforts of PIR, and has served first as Principal Investigator and, since 2001, as Director.

For four decades, PIR has provided many protein databases and analysis tools freely accessible to the scientific community, including the Protein Sequence Database (PSD), the first international database (see PIR-International), which grew out of Atlas of Protein Sequence and Structure.

In 2002, PIR along with its international partners, EBI (European Bioinformatics Institute) and SIB (Swiss Institute of Bioinformatics), were awarded a grant from NIH to create UniProt, a single worldwide database of protein sequence and function, by unifying the PIR-PSD, Swiss-Prot, and TrEMBL databases.

Present

As of 2010, PIR offers a wide variety of resources mainly oriented to assist the propagation and standardization of protein annotation: PIRSF,^[8] iProClass, iProLINK

References

↑ http://pir.georgetown.edu/ Official website of PIR at Georgetown University.
↑ Wu, C.; Nebert, D. W. (2004). "Update on genome completion and annotations: Protein Information Resource". Human genomics 1 (3): 229–233. doi:10.1186/1479-7364-1-3-229. PMC 3525084. PMID 15588483.
↑ Wu, C. H.; Yeh, L. S.; Huang, H.; Arminski, L.; Castro-Alvear, J.; Chen, Y.; Hu, Z.; Kourtesis, P.; Ledley, R. S.; Suzek, B. E.; Vinayaka, C. R.; Zhang, J.; Barker, W. C. (2003). "The Protein Information Resource". Nucleic Acids Research 31 (1): 345–347. doi:10.1093/nar/gkg040. PMC 165487. PMID 12520019.
↑ Wu, C. H.; Huang, H.; Arminski, L.; Castro-Alvear, J.; Chen, Y.; Hu, Z. Z.; Ledley, R. S.; Lewis, K. C.; Mewes, H. W.; Orcutt, B. C.; Suzek, B. E.; Tsugita, A.; Vinayaka, C. R.; Yeh, L. S.; Zhang, J.; Barker, W. C. (2002). "The Protein Information Resource: An integrated public resource of functional annotation of proteins". Nucleic Acids Research 30 (1): 35–37. doi:10.1093/nar/30.1.35. PMC 99125. PMID 11752247.
↑ Barker, W. C.; Garavelli, J. S.; Hou, Z.; Huang, H.; Ledley, R. S.; McGarvey, P. B.; Mewes, H. W.; Orcutt, B. C.; Pfeiffer, F.; Tsugita, A.; Vinayaka, C. R.; Xiao, C.; Yeh, L. S.; Wu, C. (2001). "Protein Information Resource: A community resource for expert annotation of protein data". Nucleic Acids Research 29 (1): 29–32. doi:10.1093/nar/29.1.29. PMC 29802. PMID 11125041.
↑ Barker, W. C.; Garavelli, J. S.; Huang, H.; McGarvey, P. B.; Orcutt, B. C.; Srinivasarao, G. Y.; Xiao, C.; Yeh, L. S.; Ledley, R. S.; Janda, J. F.; Pfeiffer, F.; Mewes, H. W.; Tsugita, A.; Wu, C. (2000). "The protein information resource (PIR)". Nucleic Acids Research 28 (1): 41–44. doi:10.1093/nar/28.1.41. PMC 102418. PMID 10592177.
↑ George, D. G.; Dodson, R. J.; Garavelli, J. S.; Haft, D. H.; Hunt, L. T.; Marzec, C. R.; Orcutt, B. C.; Sidman, K. E.; Srinivasarao, G. Y.; Yeh, L. -S. L.; Arminski, L. M.; Ledley, R. S.; Tsugita, A.; Barker, W. C. (1997). "The Protein Information Resource (PIR) and the PIR-International Protein Sequence Database". Nucleic Acids Research 25 (1): 24–28. doi:10.1093/nar/25.1.24. PMC 146415. PMID 9016497.
↑ Wu, C. H.; Nikolskaya, A.; Huang, H.; Yeh, L. S.; Natale, D. A.; Vinayaka, C. R.; Hu, Z. Z.; Mazumder, R.; Kumar, S.; Kourtesis, P.; Ledley, R. S.; Suzek, B. E.; Arminski, L.; Chen, Y.; Zhang, J.; Cardenas, J. L.; Chung, S.; Castro-Alvear, J.; Dinkov, G.; Barker, W. C. (2004). "PIRSF: Family classification system at the Protein Information Resource". Nucleic Acids Research 32 (90001): 112D–1114. doi:10.1093/nar/gkh097. PMC 308831. PMID 14681371.

Bioinformatics

Databases	Sequence databases: GenBank, European Nucleotide Archive and DNA Data Bank of Japan Secondary databases: UniProt, database of protein sequences grouping together Swiss-Prot, TrEMBL and Protein Information Resource Other databases: Protein Data Bank, Ensembl and InterPro Specialised genomic databases: BOLD, Saccharomyces Genome Database, FlyBase, VectorBase, WormBase, PHI-base, Arabidopsis Information Resource and Zebrafish Information Network

Other	Algorithm: BLAST Server: ExPASy Ontology: Gene Ontology

Institutions	European Bioinformatics Institute US National Center for Biotechnology Information Swiss Institute of Bioinformatics Japanese Institute of Genetics

List of biological databases Sequencing Sequence database Sequence alignment Molecular phylogenetics