Beijing Genomics Institute

BGI
Industry Genome sequencing
Founded September 9, 1999 (Beijing)
Headquarters Shenzhen, Guangdong, China
Number of locations
Shenzhen, Hong Kong, Wuhan, Hangzhou, Beijing, China;
Boston, USA;
Copenhagen, Denmark
Area served
Worldwide
Key people
Yang Huanming (Chairman)
Wang Jian (President)
Wang Jun (CEO & Director)
Divisions BGI China (Mainland)
BGI Asia Pacific
BGI Americas (North and South America)
BGI Europe (Europe and Africa)
Subsidiaries
Website www.genomics.cn/en

BGI (Chinese: 华大基因; pinyin: Huádà Jīyīn), known as the Beijing Genomics Institute prior to 2008, is one of the world’s premier genome sequencing centers, headquartered in Shenzhen, Guangdong province, China.[1]


History

Wang Jian, Yu Jun, Yang Huanming and Liu Siqi created BGI in November 1999[2] in Beijing, China, as a non-governmental independent research institute in order to participate in the Human Genome Project as China's representative.[3][4] After the project was completed, funding dried up, so BGI moved to Hangzhou in exchange for funding from the Hangzhou Municipal Government.

In 2002, BGI sequenced the rice genome, which was a cover story in the journal Science. In 2003, BGI decoded the SARS virus genome and created a kit for detection of the virus. The same year, BGI Hangzhou and Zhejiang University founded a new research institute, the James D. Watson Institute of Genome Sciences, Zhejiang University. The Watson Institute was intended to become a major center for research and education in East Asia, modelled after the Cold Spring Harbor Laboratory in the US.

In 2007, BGI’s headquarters relocated to Shenzhen as "the first citizen-managed, non-profit research institution in China". Yu Jun left BGI at this time, purportedly selling his stake to the other three founders for a nominal sum.[5] In 2008, BGI-Shenzhen was officially recognized as a state agency,[6] and BGI published the first human genome of an Asian individual.[3][7]

In 2010, BGI Shenzhen was certified as meeting the requirements of the ISO 9001:2008 standard for the design and provision of high-throughput sequencing services.[8] The same year, BGI bought 128 sequencing machines and claimed to be the world's largest genome center.[3]

In 2010 it was reported that BGI would receive US$1.5 billion in “collaborative funds” over the next 10 years from the China Development Bank.[9][10] In 2010, BGI Americas was established with its main office in Cambridge, Massachusetts[11] and BGI Europe was established in Copenhagen.[12]

In 2011, BGI reported that it employed 4,000 scientists and technicians.[1] The same year, BGI sequenced the genome of the E. coli O104:H4 strain behind the deadly outbreak in Germany within three days and released it under an open licence.[13]

In 2013, BGI claimed it had relationships with 17 of the top 20 global pharmaceutical companies[11][14] and advertised that it provided commercial science, health, agricultural, and informatics services.[15] That year it bought Complete Genomics of Mountain View, California, a major supplier of DNA sequencing technology, for US$118 million.[13]

The Institute has described itself as partly private and partly public, receiving funds both from private investors and the Chinese government. The laboratory was also the Bioinformatics Center of the Chinese Academy of Sciences.

Key achievements

Current research projects

Human genetics

Yan Huang Project

The Yan Huang Project, started in 2007, is named after two emperors believed to have founded China’s dominant ethnic group.[31] In this project, BGI planned to sequence at least 100 Chinese individuals to produce a high-resolution map of Chinese genetic polymorphisms.[32][33] The first genome data was published in October 2007.[34] An anonymous Chinese billionaire donated RMB 10 million (about US$1.4 million) to the project, and his genome was sequenced at the beginning of the project.[32][33]

The 1000 genomes project

Main article: 1000 Genomes Project

Diabetes-associated Genes and Variations Study (LUCAMP) Cancer Genome Project

Nine Danish universities and institutes will collaborate with BGI in this targeted resequencing project.

BGI explores genome and gene variation associated with complex diseases in large-scale studies, primarily using two methods: PCR-based resequencing of candidate genes and exon-capture-based whole-exome resequencing.

Cognitive Research Lab

The Cognitive Research Lab at BGI is working with Stephen Hsu on a project to discover the genetic basis of human intelligence.[35]

Animals and plants

1,000 Plant and Animal Reference Project

BGI is leading an international collaboration to sequence 1,000 plants and animals of economic and scientific import within two years. It has pledged an initial US$100 million to start the program.[36]

As of March 2010, BGI had sequenced the genomes of 20 species of animals and 9 species of plants, sometimes for multiple individuals (for example, 40 silkworms[26]), and had an equal number underway.

Three Extreme-Environment Animal Genomes Project

http://www.genomics.cn/en/research.php?type=show&id=330

International Big Cats Genome Project

In 2010, BGI, Beijing University, Heilongjiang Manchurian tiger forestry zoo, Kunming Institute of Zoology, San Diego Zoo Institute for Conservation Research in California, and others announced they would sequence the Amur tiger, South China tiger, Bengal tiger, Asiatic lion, African lion, clouded leopard, snow leopard, and other felines. BGI would also sequence the genomes and epigenomes of a liger and a tigon. Since the two reciprocal hybrids have different phenotypes despite being genetically identical, it was expected that the epigenome might reveal the basis of such differences.[37] The project aimed to significantly advance conservation research and was auspiciously announced for the Chinese Year of the Tiger.[38]

Results were reported in 2013 for the genomes of the Amur tiger, the white Bengal tiger, African lion, white African lion and snow leopard.[39]

Symbiont Genome Project

In a jointly funded project announced on March 19, 2010, BGI will collaborate with Sidney K. Pierce of the University of South Florida and Charles Delwiche of the University of Maryland at College Park to sequence the genomes of the sea slug Elysia chlorotica and its algal food, Vaucheria litorea. The sea slug uses genes from the alga to synthesize chlorophyll, the first interspecies gene transfer discovered. Sequencing their genomes could elucidate the mechanism of that transfer.[40]

Microorganisms

Ten Thousand Microbial Genomes Project

http://english.cas.cn/Ne/CASE/200908/t20090805_44705.shtml

Bioinformatics technology

De novo sequencing requires assembling billions of short strings of DNA sequence into a full genome, which for humans is itself three billion base pairs long.

BGI’s computational biologists developed the first successful algorithm, based on graph theory, for assembling billions of 25- to 75-base-pair strings produced by next-generation sequencers, specifically Illumina’s Genome Analyzer, during de novo sequencing. The algorithm, called SOAPdenovo, can assemble a genome in two days[21] and has been used to assemble an array of plant and animal genomes.
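The graph-theoretic idea behind short-read assembly can be illustrated with a toy example. The sketch below assumes a de Bruijn graph formulation, a common graph-based approach to this problem; it is not BGI's code, and the function names (build_de_bruijn, assemble) are hypothetical. It also assumes error-free reads from a single strand and a k large enough that every (k-1)-mer occurs only once, conditions that real sequencing data does not satisfy.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Build a toy de Bruijn graph: nodes are (k-1)-mers, edges are distinct k-mers."""
    graph = defaultdict(list)
    seen = set()
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            if kmer in seen:  # keep each distinct k-mer as a single edge
                continue
            seen.add(kmer)
            graph[kmer[:-1]].append(kmer[1:])
    return graph


def assemble(reads, k):
    """Spell out a sequence by walking an Eulerian path through the graph."""
    graph = build_de_bruijn(reads, k)
    if not graph:
        return ""
    # Count incoming edges so the start node (one more outgoing than incoming) can be found.
    indegree = defaultdict(int)
    for successors in graph.values():
        for node in successors:
            indegree[node] += 1
    start = next(
        (n for n in list(graph) if len(graph[n]) - indegree[n] == 1),
        next(iter(graph)),  # fall back to any node if the path is a cycle
    )
    # Hierholzer's algorithm: follow unused edges, backtracking when stuck.
    remaining = {node: list(successors) for node, successors in graph.items()}
    stack, path = [start], []
    while stack:
        node = stack[-1]
        if remaining.get(node):
            stack.append(remaining[node].pop())
        else:
            path.append(stack.pop())
    path.reverse()
    # The first node contributes k-1 bases; each later node adds one base.
    return path[0] + "".join(node[-1] for node in path[1:])


if __name__ == "__main__":
    toy_reads = ["ATGGCG", "GGCGTG", "CGTGCA"]  # made-up overlapping reads
    print(assemble(toy_reads, k=4))
```

Production assemblers add steps this sketch omits, such as error correction, handling of repeats and both DNA strands, and scaffolding with paired reads.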

BGI’s 500-node supercomputer processes 10 terabytes of raw sequencing data every 24 hours from its current 30 or so Genome Analyzers from Illumina. The annual budget for the computer center is US$9 million.[41]
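To put these figures in perspective, the short calculation below converts them into sustained rates. It is a back-of-the-envelope sketch only: it assumes decimal units (1 terabyte = 10^12 bytes), treats "30 or so" as exactly 30 sequencers, and assumes the load is spread evenly across nodes and machines, none of which the source states.

```python
# Figures quoted in the text; the unit convention and even split are assumptions.
BYTES_PER_TB = 10**12          # decimal terabyte (assumption)
daily_bytes = 10 * BYTES_PER_TB
seconds_per_day = 24 * 60 * 60
nodes = 500
sequencers = 30                # "30 or so" taken as exactly 30 (assumption)

sustained_mb_per_s = daily_bytes / seconds_per_day / 10**6
per_node_gb_per_day = daily_bytes / nodes / 10**9
per_sequencer_gb_per_day = daily_bytes / sequencers / 10**9

print(f"Sustained input rate: {sustained_mb_per_s:.1f} MB/s")
print(f"Per node: {per_node_gb_per_day:.1f} GB/day")
print(f"Per sequencer: {per_sequencer_gb_per_day:.1f} GB/day")
```

Under these assumptions the cluster ingests roughly 116 MB per second, or about 20 GB per node per day.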

SOAPdenovo is part of the "Short Oligonucleotide Analysis Package" (SOAP), a suite of tools developed by BGI for de novo assembly of human-sized genomes, alignment, SNP detection, resequencing, indel finding, and structural variation analysis. Built for the short reads of Illumina sequencers, SOAPdenovo has been used to assemble multiple human genomes[17][18][19] (identifying an eight-kilobase insertion not detected by mapping to the human reference genome[42]) and animal genomes, such as that of the giant panda.[16]

References

  1. Lone Frank, High-Quality DNA, Apr 24, 2011, The Daily Beast, http://www.thedailybeast.com/newsweek/2011/04/24/high-quality-dna.html
  2. Shu-Ching Jean Chen, (2 September 2013) Genomic Dreams Coming True in China Forbes Asia, Retrieved 27 October 2014
  3. Kevin Davies, (27 September 2011) The Bedrock of BGI: Huanming Yang Bio-IT World, Retrieved 14 January 2014
  4. The dragon's DNA, Jun 17th 2010, The Economist, http://www.economist.com/node/16349434
  5. Shu-Ching Jean Chen, (2 September 2013) Genomic Dreams Coming True in China Forbes Asia, Retrieved 27 October 2014
  6. About BGI, BGI, http://en.genomics.cn/navigation/show_navigation.action?navigation.id=95
  7. Ye, Jia (2008) An Interview with a Leader in Genomics — Beijing Genomics Institute Asia Biotech, Retrieved 14 January 2013
  8. "Next Generation of High-Throughput Sequencing Service of BGI Received the ISO9001 Certification". 23 March 2010. Retrieved 14 January 2014.
  9. "BGI to Receive $1.5B in 'Collaborative Funds' Over 10 Years from China Development Bank | In Sequence | Sequencing | GenomeWeb". Retrieved 29 March 2010.
  10. Fox, J.; Kling, J. (2010). "Chinese institute makes bold sequencing play". Nature Biotechnology 28 (3): 189–191. doi:10.1038/nbt0310-189c. PMID 20212469.
  11. (2013) Introduction to BGI Americas BGI official web page, Retrieved 14 January 2014
  12. (2013) BGI Europe BGI official web page, Retrieved 14 January 2014
  13. Specter, Michael (6 January 2014) The Gene Factory The New Yorker, Retrieved 28 October 2014
  14. Pharma and Biotech Services Introduction, BGI, http://en.genomics.cn/navigation/show_navigation.action?navigation.id=1618
  15. Industry, BGI, http://en.genomics.cn/navigation/show_navigation.action?navigation.id=92
  16. Li, R.; Fan, W.; Tian, G.; Zhu, H.; He, L.; Cai, J.; Huang, Q.; Cai, Q.; Li, B.; Bai, Y.; Zhang, Z.; Zhang, Y.; Wang, W.; Li, J.; Wei, F.; Li, H.; Jian, M.; Li, J.; Zhang, Z.; Nielsen, R.; Li, D.; Gu, W.; Yang, Z.; Xuan, Z.; Ryder, O. A.; Leung, F. C. C.; Zhou, Y.; Cao, J.; Sun, X.; Fu, Y. (2009). "The sequence and de novo assembly of the giant panda genome". Nature 463 (7279): 311–317. doi:10.1038/nature08696. PMID 20010809.
  17. Li, R.; Zhu, H.; Ruan, J.; Qian, W.; Fang, X.; Shi, Z.; Li, Y.; Li, S.; Shan, G.; Kristiansen, K.; Li, S.; Yang, H.; Wang, J.; Wang, J. (2009). "De novo assembly of human genomes with massively parallel short read sequencing". Genome Research 20 (2): 265–272. doi:10.1101/gr.097261.109. PMC 2813482. PMID 20019144.
  18. Rasmussen, M.; Li, Y.; Lindgreen, S.; Pedersen, J. S.; Albrechtsen, A.; Moltke, I.; Metspalu, M.; Metspalu, E.; Kivisild, T.; Gupta, R.; Bertalan, M.; Nielsen, K.; Gilbert, M. T. P.; Wang, Y.; Raghavan, M.; Campos, P. F.; Kamp, H. M.; Wilson, A. S.; Gledhill, A.; Tridico, S.; Bunce, M.; Lorenzen, E. D.; Binladen, J.; Guo, X.; Zhao, J.; Zhang, X.; Zhang, H.; Li, Z.; Chen, M.; Orlando, L. (2010). "Ancient human genome sequence of an extinct Palaeo-Eskimo". Nature 463 (7282): 757–762. doi:10.1038/nature08835. PMC 3951495. PMID 20148029.
  19. Wang, J.; Wang, W.; Li, R.; Li, Y.; Tian, G.; Goodman, L.; Fan, W.; Zhang, J.; Li, J.; Zhang, J.; Guo, Y.; Feng, B.; Li, H.; Lu, Y.; Fang, X.; Liang, H.; Du, Z.; Li, D.; Zhao, Y.; Hu, Y.; Yang, Z.; Zheng, H.; Hellmann, I.; Inouye, M.; Pool, J.; Yi, X.; Zhao, J.; Duan, J.; Zhou, Y.; Qin, J. (2008). "The diploid genome sequence of an Asian individual". Nature 456 (7218): 60–65. doi:10.1038/nature07484. PMC 2716080. PMID 18987735.
  20. Li, R.; Li, Y.; Zheng, H.; Luo, R.; Zhu, H.; Li, Q.; Qian, W.; Ren, Y.; Tian, G.; Li, J.; Zhou, G.; Zhu, X.; Wu, H.; Qin, J.; Jin, X.; Li, D.; Cao, H.; Hu, X.; Blanche, H. L. N.; Cann, H.; Zhang, X.; Li, S.; Bolund, L.; Kristiansen, K.; Yang, H.; Wang, J.; Wang, J. (2009). "Building the sequence map of the human pan-genome". Nature Biotechnology 28 (1): 57–63. doi:10.1038/nbt.1596. PMID 19997067.
  21. "To Start Building 'Human Pan-Genome,' BGI De Novo Assembles Two Genomes from Illumina Data | In Sequence | Sequencing | GenomeWeb". Retrieved 29 March 2010.
  22. Qin, J.; Li, R.; Raes, J.; Arumugam, M.; Burgdorf, K. S.; Manichanh, C.; Nielsen, T.; Pons, N.; Levenez, F.; Yamada, T.; Mende, D. R.; Li, J.; Xu, J.; Li, S.; Li, D.; Cao, J.; Wang, B.; Liang, H.; Zheng, H.; Xie, Y.; Tap, J.; Lepage, P.; Bertalan, M.; Batto, J. M.; Hansen, T.; Le Paslier, D.; Linneberg, A.; Nielsen, H. B. R.; Pelletier, E.; Renault, P. (2010). "A human gut microbial gene catalogue established by metagenomic sequencing". Nature 464 (7285): 59–65. doi:10.1038/nature08821. PMC 3779803. PMID 20203603.
  23. "International Team Catalogs Microbial Genes in the Human Gut | GenomeWeb Daily News | Sequencing | GenomeWeb". Archived from the original on 7 March 2010. Retrieved 29 March 2010.
  24. Enserink, M. (2003). "SARS IN CHINA: China's Missed Chance". Science 301 (5631): 294–296. doi:10.1126/science.301.5631.294. PMID 12869735.
  25. German Teams, BGI and Life Technologies Identify Deadly European E.coli Strain, March 23, 2012 | Bio-IT World, http://www.bio-itworld.com/news/06/02/2011/German-teams-BGI-Life-Technologies-Identify-E-coli-strain.html
  26. Xia, Q.; Guo, Y.; Zhang, Z.; Li, D.; Xuan, Z.; Li, Z.; Dai, F.; Li, Y.; Cheng, D.; Li, R.; Cheng, T.; Jiang, T.; Becquet, C.; Xu, X.; Liu, C.; Zha, X.; Fan, W.; Lin, Y.; Shen, Y.; Jiang, L.; Jensen, J.; Hellmann, I.; Tang, S.; Zhao, P.; Xu, H.; Yu, C.; Zhang, G.; Li, J.; Cao, J.; Liu, S. (2009). "Complete Resequencing of 40 Genomes Reveals Domestication Events and Genes in Silkworm (Bombyx)". Science 326 (5951): 433–436. doi:10.1126/science.1176620. PMID 19713493.
  27. Cyranoski, D. (2010). "Chinese bioscience: The sequence factory". Nature 464 (7285): 22–24. doi:10.1038/464022a. PMID 20203579.
  28. Huang, S.; Li, R.; Zhang, Z.; Li, L.; Gu, X.; Fan, W.; Lucas, W.; Wang, X.; Xie, B.; Ni, P.; Ren, Y.; Zhu, H.; Li, J.; Lin, K.; Jin, W.; Fei, Z.; Li, G.; Staub, J.; Kilian, A.; Van Der Vossen, E. A. G.; Wu, Y.; Guo, J.; He, J.; Jia, Z.; Ren, Y.; Tian, G.; Lu, Y.; Ruan, J.; Qian, W.; Wang, M. (2009). "The genome of the cucumber, Cucumis sativus L". Nature Genetics 41 (12): 1275–1281. doi:10.1038/ng.475. PMID 19881527.
  29. BGI Shenzhen Ranked 4th of Top 10 Institutions in NPI 2010 China, BGI, http://www.bgisequence.com/home/newsandevents/news/bgi-shenzhen-ranked-4th-of-top-10-institutions-in-npi-2010-china
  30. Shukman, David (14 January 2014) China cloning on an 'industrial scale' BBC News Science and Environment, Retrieved 14 January 2014
  31. "Chinese scientists sequence 1st volunteer's genome". People's Daily Online. 7 January 2008. Retrieved 29 October 2014.
  32. Qiu, Jane; Hayden, Check (2008). "Genomics sizes up". Nature 451 (7176): 234. Bibcode:2008Natur.451..234Q. doi:10.1038/451234a. PMID 18202611.
  33. "BGI Offers Next-Gen Sequencing Service, Kicks Off 100-Genome Sequencing Project | In Sequence | Sequencing | GenomeWeb". Genomeweb LLC. 8 January 2008. Retrieved 29 October 2014. (subscription required).
  34. (20 November 2008) YanHuang - The First Asian Diploid Genome BGI Shenzhen web page, Retrieved 29 October 2014
  35. Stephen Hsu is New Director of Research for MSU:
  36. Fox, J.; Kling, J. (2010). "Chinese institute makes bold sequencing play". Nature Biotechnology 28 (3): 189–191. doi:10.1038/nbt0310-189c. PMID 20212469.
  37. "BGI to Sequence Tiger, Lion, and Leopard Species This Year | In Sequence | Sequencing | GenomeWeb". Archived from the original on 28 February 2010. Retrieved 29 March 2010.
  38. "BGI". Archived from the original on 17 February 2010. Retrieved 29 March 2010.
  39. Cho, Y. S.; Hu, L.; Hou, H.; Lee, H.; Xu, J.; Kwon, S.; Oh, S.; Kim, H. M.; Jho, S.; Kim, S.; Shin, Y. A.; Kim, B. C.; Kim, H.; Kim, C. U.; Luo, S. J.; Johnson, W. E.; Koepfli, K. P.; Schmidt-Küntzel, A.; Turner, J. A.; Marker, L.; Harper, C.; Miller, S. M.; Jacobs, W.; Bertola, L. D.; Kim, T. H.; Lee, S.; Zhou, Q.; Jung, H. J.; Xu, X. et al. (2013). "The tiger genome and comparative analysis with lion and snow leopard genomes". Nature Communications 4: 2433. Bibcode:2013NatCo...4E2433C. doi:10.1038/ncomms3433. PMC 3778509. PMID 24045858.
  40. "BGI". Retrieved 29 March 2010.
  41. Petsko, G. A. (2010). "Rising in the East". Genome Biology 11 (1): 102. doi:10.1186/gb-2010-11-1-102. PMC 2847708. PMID 20156314.
  42. "BGI Uses New Short-Read Algorithm to Assemble Panda Genome as Proof of Concept for Human Genome | BioInform | Informatics | GenomeWeb". Retrieved 28 March 2010.
