United States federal research funders use the term cyberinfrastructure to describe research environments that support advanced data acquisition, data storage, data management, data integration, data mining, data visualization and other computing and information processing services distributed over the Internet beyond the scope of a single institution. In scientific usage, cyberinfrastructure is a technological and sociological solution to the problem of efficiently connecting laboratories, data, computers, and people with the goal of enabling derivation of novel scientific theories and knowledge.
Contents |
The term National Information Infrastructure had been popularized by Al Gore in the 1990s. This use of the term "cyberinfrastructure" evolved from the same thinking that produced Presidential Decision Directive NSC-63[1] on Protecting America's Critical Infrastructures (PDD-63). PDD-63 focuses on the security and vulnerability of the nation’s “cyber-based information systems” as well as the critical infrastructures on which America’s military strength and economic well-being depend, such as the electric power grid, transportation networks, potable water and wastewater infrastructures.
The term "cyberinfrastructure" was used in a press briefing on PDD-63 on May 22, 1998[2] with Richard A. Clarke, then national coordinator for security, infrastructure protection, and counter-terrorism, and Jeffrey Hunker, who had just been named director of the critical infrastructure assurance office. Hunker stated:
"One of the key conclusions of the President's commission that laid the intellectual framework for the President's announcement today was that while we certainly have a history of some real attacks, some very serious, to our cyber-infrastructure, the real threat lay in the future. And we can't say whether that's tomorrow or years hence. But we've been very successful as a country and as an economy in wiring together our critical infrastructures. This is a development that's taken place really over the last 10 or 15 years — the Internet, most obviously, but electric power, transportation systems, our banking and financial systems."[2]
The term "cyberinfrastructure" was used by a United States National Science Foundation (NSF) blue-ribbon committee in 2003 in response to the question: how can NSF, as the nation's premier agency funding basic research, remove existing barriers to the rapid evolution of high performance computing, making it truly usable by all the nation's scientists, engineers, scholars, and citizens? The NSF use of the term focuses on the integrated assemblage of these information technologies with one another.
A workshop on cyberinfrastructure for the social sciences was held in San Diego, California in May 2005.[3] Another conference was held in January 2007 in Washington, DC.[4] A "CyberInfrastructure Partnership" existed from February 2005 until 2009.[5] A collaboration led by the University of Wisconsin-Madison and Boston University had a web site called "Engaging People in Cyberinfrastructure" (EPIC) which existed from 2005 through 2007.[6]
Complementing the technical construction of cyberinfrastructure, social scientists in the field of computer supported cooperative work investigate the organizational and social aspects of building these large-scale, distributed resources to support science. Related to this research space is the notion of the collaboratory, originally coined by William Wulf.
Cyberinfrastructure is more often called e-Science or e-Research.[7] In particular, the United Kingdom started an e-Science initiative in 2001.[8] Others distinguish e-Science as the work that is done using the cyberinfrastructure.[9]
NSF's Office of Cyberinfrastructure, for example, supported the TeraGrid project in which the Grid Infrastructure Group led by University of Chicago provided integration of resources and services that were operated by some of the US's supercomputing centers.
The nanoHUB and its HUBzero software originally funded in 2002.[10][11] NSF funded the iPlant Collaborative in 2008 for botany support.[12]
The United States Department of Energy supports e-Science through high performance computing and other initiatives involving its laboratories, including:
The Department of Energy (Office of Science SciDAC-2 program from the High Energy Physics, Nuclear Physics and Advanced Software and Computing Research programs) and NSF (Math and Physical Sciences, Office of Cyberinfrastructure and Office of International Science and Engineering Directorates) support the Open Science Grid which is a consortium of more than 80 member institutions and alliances.
Other examples include:
Cyberinfrastructure is the coordinated aggregate of software, hardware and other technologies, as well as human expertise, required to support current and future discoveries in science and engineering. The challenge of Cyberinfrastructure is to integrate relevant and often disparate resources to provide a useful, usable, and enabling framework for research and discovery characterized by broad access and “end-to-end” coordination.[13]
Cyberinfrastructure consists of computing systems, data storage systems, advanced instruments and data repositories, visualization environments, and people, all linked together by software and high performance networks to improve research productivity and enable breakthroughs not otherwise possible.[14]
Like the physical infrastructure of roads, bridges, power grids, telephone lines, and water systems that support modern society, cyberinfrastructure refers to the distributed computer, information and communication technologies combined with the personnel and integrating components that provide a long-term platform to empower the modern scientific research endeavor.[15]