The iPlant Collaborative, renamed CyVerse in 2017, is a virtual organization created by a cooperative agreement funded by the US National Science Foundation (NSF) to create cyberinfrastructure for the plant sciences (botany).
The NSF compared cyberinfrastructure to physical infrastructure, "... the distributed computer, information and communication technologies combined with the personnel and integrating components that provide a long-term platform to empower the modern scientific research endeavor".
In September 2013 it was announced that the National Science Foundation had renewed iPlant's funding for a second 5-year term with an expansion of scope to all non-human life science research.
The project develops computing systems and software that combine computing resources, like those of TeraGrid, with bioinformatics and computational biology software. Its goal is easier collaboration among researchers with improved data access and processing efficiency. Primarily centered in the United States, it collaborates internationally.
History
Biology increasingly relies on computers, and plant biology is changing with the rise of new technologies. With the advent of bioinformatics, computational biology, DNA sequencing, geographic information systems and other technologies, computers can greatly assist researchers who study plant life and look for solutions to challenges in medicine, biofuels, biodiversity and agriculture, and to problems such as drought tolerance, plant breeding, and sustainable farming.
Many of these problems cross traditional disciplines, so facilitating collaboration between plant scientists of diverse backgrounds and specialties is necessary.
In 2006, the NSF solicited proposals to create "a new type of organization – a cyberinfrastructure collaborative for plant science" through a program titled "Plant Science Cyberinfrastructure Collaborative" (PSCIC), with Christopher Greer as program director. A proposal was accepted (adopting the convention of using the word "Collaborative" as a noun) and iPlant was officially created on February 1, 2008.
Funding was estimated at $10 million per year over five years.
Richard Jorgensen led the team through the proposal stage and was the principal investigator (PI) from 2008 to 2009. Gregory Andrews, Vicki Chandler, Sudha Ram and Lincoln Stein served as Co-Principal Investigators (Co-PIs) from 2008 to 2009. In late 2009, Stephen Goff was named PI and Daniel Stanzione was added as a Co-PI. As of May 2014, Co-PI Stanzione had been replaced by four new Co-PIs: Doreen Ware at Cold Spring Harbor, Nirav Merchant and Eric Lyons at the University of Arizona, and Matthew Vaughn at the Texas Advanced Computing Center.
The iPlant project supports what has been called e-Science, a use of information systems technology that is being adopted by the research community in efforts such as the National Center for Ecological Analysis and Synthesis (NCEAS), ELIXIR, and the Bamboo Technology Project that started in September 2010. iPlant is "designed to create the foundation to support the computational needs of the research community and facilitate progress toward solutions of major problems in plant biology."
The project works as a collaboration. It seeks input from the wider plant science community on what to build.
Based on that input, it has enabled easier use of large data sets, created a community-driven research environment for sharing existing data collections within and between research areas, and shared data with provenance tracking.
One model studied for collaboration was Wikipedia.
Several more recent National Science Foundation awards mentioned iPlant explicitly in their descriptions, as either a design pattern to follow or a collaborator with whom the recipient will work.
Institutions
The primary institution for the iPlant project is the University of Arizona, where the project is located within the BIO5 Institute in Tucson. Since the project's inception in 2008, personnel have also worked at other institutions, including Cold Spring Harbor Laboratory, the University of North Carolina Wilmington, and the University of Texas at Austin in the Texas Advanced Computing Center.
Purdue University and Arizona State University were part of the original project group.
Other collaborating institutions that received support from iPlant for their work on a Grand Challenge in phylogenetics starting in March 2009 included Yale University, the University of Florida, and the University of Pennsylvania.
A trait evolution group was led at the University of Tennessee.
A visualization workshop employing iPlant was run by Virginia Tech in 2011.
The NSF requires that funding subcontracts stay within the United States, but international collaboration started with the Technical University of Munich in 2009 and the University of Toronto in 2010.
East Main Evaluation & Consulting provides external oversight, advice, and assistance.
Services
The iPlant project makes its cyberinfrastructure available in several different ways and offers services to make it accessible to its primary audience. The design was meant to grow in response to the needs of the research community it serves.
The Discovery Environment
The Discovery Environment integrates community-recommended software tools into a system that can handle terabytes of data, using high-performance supercomputers to run analyses much more quickly. It has an interface designed to hide this complexity from the end user. The goal was to make the cyberinfrastructure available to non-technical end users who are not as comfortable using a command-line interface.
iPlant Foundational APIs
A set of application programming interfaces (APIs) for developers allows access to iPlant services, including authentication, data management, and high-performance supercomputing resources, from custom, locally produced software.
Atmosphere
Atmosphere is a cloud computing platform that provides easy access to pre-configured, frequently used analysis routines, relevant algorithms, and data sets, and accommodates computationally and data-intensive bioinformatics tasks. It uses the Eucalyptus virtualization platform.
iPlant Semantic Web
The iPlant Semantic Web effort uses an iPlant-created architecture, protocol, and platform called the Simple Semantic Web Architecture and Protocol (SSWAP) for semantic web linking using an ontology focused on plant science. SSWAP is based on the notion of RESTful web services with an ontology based on the Web Ontology Language (OWL).
Taxonomic Name Resolution Service
The Taxonomic Name Resolution Service (TNRS) is a free utility for correcting and standardizing plant names. This is needed because plant names that are misspelled, out of date (because a newer synonym is preferred), or incomplete make it hard to use computers to process large lists.
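The kind of name standardization described above can be illustrated with a minimal sketch. This is not the TNRS algorithm or its API: the tiny reference lists, the function name `resolve_name`, and the similarity cutoff are all assumptions made for illustration, and the fuzzy matching uses Python's standard `difflib` module rather than anything TNRS is known to use.

```python
from difflib import get_close_matches

# Hypothetical reference data, for illustration only.
ACCEPTED = ["Solanum lycopersicum", "Zea mays", "Arabidopsis thaliana"]
# Out-of-date name -> currently preferred name.
SYNONYMS = {"Lycopersicon esculentum": "Solanum lycopersicum"}


def resolve_name(raw, cutoff=0.8):
    """Return (standardized_name, status) for a submitted plant name.

    status is one of "exact", "synonym", "fuzzy", or "unresolved".
    """
    # Normalize whitespace and capitalization ("zea  MAYS" -> "Zea mays").
    name = " ".join(raw.split())
    name = name[:1].upper() + name[1:].lower()

    if name in ACCEPTED:
        return name, "exact"
    if name in SYNONYMS:
        return SYNONYMS[name], "synonym"

    # Fall back to approximate matching to catch misspellings.
    candidates = ACCEPTED + list(SYNONYMS)
    matches = get_close_matches(name, candidates, n=1, cutoff=cutoff)
    if not matches:
        return None, "unresolved"
    best = matches[0]
    # If the closest match is itself an old synonym, map it forward.
    return SYNONYMS.get(best, best), "fuzzy"


if __name__ == "__main__":
    for submitted in ["  zea   MAYS ", "Lycopersicon esculentum",
                      "Arabidopsis thalianna", "Quercus"]:
        print(repr(submitted), "->", resolve_name(submitted))
```

In this sketch a misspelling such as "Arabidopsis thalianna" resolves to "Arabidopsis thaliana" with status "fuzzy", while a name with no close candidate is reported as "unresolved" rather than guessed.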
My-Plant
My-Plant.org is a social networking community for plant biologists, educators and others to come together to share information and research, collaborate, and track the latest developments in plant science.
The My-Plant network uses the term "clades" to group users, in a manner similar to the phylogenetics of plants themselves.
It was implemented using Drupal as its content management system.
DNA Subway
The DNA Subway website uses a graphical user interface (GUI) to generate DNA sequence annotations, explore plant genomes for members of gene and transposon families, and conduct phylogenetic analyses. It makes high-level DNA analysis available to faculty and students by simplifying annotation and comparative genomics workflows. It was developed for iPlant by the Dolan DNA Learning Center.
External links
* Fast Track to Gene Annotation and Genome Analysis - DNA Subway. Tool web site, iPlant Collaborative. http://www.dnasubway.org (accessed September 21, 2011)