Reproducibility, closely related to replicability and repeatability, is a major principle underpinning the
scientific method
The scientific method is an Empirical evidence, empirical method for acquiring knowledge that has been referred to while doing science since at least the 17th century. Historically, it was developed through the centuries from the ancient and ...
. For the findings of a study to be reproducible means that results obtained by an
experiment
An experiment is a procedure carried out to support or refute a hypothesis, or determine the efficacy or likelihood of something previously untried. Experiments provide insight into cause-and-effect by demonstrating what outcome occurs whe ...
or an
observational study
In fields such as epidemiology, social sciences, psychology and statistics, an observational study draws inferences from a sample (statistics), sample to a statistical population, population where the dependent and independent variables, independ ...
or in a
statistical analysis
Statistical inference is the process of using data analysis to infer properties of an underlying probability distribution.Upton, G., Cook, I. (2008) ''Oxford Dictionary of Statistics'', OUP. . Inferential statistical analysis infers properties of ...
of a
data set
A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more table (database), database tables, where every column (database), column of a table represents a particular Variable (computer sci ...
should be achieved again with a high degree of reliability when the study is replicated. There are different kinds of replication but typically replication studies involve different researchers using the same methodology. Only after one or several such successful replications should a result be recognized as scientific knowledge.
History

The first to stress the importance of reproducibility in science was the Anglo-Irish chemist
Robert Boyle, in
England
England is a Countries of the United Kingdom, country that is part of the United Kingdom. It is located on the island of Great Britain, of which it covers about 62%, and List of islands of England, more than 100 smaller adjacent islands. It ...
in the 17th century. Boyle's
air pump was designed to generate and study
vacuum
A vacuum (: vacuums or vacua) is space devoid of matter. The word is derived from the Latin adjective (neuter ) meaning "vacant" or "void". An approximation to such vacuum is a region with a gaseous pressure much less than atmospheric pressur ...
, which at the time was a very controversial concept. Indeed, distinguished philosophers such as
René Descartes
René Descartes ( , ; ; 31 March 1596 – 11 February 1650) was a French philosopher, scientist, and mathematician, widely considered a seminal figure in the emergence of modern philosophy and Modern science, science. Mathematics was paramou ...
and
Thomas Hobbes
Thomas Hobbes ( ; 5 April 1588 – 4 December 1679) was an English philosopher, best known for his 1651 book ''Leviathan (Hobbes book), Leviathan'', in which he expounds an influential formulation of social contract theory. He is considered t ...
denied the very possibility of vacuum existence.
Historians of science Steven Shapin
Steven Shapin ( ) (born 1943) is an American historian and sociologist of science. He is Professor Emeritus of the History of Science at Harvard University.
Early life and education
Steven Shapin (born 1943 in New York) was educated at Central ...
and
Simon Schaffer
Simon J. Schaffer (born 1 January 1955) is a historian of science, previously a professor of the history and philosophy of science at the Department of History and Philosophy of Science at the University of Cambridge and was editor of '' The B ...
, in their 1985 book ''
Leviathan and the Air-Pump'', describe the debate between Boyle and Hobbes, ostensibly over the nature of vacuum, as fundamentally an argument about how useful knowledge should be gained. Boyle, a pioneer of the
experimental method
An experiment is a procedure carried out to support or refute a hypothesis, or determine the efficacy or likelihood of something previously untried. Experiments provide insight into cause-and-effect by demonstrating what outcome occurs whe ...
, maintained that the foundations of knowledge should be constituted by experimentally produced facts, which can be made believable to a scientific community by their reproducibility. By repeating the same experiment over and over again, Boyle argued, the certainty of fact will emerge.
The air pump, which in the 17th century was a complicated and expensive apparatus to build, also led to one of the first documented disputes over the reproducibility of a particular
scientific phenomenon. In the 1660s, the Dutch scientist
Christiaan Huygens
Christiaan Huygens, Halen, Lord of Zeelhem, ( , ; ; also spelled Huyghens; ; 14 April 1629 – 8 July 1695) was a Dutch mathematician, physicist, engineer, astronomer, and inventor who is regarded as a key figure in the Scientific Revolution ...
built his own air pump in
Amsterdam
Amsterdam ( , ; ; ) is the capital of the Netherlands, capital and Municipalities of the Netherlands, largest city of the Kingdom of the Netherlands. It has a population of 933,680 in June 2024 within the city proper, 1,457,018 in the City Re ...
, the first one outside the direct management of Boyle and his assistant at the time
Robert Hooke
Robert Hooke (; 18 July 16353 March 1703) was an English polymath who was active as a physicist ("natural philosopher"), astronomer, geologist, meteorologist, and architect. He is credited as one of the first scientists to investigate living ...
. Huygens reported an effect he termed "anomalous suspension", in which water appeared to levitate in a glass jar inside his air pump (in fact suspended over an air bubble), but Boyle and Hooke could not replicate this phenomenon in their own pumps. As Shapin and Schaffer describe, "it became clear that unless the phenomenon could be produced in England with one of the two pumps available, then no one in England would accept the claims Huygens had made, or his competence in working the pump". Huygens was finally invited to England in 1663, and under his personal guidance Hooke was able to replicate anomalous suspension of water. Following this Huygens was elected a Foreign Member of the
Royal Society
The Royal Society, formally The Royal Society of London for Improving Natural Knowledge, is a learned society and the United Kingdom's national academy of sciences. The society fulfils a number of roles: promoting science and its benefits, re ...
. However, Shapin and Schaffer also note that "the accomplishment of replication was dependent on contingent acts of judgment. One cannot write down a formula saying when replication was or was not achieved".
The
philosopher of science
Philosophy ('love of wisdom' in Ancient Greek) is a systematic study of general and fundamental questions concerning topics like existence, reason, knowledge, Value (ethics and social sciences), value, mind, and language. It is a rational an ...
Karl Popper
Sir Karl Raimund Popper (28 July 1902 – 17 September 1994) was an Austrian–British philosopher, academic and social commentator. One of the 20th century's most influential philosophers of science, Popper is known for his rejection of the ...
noted briefly in his famous 1934 book ''
The Logic of Scientific Discovery'' that "non-reproducible single occurrences are of no significance to science". The
statistician
A statistician is a person who works with Theory, theoretical or applied statistics. The profession exists in both the private sector, private and public sectors.
It is common to combine statistical knowledge with expertise in other subjects, a ...
Ronald Fisher
Sir Ronald Aylmer Fisher (17 February 1890 – 29 July 1962) was a British polymath who was active as a mathematician, statistician, biologist, geneticist, and academic. For his work in statistics, he has been described as "a genius who a ...
wrote in his 1935 book ''
The Design of Experiments'', which set the foundations for the modern scientific practice of
hypothesis testing and
statistical significance
In statistical hypothesis testing, a result has statistical significance when a result at least as "extreme" would be very infrequent if the null hypothesis were true. More precisely, a study's defined significance level, denoted by \alpha, is the ...
, that "we may say that a phenomenon is experimentally demonstrable when we know how to conduct an experiment which will rarely fail to give us statistically significant results". Such assertions express a common
dogma in modern science that reproducibility is a necessary condition (although not necessarily
sufficient) for establishing a scientific fact, and in practice for establishing scientific authority in any field of knowledge. However, as noted above by Shapin and Schaffer, this dogma is not well-formulated quantitatively, such as statistical significance for instance, and therefore it is not explicitly established how many times must a fact be replicated to be considered reproducible.
Terminology
''Replicability'' and ''repeatability'' are related terms broadly or loosely synonymous with reproducibility (for example, among the general public), but they are often usefully differentiated in more precise senses, as follows.
Two major steps are naturally distinguished in connection with reproducibility of experimental or observational studies: when new data are obtained in the attempt to achieve it, the term ''replicability'' is often used, and the new study is a ''replication'' or ''replicate'' of the original one. Obtaining the same results when analyzing the data set of the original study again with the same procedures, many authors use the term ''reproducibility'' in a narrow, technical sense coming from its use in computational research. ''Repeatability'' is related to the ''repetition'' of the experiment within the same study by the same researchers. Reproducibility in the original, wide sense is only acknowledged if a replication performed by an ''independent researcher team'' is successful.
The terms reproducibility and replicability sometimes appear even in the scientific literature with reversed meaning, as different research fields settled on their own definitions for the same terms.
Measures of reproducibility and repeatability
In chemistry, the terms reproducibility and repeatability are used with a specific quantitative meaning. In inter-laboratory experiments, a concentration or other quantity of a chemical substance is measured repeatedly in different laboratories to assess the variability of the measurements. Then, the standard deviation of the difference between two values obtained within the same laboratory is called repeatability. The standard deviation for the difference between two measurement from different laboratories is called ''reproducibility''.
These measures are related to the more general concept of
variance components in
metrology
Metrology is the scientific study of measurement. It establishes a common understanding of Unit of measurement, units, crucial in linking human activities. Modern metrology has its roots in the French Revolution's political motivation to stan ...
.
Reproducible research
Reproducible research method
The term ''reproducible research'' refers to the idea that scientific results should be documented in such a way that their deduction is fully transparent. This requires a detailed description of the methods used to obtain the data
and making the full dataset and the code to calculate the results easily accessible.
This is the essential part of
open science
Open science is the movement to make scientific research (including publications, data, physical samples, and software) and its dissemination accessible to all levels of society, amateur or professional. Open science is transparent and accessib ...
.
To make any research project computationally reproducible, general practice involves all data and files being clearly separated, labelled, and documented. All operations should be fully documented and automated as much as practicable, avoiding manual intervention where feasible. The workflow should be designed as a sequence of smaller steps that are combined so that the intermediate outputs from one step directly feed as inputs into the next step. Version control should be used as it lets the history of the project be easily reviewed and allows for the documenting and tracking of changes in a transparent manner.
A basic workflow for reproducible research involves data acquisition, data processing and data analysis. Data acquisition primarily consists of obtaining primary data from a primary source such as surveys, field observations, experimental research, or obtaining data from an existing source. Data processing involves the processing and review of the raw data collected in the first stage, and includes data entry, data manipulation and filtering and may be done using software. The data should be digitized and prepared for data analysis. Data may be analysed with the use of software to interpret or visualise statistics or data to produce the desired results of the research such as quantitative results including figures and tables. The use of software and automation enhances the reproducibility of research methods.
There are systems that facilitate such documentation, like the
R Markdown language
or the
Jupyter notebook.
The
Open Science Framework
The Center for Open Science is a non-profit technology organization based in Charlottesville, Virginia with a mission to "increase the openness, integrity, and reproducibility of scientific research." Brian Nosek and Jeffrey Spies founded the or ...
provides a platform and useful tools to support reproducible research.
Reproducible research in practice
Psychology has seen a renewal of internal concerns about irreproducible results (see the entry on
replicability crisis for empirical results on success rates of replications). Researchers showed in a 2006 study that, of 141 authors of a publication from the American Psychological Association (APA) empirical articles, 103 (73%) did not respond with their data over a six-month period. In a follow-up study published in 2015, it was found that 246 out of 394 contacted authors of papers in APA journals did not share their data upon request (62%). In a 2012 paper, it was suggested that researchers should publish data along with their works, and a dataset was released alongside as a demonstration. In 2017, an article published in ''
Scientific Data'' suggested that this may not be sufficient and that the whole analysis context should be disclosed.
In economics, concerns have been raised in relation to the credibility and reliability of published research. In other sciences, reproducibility is regarded as fundamental and is often a prerequisite to research being published, however in economic sciences it is not seen as a priority of the greatest importance. Most peer-reviewed economic journals do not take any substantive measures to ensure that published results are reproducible, however, the top economics journals have been moving to adopt mandatory data and code archives. There is low or no incentives for researchers to share their data, and authors would have to bear the costs of compiling data into reusable forms. Economic research is often not reproducible as only a portion of journals have adequate disclosure policies for datasets and program code, and even if they do, authors frequently do not comply with them or they are not enforced by the publisher. A Study of 599 articles published in 37 peer-reviewed journals revealed that while some journals have achieved significant compliance rates, significant portion have only partially complied, or not complied at all. On an article level, the average compliance rate was 47.5%; and on a journal level, the average compliance rate was 38%, ranging from 13% to 99%.
A 2018 study published in the journal ''
PLOS ONE'' found that 14.4% of a sample of public health statistics researchers had shared their data or code or both.
There have been initiatives to improve reporting and hence reproducibility in the medical literature for many years, beginning with the
CONSORT __NOTOC__
Consort may refer to:
Music
* "The Consort" (Rufus Wainwright song), from the 2000 album ''Poses''
* Consort of instruments, term for instrumental ensembles
* Consort song (musical), a characteristic English song form, late 16th–earl ...
initiative, which is now part of a wider initiative, the
EQUATOR Network.
This group has recently turned its attention to how better reporting might reduce waste in research, especially biomedical research.
Reproducible research is key to new discoveries in
pharmacology
Pharmacology is the science of drugs and medications, including a substance's origin, composition, pharmacokinetics, pharmacodynamics, therapeutic use, and toxicology. More specifically, it is the study of the interactions that occur betwee ...
. A Phase I discovery will be followed by Phase II reproductions as a drug develops towards commercial production. In recent decades Phase II success has fallen from 28% to 18%. A 2011 study found that 65% of medical studies were inconsistent when re-tested, and only 6% were completely reproducible.
Some efforts have been made to increase replicability beyond the social and biomedical sciences. Studies in the humanities tend to rely more on expertise and hermeneutics which may make replicability more difficult. Nonetheless, some efforts have been made to call for more transparency and documentation in the humanities.
Noteworthy irreproducible results
Hideyo Noguchi became famous for correctly identifying the bacterial agent of
syphilis
Syphilis () is a sexually transmitted infection caused by the bacterium ''Treponema pallidum'' subspecies ''pallidum''. The signs and symptoms depend on the stage it presents: primary, secondary, latent syphilis, latent or tertiary. The prim ...
, but also claimed that he could culture this agent in his laboratory. Nobody else has been able to produce this latter result.
In March 1989,
University of Utah
The University of Utah (the U, U of U, or simply Utah) is a public university, public research university in Salt Lake City, Utah, United States. It was established in 1850 as the University of Deseret (Book of Mormon), Deseret by the General A ...
chemists Stanley Pons and Martin Fleischmann reported the production of excess heat that could only be explained by a nuclear process ("
cold fusion
Cold fusion is a hypothesized type of nuclear reaction that would occur at, or near, room temperature. It would contrast starkly with the nuclear fusion, "hot" fusion that is known to take place naturally within Main sequence, stars and artific ...
"). The report was astounding given the simplicity of the equipment: it was essentially an
electrolysis
In chemistry and manufacturing, electrolysis is a technique that uses Direct current, direct electric current (DC) to drive an otherwise non-spontaneous chemical reaction. Electrolysis is commercially important as a stage in the separation of c ...
cell containing
heavy water
Heavy water (deuterium oxide, , ) is a form of water (molecule), water in which hydrogen atoms are all deuterium ( or D, also known as ''heavy hydrogen'') rather than the common hydrogen-1 isotope (, also called ''protium'') that makes up most o ...
and a
palladium
Palladium is a chemical element; it has symbol Pd and atomic number 46. It is a rare and lustrous silvery-white metal discovered in 1802 by the English chemist William Hyde Wollaston. He named it after the asteroid Pallas (formally 2 Pallas), ...
cathode
A cathode is the electrode from which a conventional current leaves a polarized electrical device such as a lead-acid battery. This definition can be recalled by using the mnemonic ''CCD'' for ''Cathode Current Departs''. Conventional curren ...
which rapidly absorbed the
deuterium
Deuterium (hydrogen-2, symbol H or D, also known as heavy hydrogen) is one of two stable isotopes of hydrogen; the other is protium, or hydrogen-1, H. The deuterium nucleus (deuteron) contains one proton and one neutron, whereas the far more c ...
produced during electrolysis. The news media reported on the experiments widely, and it was a front-page item on many newspapers around the world (see
science by press conference). Over the next several months others tried to replicate the experiment, but were unsuccessful.
Nikola Tesla
Nikola Tesla (;["Tesla"](_blank)
. ''Random House Webster's Unabridged Dictionary''. ; 10 July 1856 – 7 ...
claimed as early as 1899 to have used a high frequency current to light gas-filled lamps from over away
without using wires. In 1904 he built
Wardenclyffe Tower on
Long Island
Long Island is a densely populated continental island in southeastern New York (state), New York state, extending into the Atlantic Ocean. It constitutes a significant share of the New York metropolitan area in both population and land are ...
to demonstrate means to send and receive power without connecting wires. The facility was never fully operational and was not completed due to economic problems, so no attempt to reproduce his first result was ever carried out.
[Cheney, Margaret (1999), ''Tesla, Master of Lightning'', New York: Barnes & Noble Books, , pp. 107.; "Unable to overcome his financial burdens, he was forced to close the laboratory in 1905."]
Other examples which contrary evidence has refuted the original claim:
*
N-rays, a hypothesized form of radiation subsequently found to be illusory
*
Polywater, a hypothesized polymerized form of water found to be just water with common contaminations
*
Stimulus-triggered acquisition of pluripotency, revealed to be the result of fraud
*
GFAJ-1, a bacterium that could purportedly incorporate
arsenic
Arsenic is a chemical element; it has Symbol (chemistry), symbol As and atomic number 33. It is a metalloid and one of the pnictogens, and therefore shares many properties with its group 15 neighbors phosphorus and antimony. Arsenic is not ...
into its DNA in place of phosphorus
*
MMR vaccine controversy — a study in ''
The Lancet
''The Lancet'' is a weekly peer-reviewed general medical journal, founded in England in 1823. It is one of the world's highest-impact academic journals and also one of the oldest medical journals still in publication.
The journal publishes ...
'' claiming the MMR vaccine caused autism was revealed to be fraudulent
*
Schön scandal — semiconductor "breakthroughs" revealed to be fraudulent
*
Power posing — a
social psychology
Social psychology is the methodical study of how thoughts, feelings, and behaviors are influenced by the actual, imagined, or implied presence of others. Although studying many of the same substantive topics as its counterpart in the field ...
phenomenon that went viral after being the subject of a very popular
TED talk, but was unable to be replicated in dozens of studies
See also
*
Metascience
Metascience (also known as meta-research) is the use of scientific methodology to study science itself. Metascience seeks to increase the quality of scientific research while reducing inefficiency. It is also known as "research on research" and ...
*
Accuracy
*
ANOVA gauge R&R
*
Contingency
*
Corroboration
*
Reproducible builds
*
Falsifiability
Falsifiability (or refutability) is a deductive standard of evaluation of scientific theories and hypotheses, introduced by the Philosophy of science, philosopher of science Karl Popper in his book ''The Logic of Scientific Discovery'' (1934). ...
*
Hypothesis
A hypothesis (: hypotheses) is a proposed explanation for a phenomenon. A scientific hypothesis must be based on observations and make a testable and reproducible prediction about reality, in a process beginning with an educated guess o ...
*
Measurement uncertainty
*
Pathological science
*
Pseudoscience
Pseudoscience consists of statements, beliefs, or practices that claim to be both scientific and factual but are incompatible with the scientific method. Pseudoscience is often characterized by contradictory, exaggerated or unfalsifiable cl ...
*
Replication (statistics)
In engineering, science, and statistics, replication is the process of repeating a study or experiment under the same or similar conditions. It is a crucial step to test the original claim and confirm or reject the accuracy of results as well as f ...
*
Replication crisis
The replication crisis, also known as the reproducibility or replicability crisis, refers to the growing number of published scientific results that other researchers have been unable to reproduce or verify. Because the reproducibility of empir ...
* ''
ReScience C'' (journal)
*
Retraction in academic publishing
*
Tautology
*
Testability
*
Verification and validation
Verification and validation (also abbreviated as V&V) are independent procedures that are used together for checking that a product, service, or system meets requirements and specification (technical standard), specifications and that it fulf ...
References
Further reading
*
* "Science is not irrevocably broken,
pidemiologist John Ioannidisasserts. It just needs some improvements. "Despite the fact that I've published papers with pretty depressive titles, I'm actually an optimist," Ioannidis says. "I find no other investment of a society that is better placed than science.""
External links
Transparency and Openness Promotion Guidelinesfrom the
Center for Open Science
Guidelines for Evaluating and Expressing the Uncertainty of NIST Measurement Resultsof the
National Institute of Standards and Technology
The National Institute of Standards and Technology (NIST) is an agency of the United States Department of Commerce whose mission is to promote American innovation and industrial competitiveness. NIST's activities are organized into Outline of p ...
Reproducible papers with artifactsby the
CTuning foundation
ReproducibleResearch.net
{{Authority control
Measurement
Philosophy of science
Scientific method
Tests
Validity (statistics)
Discovery and invention controversies
Metascience
Statistical reliability