HOME

TheInfoList



OR:

Statcheck is an
R package R packages are extensions to the R statistical programming language. R packages contain code, data, and documentation in a standardised collection format that can be installed by users of R, typically via a centralised software repository such as ...
designed to detect
statistical Statistics (from German: ''Statistik'', "description of a state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a scientific, industria ...
errors in
peer-review Peer review is the evaluation of work by one or more people with similar competencies as the producers of the work (peers). It functions as a form of self-regulation by qualified members of a profession within the relevant field. Peer review ...
ed
psychology Psychology is the scientific study of mind and behavior. Psychology includes the study of conscious and unconscious phenomena, including feelings and thoughts. It is an academic discipline of immense scope, crossing the boundaries betwe ...
articles by searching papers for statistical results, redoing the calculations described in each paper, and comparing the two values to see if they match. It takes advantage of the fact that psychological research papers tend to report their results in accordance with the guidelines published by the
American Psychological Association The American Psychological Association (APA) is the largest scientific and professional organization of psychologists in the United States, with over 133,000 members, including scientists, educators, clinicians, consultants, and students. It ha ...
(APA). This leads to several disadvantages: it can only detect results reported completely and in exact accordance with the APA's guidelines, and it cannot detect statistics that are only included in tables in the paper. Another limitation is that Statcheck cannot deal with statistical corrections to test statistics, like Greenhouse–Geisser or Bonferroni corrections, which actually make tests more conservative. Some journals have begun piloting Statcheck as part of their
peer review Peer review is the evaluation of work by one or more people with similar competencies as the producers of the work (peers). It functions as a form of self-regulation by qualified members of a profession within the relevant field. Peer review ...
process. Statcheck is
free software Free software or libre software is computer software distributed under terms that allow users to run the software for any purpose as well as to study, change, and distribute it and any adapted versions. Free software is a matter of liberty, no ...
published under the
GNU GPL The GNU General Public License (GNU GPL or simply GPL) is a series of widely used free software licenses that guarantee end users the four freedoms to run, study, share, and modify the software. The license was the first copyleft for general us ...
v3.


Validity

In 2017, Statcheck's developers published a
preprint In academic publishing, a preprint is a version of a scholarly or scientific paper that precedes formal peer review and publication in a peer-reviewed scholarly or scientific journal. The preprint may be available, often as a non-typeset versio ...
paper concluding that the program accurately identified statistical errors over 95% of the time. This validity study comprised more than 1,000 hand-checked tests among which 5.00% turned out to be inconsistent. The study found that Statcheck recognized 60% of all statistical tests. A reanalysis of these data found that if the program flagged a test as inconsistent, it was correct in 60.4% of cases. Reversely, if a test was truly inconsistent, Statcheck flagged it in an estimated 51.8% of cases (this estimate included the undetected tests and assumed that they had the same rate of inconsistencies as the detected tests). Overall, Statcheck's accuracy was 95.9%, half a percentage point higher than the chance level of 95.4% expected when all tests are simply taken at face value. Statcheck was conservatively biased (by about one standard deviation) against flagging tests. More recent research has used Statcheck on papers published in
Canadian Canadians (french: Canadiens) are people identified with the country of Canada. This connection may be residential, legal, historical or cultural. For most Canadians, many (or all) of these connections exist and are collectively the source of ...
psychology journals, finding similar rates of statistical reporting errors as the original authors based on a 30-year sample of such articles. The same study also found many typographical errors in online versions of relatively old papers, and that correcting for these reduced the estimated percent of tests that were erroneously reported.


History

Statcheck was first developed in 2015 by Michele Nuijten of Tilburg University and Sacha Epskamp of the
University of Amsterdam The University of Amsterdam (abbreviated as UvA, nl, Universiteit van Amsterdam) is a public research university located in Amsterdam, Netherlands. The UvA is one of two large, publicly funded research universities in the city, the other being ...
. Later that year, Nuijten and her colleagues published a paper using Statcheck on over 30,000 psychology papers and reported that "half of all published psychology papers ..contained at least one p-value that was inconsistent with its test". The study was subsequently written up favorably in ''
Nature Nature, in the broadest sense, is the physics, physical world or universe. "Nature" can refer to the phenomenon, phenomena of the physical world, and also to life in general. The study of nature is a large, if not the only, part of science. ...
''. In 2016, Nuijten and Epskamp both received the Leamer-Rosenthal Prize for Open Social Science from the
Berkeley Initiative for Transparency in the Social Sciences The Berkeley Initiative for Transparency in the Social Sciences, abbreviated BITSS, is an academic initiative dedicated to advancing transparency, reproducibility, and openness in social science research. It was established in 2012 by the Univers ...
for creating Statcheck. In 2016, Tilburg University researcher Chris Hartgerink used Statcheck to scan over 50,000 psychology papers and posted the results to PubPeer; he subsequently published the data he extracted from these papers in an article in the journal ''
Data In the pursuit of knowledge, data (; ) is a collection of discrete values that convey information, describing quantity, quality, fact, statistics, other basic units of meaning, or simply sequences of symbols that may be further interpreted ...
''. Hartgerink told
Motherboard A motherboard (also called mainboard, main circuit board, mb, mboard, backplane board, base board, system board, logic board (only in Apple computers) or mobo) is the main printed circuit board (PCB) in general-purpose computers and other expand ...
that "We're checking how reliable is the actual science being presented by science". He also told Vox that he intended to use Statcheck to perform a function similar to a
spell checker In software, a spell checker (or spelling checker or spell check) is a software feature that checks for misspellings in a text. Spell-checking features are often embedded in software or services, such as a word processor, email client, electronic di ...
software program. Hartgerink's action also sent
email Electronic mail (email or e-mail) is a method of exchanging messages ("mail") between people using electronic devices. Email was thus conceived as the electronic ( digital) version of, or counterpart to, mail, at a time when "mail" meant ...
alerts to every researcher who had authored or co-authored a paper that it had flagged. These flaggings, and their posting on a public forum, proved controversial, prompting the
German Psychological Society The German Society for Psychology (Deutsche Gesellschaft für Psychologie) is the German national society of psychologists for education and research in psychology Psychology is the scientific study of mind and behavior. Psychology i ...
to issue a statement condemning this use of Statcheck. Psychologist Dorothy V.M. Bishop, who had two of her own papers flagged by Statcheck, criticized the program for publicly flagging many papers (including one of her own) despite not having found any statistical errors in it. Other critics alleged that Statcheck had reported the presence of errors in papers that did not actually contain them, due to the tool's failure to correctly read statistics from certain papers. Journals that have begun piloting the use of Statcheck as part of their peer review process include '' Psychological Science'', the ''
Canadian Journal of Human Sexuality Canadians (french: Canadiens) are people identified with the country of Canada. This connection may be residential, legal, historical or cultural. For most Canadians, many (or all) of these connections exist and are collectively the source of ...
'', and the ''
Journal of Experimental Social Psychology A journal, from the Old French ''journal'' (meaning "daily"), may refer to: *Bullet journal, a method of personal organization *Diary, a record of what happened over the course of a day or other period *Daybook, also known as a general journal, a ...
''. The
open access Open access (OA) is a set of principles and a range of practices through which research outputs are distributed online, free of access charges or other barriers. With open access strictly defined (according to the 2001 definition), or libre op ...
publisher
PsychOpen PsychOpen is a European Open-Access publishing platform for Psychology operated by the research support organization Leibniz Institute for Psychology Information (ZPID), which combines traditional scientific and Internet-based publishing. PsychOpe ...
has also used it on all papers accepted for publication in their journals since 2017.


See also

* Abuse of statistics *
Misuse of p-values Misuse of ''p''-values is common in scientific research and scientific education. ''p''-values are often used or interpreted incorrectly; the American Statistical Association states that ''p''-values can indicate how incompatible the data are wit ...
*
Metascience Metascience (also known as meta-research) is the use of scientific methodology to study science itself. Metascience seeks to increase the quality of scientific research while reducing inefficiency. It is also known as "''research on research''" ...


References


External links

* * {{R (programming language) Statistical software 2015 software Free R (programming language) software Metascience