arXiv


arXiv (pronounced "archive"; the X represents the Greek letter chi, χ) is an open-access repository of electronic preprints and postprints (known as e-prints) approved for posting after moderation, but not peer review. It consists of scientific papers in the fields of mathematics, physics, astronomy, electrical engineering, computer science, quantitative biology, statistics, mathematical finance and economics, which can be accessed online. In many fields of mathematics and physics, almost all scientific papers are self-archived on the arXiv repository before publication in a peer-reviewed journal. Some publishers also grant permission for authors to archive the peer-reviewed postprint. Begun on August 14, 1991, arXiv.org passed the half-million-article milestone on October 3, 2008, and had hit a million by the end of 2014. As of April 2021, the submission rate is about 16,000 articles per month.


History

[Figure: arXiv's daily submission rate growth over 30 years since its beginning, with topics labelled by the standard abbreviations used on arxiv.org.]

arXiv was made possible by the compact TeX file format, which allowed scientific papers to be easily transmitted over the Internet and rendered client-side. Around 1990, Joanne Cohn began emailing physics preprints to colleagues as TeX files, but the number of papers being sent soon filled mailboxes to capacity.
Paul Ginsparg recognized the need for central storage, and in August 1991 he created a central repository mailbox stored at the Los Alamos National Laboratory (LANL) that could be accessed from any computer. Additional modes of access were soon added: FTP in 1991, Gopher in 1992, and the World Wide Web in 1993. The term e-print was quickly adopted to describe the articles.

The repository began as a physics archive, called the LANL preprint archive, but soon expanded to include astronomy, mathematics, computer science, quantitative biology and, most recently, statistics. Its original domain name was xxx.lanl.gov. Due to LANL's lack of interest in the rapidly expanding technology, in 2001 Ginsparg moved to Cornell University and changed the name of the repository to arXiv.org. It is now hosted principally by Cornell, with five mirrors around the world.

arXiv was an early adopter and promoter of preprints. Its success in sharing preprints was one of the precipitating factors that led to the later movement in scientific publishing known as open access.
Mathematicians and scientists regularly upload their papers to arXiv.org for worldwide access and sometimes for reviews before they are published in peer-reviewed journals. Ginsparg was awarded a MacArthur Fellowship in 2002 for his establishment of arXiv.

The annual budget for arXiv was approximately $826,000 for 2013 to 2017, funded jointly by Cornell University Library, the Simons Foundation (in both gift and challenge grant forms) and annual fee income from member institutions. This model arose in 2010, when Cornell sought to broaden the funding base of the project by asking institutions to make annual voluntary contributions based on the amount of download usage by each institution. Each member institution pledges a five-year funding commitment to support arXiv. Based on institutional usage ranking, the annual fees are set in four tiers from $1,000 to $4,400. Cornell's goal is to raise at least $504,000 per year through membership fees generated by approximately 220 institutions.

In September 2011, Cornell University Library took overall administrative and financial responsibility for arXiv's operation and development. Ginsparg was quoted in the ''Chronicle of Higher Education'' as saying it "was supposed to be a three-hour tour, not a life sentence". However, Ginsparg remains on arXiv's Scientific Advisory Board and its Physics Advisory Committee.


Moderation process and endorsement

Although arXiv is not peer reviewed, a collection of moderators for each area review the submissions; they may recategorize any that are deemed off-topic, or reject submissions that are not scientific papers, or sometimes for undisclosed reasons. The lists of moderators for many sections of arXiv are publicly available, but moderators for most of the physics sections remain unlisted.

Additionally, an "endorsement" system was introduced in 2004 as part of an effort to ensure content is relevant and of interest to current research in the specified disciplines. Under the system, for categories that use it, an author must be endorsed by an established arXiv author before being allowed to submit papers to those categories. Endorsers are not asked to review the paper for errors, but to check whether the paper is appropriate for the intended subject area. New authors from recognized academic institutions generally receive automatic endorsement, which in practice means that they do not need to deal with the endorsement system at all. However, the endorsement system has attracted criticism for allegedly restricting scientific inquiry.

A majority of the e-prints are also submitted to journals for publication, but some work, including some very influential papers, remains purely as e-prints and is never published in a peer-reviewed journal. A well-known example of the latter is an outline of a proof of Thurston's geometrization conjecture, including the Poincaré conjecture as a particular case, uploaded by Grigori Perelman in November 2002. Perelman appears content to forgo the traditional peer-reviewed journal process, stating: "If anybody is interested in my way of solving the problem, it's all there; let them go and read about it". Despite this non-traditional method of publication, other mathematicians recognized this work by offering the Fields Medal and the Clay Mathematics Millennium Prize to Perelman, both of which he refused.

While arXiv does contain some dubious e-prints, such as those claiming to refute famous theorems or to prove famous conjectures such as Fermat's Last Theorem using only high-school mathematics, a 2002 article which appeared in ''Notices of the American Mathematical Society'' described those as "surprisingly rare". arXiv generally re-classifies these works, e.g. in "General mathematics", rather than deleting them; however, some authors have voiced concern over the lack of transparency in the arXiv screening process.


Submission formats

Papers can be submitted in any of several formats, including LaTeX and PDF printed from a word processor other than TeX or LaTeX. The submission is rejected by the arXiv software if generating the final PDF file fails, if any image file is too large, or if the total size of the submission is too large. arXiv allows authors to store and modify an incomplete submission and to finalize it only when ready. The time stamp on the article is set when the submission is finalized.
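
The three rejection conditions described above can be sketched as a simple check. This is an illustration only, not arXiv's actual software: the `Submission` class, the `rejection_reasons` function and both size limits are hypothetical, and the limits are supplied by the caller because the text gives no figures.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Submission:
    """Hypothetical stand-in for an uploaded submission."""
    pdf_built: bool                                        # did generating the final PDF succeed?
    image_sizes: List[int] = field(default_factory=list)  # size of each image file, in bytes
    other_bytes: int = 0                                   # size of all non-image files, in bytes


def rejection_reasons(sub: Submission, max_image_bytes: int, max_total_bytes: int) -> List[str]:
    """Return the reasons a submission would be rejected (empty list means accepted).

    Both limits are assumptions chosen by the caller, not arXiv's real thresholds.
    """
    reasons = []
    if not sub.pdf_built:
        reasons.append("PDF generation failed")
    if any(size > max_image_bytes for size in sub.image_sizes):
        reasons.append("image file too large")
    if sum(sub.image_sizes) + sub.other_bytes > max_total_bytes:
        reasons.append("submission too large")
    return reasons
```

A submission can fail more than one check at once, which is why the sketch returns a list of reasons rather than stopping at the first failure.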


Access

The standard access route is through the arXiv.org website or one of several mirrors. Several other interfaces and access routes have also been created by unaffiliated organisations. These include the University of California, Davis's ''front'', a web portal that offers additional search functions and a more self-explanatory interface for arXiv.org, and is referred to by some mathematicians as (the) Front. A similar function used to be offered by eprintweb.org, which was launched in September 2006 by the Institute of Physics and switched off on June 30, 2014. Carnegie Mellon University provides TablearXiv, a search engine for tables extracted from arXiv publications. Google Scholar and Microsoft Academic can also be used to search for items in arXiv. Metadata for arXiv is made available through OAI-PMH, the standard for open access repositories. Content is therefore indexed in all major consumers of such data, such as BASE, CORE and Unpaywall. As of 2020, the Unpaywall dump links over 500,000 arXiv URLs as the open access version of a work found in CrossRef data from the publishers, making arXiv a top 10 global host of green open access. Finally, researchers can select sub-fields and receive daily e-mailings or RSS feeds of all submissions in them.
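
As a rough illustration of how a metadata consumer might use OAI-PMH, the sketch below builds a `ListRecords` request and reads identifiers and titles from a response. The request parameters (`verb`, `metadataPrefix`, `set`, `from`) and the XML namespaces come from the OAI-PMH 2.0 and Dublin Core specifications; the base URL, the set name and the sample record are placeholders, not arXiv's real endpoint or data, and the function names are hypothetical.

```python
from typing import Dict, List, Optional
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces defined by the OAI-PMH 2.0 and Dublin Core specifications.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url: str, set_spec: Optional[str] = None,
                     from_date: Optional[str] = None) -> str:
    """Build an OAI-PMH ListRecords request asking for Dublin Core metadata."""
    params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
    if set_spec:
        params["set"] = set_spec      # restrict harvesting to one set
    if from_date:
        params["from"] = from_date    # only records changed since this date
    return base_url + "?" + urlencode(params)


def parse_records(xml_text: str) -> List[Dict[str, str]]:
    """Extract the identifier and title of every record in a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        ident = rec.findtext("oai:header/oai:identifier", default="", namespaces=NS)
        title = rec.findtext(".//dc:title", default="", namespaces=NS)
        records.append({"identifier": ident, "title": title})
    return records


# Made-up response used only to exercise the parser offline.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:0001</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>An example title</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

if __name__ == "__main__":
    # Placeholder endpoint and set name; substitute the repository's documented values.
    print(list_records_url("https://repo.example.org/oai", set_spec="math"))
    print(parse_records(SAMPLE))
```

A real harvester would also need to follow the `resumptionToken` that OAI-PMH uses to page through large result sets; that step is left out here to keep the sketch short.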


Copyright status of files

Files on arXiv can have a number of different copyright statuses:

1. Some are public domain, in which case they will have a statement saying so.
2. Some are available under either the Creative Commons 4.0 Attribution-ShareAlike license or the Creative Commons 4.0 Attribution-Noncommercial-ShareAlike license.
3. Some are copyright to the publisher, but the author has the right to distribute them and has given arXiv a non-exclusive irrevocable license to distribute them.
4. Most are copyright to the author, and arXiv has only a non-exclusive irrevocable license to distribute them.


See also

* List of academic preprint servers
* List of academic databases and search engines
* List of academic journals by preprint policy

