arXiv (pronounced as "archive"; the X represents the Greek letter chi ⟨χ⟩) is an open-access repository of electronic preprints and postprints (known as e-prints) approved for posting after moderation, but not peer reviewed. It consists of scientific papers in the fields of mathematics, physics, astronomy, electrical engineering, computer science, quantitative biology, statistics, mathematical finance, and economics, which can be accessed online. In many fields of mathematics and physics, almost all scientific papers are self-archived on the arXiv repository before publication in a peer-reviewed journal. Some publishers also grant permission for authors to archive the peer-reviewed postprint. Begun on August 14, 1991, arXiv.org passed the half-million-article milestone on October 3, 2008, reached a million by the end of 2014 and two million by the end of 2021. As of November 2024, the submission rate is about 24,000 articles per month.


History

arXiv was made possible by the compact TeX file format, which allowed scientific papers to be easily transmitted over the Internet and rendered client-side. Around 1990, Joanne Cohn began emailing physics preprints to colleagues as TeX files, but the number of papers being sent soon filled mailboxes to capacity. Paul Ginsparg recognized the need for central storage, and in August 1991 he created a central repository mailbox stored at the Los Alamos National Laboratory (LANL) that could be accessed from any computer. Additional modes of access were soon added: FTP in 1991, Gopher in 1992, and the World Wide Web in 1993. The term e-print was quickly adopted to describe the articles.

It began as a physics archive, called the LANL preprint archive, but soon expanded to include astronomy, mathematics, computer science, quantitative biology and, most recently, statistics. Its original domain name was xxx.lanl.gov. Due to LANL's lack of interest in the rapidly expanding technology, in 2001 Ginsparg changed institutions to Cornell University and changed the name of the repository to arXiv.org. Ginsparg brainstormed the new name with his wife; the domain "archive" was already claimed, so "chi" was replaced with "X" standing in as the Greek letter chi and the "e" dropped for symmetry around the "X".

arXiv was an early adopter and promoter of preprints. Its success in sharing preprints was one of the precipitating factors that led to the later movement in scientific publishing known as open access.
Mathematicians and scientists regularly upload their papers to arXiv.org for worldwide access and sometimes for reviews before they are published in peer-reviewed journals. Ginsparg was awarded a MacArthur Fellowship in 2002 for his establishment of arXiv.

The annual budget for arXiv was approximately $826,000 for 2013 to 2017, funded jointly by Cornell University Library, the Simons Foundation (in both gift and challenge grant forms) and annual fee income from member institutions. This model arose in 2010, when Cornell sought to broaden the funding of the project by asking institutions to make annual voluntary contributions based on the amount of download usage by each institution. Each member institution pledges a five-year funding commitment to support arXiv. Based on institutional usage ranking, the annual fees are set in four tiers from $1,000 to $4,400. Cornell's goal is to raise at least $504,000 per year through membership fees generated by approximately 220 institutions.

In September 2011, Cornell University Library took overall administrative and financial responsibility for arXiv's operation and development. Ginsparg was quoted in the Chronicle of Higher Education as joking that it "was supposed to be a three-hour tour, not a life sentence". However, Ginsparg remains on arXiv's Scientific Advisory Board and its Physics Advisory Committee. In January 2022, arXiv began assigning DOIs to articles, in collaboration with DataCite.


Data format

Each arXiv paper has a unique identifier:
* YYMM.NNNNN, e.g. 1507.00123,
* YYMM.NNNN, e.g. 0704.0001,
* arch-ive/YYMMNNN for older papers, e.g. hep-th/9901001.
Different versions of the same paper are specified by a version number at the end, for example 1709.08980v1. If no version number is specified, the default is the latest version.

arXiv uses a category system. Each paper is tagged with one or more categories. Some categories have two layers. For example, q-fin.TR is the "Trading and Market Microstructure" category within "quantitative finance". Other categories have one layer. For example, hep-ex is "high energy physics experiments".
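
The identifier and category conventions described above can be illustrated with a small parser. This is a minimal sketch, not arXiv's own validation code: the names `ArxivId`, `parse_arxiv_id` and `split_category` are hypothetical, the regular expressions cover only the three identifier forms listed in this section, and the month digits are not checked for validity.

```python
import re
from typing import NamedTuple, Optional, Tuple


class ArxivId(NamedTuple):
    archive: Optional[str]  # e.g. "hep-th" for old-style identifiers, else None
    yymm: str               # two-digit year followed by two-digit month
    number: str             # zero-padded sequence number within the month
    version: Optional[int]  # None means "latest version"


# New-style identifiers: YYMM.NNNN or YYMM.NNNNN, optionally followed by vN.
NEW_STYLE = re.compile(
    r"(?P<yymm>\d{4})\.(?P<number>\d{4,5})(?:v(?P<version>[1-9]\d*))?"
)
# Old-style identifiers: archive/YYMMNNN, optionally followed by vN.
OLD_STYLE = re.compile(
    r"(?P<archive>[a-z]+(?:-[a-z]+)*)/(?P<yymm>\d{4})(?P<number>\d{3})"
    r"(?:v(?P<version>[1-9]\d*))?"
)


def parse_arxiv_id(text: str) -> ArxivId:
    """Split an identifier into its parts, or raise ValueError."""
    for pattern in (NEW_STYLE, OLD_STYLE):
        match = pattern.fullmatch(text.strip())
        if match:
            fields = match.groupdict()
            version = fields["version"]
            return ArxivId(
                archive=fields.get("archive"),  # absent in the new-style pattern
                yymm=fields["yymm"],
                number=fields["number"],
                version=int(version) if version is not None else None,
            )
    raise ValueError(f"not a recognised arXiv identifier: {text!r}")


def split_category(category: str) -> Tuple[str, Optional[str]]:
    """Split a category tag into (first layer, second layer or None)."""
    archive, _, subject = category.partition(".")
    return archive, (subject or None)
```

For example, `parse_arxiv_id("1709.08980v1")` yields `ArxivId(archive=None, yymm='1709', number='08980', version=1)`, and `split_category("q-fin.TR")` yields `('q-fin', 'TR')`, while `split_category("hep-ex")` yields `('hep-ex', None)`.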


Moderation process and endorsement

Although arXiv is not peer reviewed, moderators for each area review the submissions; they may recategorize any that are deemed off-topic, or reject submissions that are not scientific papers, or sometimes for undisclosed reasons. The lists of moderators for many sections of arXiv are publicly available, but moderators for most of the physics sections remain unlisted.

Additionally, an "endorsement" system was introduced in 2004 as part of an effort to ensure content is relevant and of interest to current research in the specified disciplines. Under the system, for categories that use it, an author must be endorsed by an established arXiv author before being allowed to submit papers to those categories. Endorsers are not asked to review the paper for errors but to check whether the paper is appropriate for the intended subject area. New authors from recognized academic institutions generally receive automatic endorsement, which in practice means that they do not need to deal with the endorsement system at all. However, the endorsement system has attracted criticism for allegedly restricting scientific inquiry.

A majority of the e-prints are also submitted to journals for publication, but some work, including some very influential papers, remains purely as e-prints and is never published in a peer-reviewed journal. A well-known example of the latter is an outline of a proof of Thurston's geometrization conjecture, including the Poincaré conjecture as a particular case, uploaded by Grigori Perelman in November 2002. Perelman appears content to forgo the traditional peer-reviewed journal process, stating: "If anybody is interested in my way of solving the problem, it's all there: let them go and read about it". Despite this non-traditional method of publication, other mathematicians recognized this work by offering the Fields Medal and Clay Mathematics Millennium Prizes to Perelman, both of which he refused.

While arXiv does contain some dubious e-prints, such as those claiming to refute famous theorems or to prove famous conjectures such as Fermat's Last Theorem using only high-school mathematics, a 2002 article in Notices of the American Mathematical Society described those as "surprisingly rare". arXiv generally re-classifies these works, e.g. in "General mathematics", rather than deleting them; however, some authors have voiced concern over the lack of transparency in the arXiv screening process.


Withdrawn preprints

It has been reported that 14,000 preprints have been withdrawn from arXiv, most commonly due to "crucial errors". A smaller number of the withdrawals were due to the preprint being subsumed by another publication. The report itself was posted on arXiv in December 2024.


Submission formats

Papers can be submitted in any of several formats, including LaTeX and PDF printed from a word processor other than TeX or LaTeX. The submission is rejected by the arXiv software if generating the final PDF file fails, if any image file is too large, or if the total size of the submission is too large. arXiv now allows authors to store and modify an incomplete submission, and only finalize the submission when ready. The time stamp on the article is set when the submission is finalized.


Access

The standard access route is through the arXiv.org website. Other interfaces and access routes have also been created by unaffiliated organisations. Metadata for arXiv is made available through OAI-PMH, the standard for open access repositories. Content is therefore indexed in all major consumers of such data, such as BASE, CORE and Unpaywall. As of 2020, the Unpaywall dump links over 500,000 arXiv URLs as the open access version of a work found in CrossRef data from the publishers, making arXiv a top 10 global host of green open access. Finally, researchers can select sub-fields and receive daily e-mailings or RSS feeds of all submissions in them.
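
As a rough illustration of the OAI-PMH route mentioned above, the sketch below builds a harvesting request URL. The `ListRecords` verb and the `metadataPrefix`, `set`, `from` and `until` arguments come from the OAI-PMH 2.0 protocol; everything else is an assumption made for illustration: the function name `build_list_records_url` is hypothetical, `https://example.org/oai` is a placeholder rather than arXiv's actual endpoint, and the set name `physics` is an illustrative value.

```python
from typing import Optional
from urllib.parse import urlencode


def build_list_records_url(
    base_url: str,
    metadata_prefix: str = "oai_dc",   # Dublin Core, the format every OAI-PMH repository must offer
    set_spec: Optional[str] = None,    # optional subset of the repository
    from_date: Optional[str] = None,   # optional lower bound, e.g. "2024-11-01"
    until_date: Optional[str] = None,  # optional upper bound
) -> str:
    """Return the URL of an OAI-PMH ListRecords request."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    if from_date:
        params["from"] = from_date
    if until_date:
        params["until"] = until_date
    return f"{base_url}?{urlencode(params)}"


# Placeholder endpoint and set name; substitute the repository's documented values.
url = build_list_records_url(
    "https://example.org/oai", set_spec="physics", from_date="2024-11-01"
)
```

The function only builds the URL and performs no network access, so a harvester would still need to fetch the XML response and follow any resumption tokens the repository returns.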


Copyright status of files

Files on arXiv can have a number of different copyright statuses:
* Some are public domain, in which case they will have a statement saying so.
* Some are available under either the Creative Commons 4.0 Attribution-ShareAlike license or the Creative Commons 4.0 Attribution-Noncommercial-ShareAlike license.
* Some are copyright to the publisher, but the author has the right to distribute them and has given arXiv a non-exclusive irrevocable license to distribute them.
* Most are copyright to the author, and arXiv has only a non-exclusive irrevocable license to distribute them.


See also

* bioRxiv
* ChemRxiv
* PsyArXiv
* List of academic databases and search engines
* List of academic journals by preprint policy
* List of preprint repositories
* Sci-Hub
* ViXra

