The SAO/NASA Astrophysics Data System (ADS) is an online database of over 16 million astronomy and physics papers from both peer-reviewed and non-peer-reviewed sources. Abstracts are available free online for almost all articles, and full scanned articles are available in Graphics Interchange Format (GIF) and Portable Document Format (PDF) for older articles. It was developed by the National Aeronautics and Space Administration (NASA), and is managed by the Smithsonian Astrophysical Observatory.

ADS is a powerful research tool and has had a significant impact on the efficiency of astronomical research since it was launched in 1992. Literature searches that previously would have taken days or weeks can now be carried out in seconds via the ADS search engine, which is custom-built for astronomical needs. Studies have found that the benefit to astronomy of the ADS is equivalent to several hundred million US dollars annually, and the system is estimated to have tripled the readership of astronomical journals.

Use of ADS is almost universal among astronomers worldwide, and therefore ADS usage statistics can be used to analyze global trends in astronomical research. These studies have revealed that the amount of research an astronomer carries out is related to the per capita gross domestic product (GDP) of the country in which they are based, and that the number of astronomers in a country is proportional to the GDP of that country, so the total amount of research done in a country is proportional to the square of its GDP divided by its population.
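
The scaling stated in the last sentence follows from combining the two relationships. As a short worked derivation, write $G$ for a country's GDP, $P$ for its population, $N_a$ for its number of astronomers, $r$ for the research output per astronomer and $R_{\text{total}}$ for the country's total output. These symbols are introduced here for illustration only, and "related to" is read as simple proportionality, which is an assumption:

```latex
r \propto \frac{G}{P}, \qquad N_a \propto G
\quad\Longrightarrow\quad
R_{\text{total}} = N_a \, r \;\propto\; G \cdot \frac{G}{P} = \frac{G^{2}}{P}
```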


History

For many years, a growing problem in astronomical research (as in other academic disciplines) was that the number of papers published in the major astronomical journals was increasing steadily, meaning astronomers were able to read less and less of the latest research findings. During the 1980s, astronomers saw that the nascent technologies which formed the basis of the Internet could eventually be used to build an electronic indexing system of astronomical research papers which would allow astronomers to keep abreast of a much greater range of research.

The first suggestion of a database of journal paper abstracts was made at a conference on ''Astronomy from Large Data-bases'' held in Garching bei München in 1987. Initial development of an electronic system for accessing astrophysical abstracts took place during the following two years; in 1991 discussions took place on how to integrate ADS with the SIMBAD database, containing all available catalog designations for objects outside the Solar System, to create a system where astronomers could search for all the papers written about a given object.

An initial version of ADS, with a database consisting of 40 papers, was created as a proof of concept in 1988, and the ADS database was successfully connected with the SIMBAD database in the summer of 1993. The creators believed this was the first use of the Internet to allow simultaneous querying of transatlantic scientific databases. Until 1994, the service was available via proprietary network software, but it was transferred to the nascent World Wide Web early that year. The number of users of the service quadrupled in the five weeks following the introduction of the ADS web-based service.

At first, the journal articles available via ADS were scanned bitmaps created from the paper journals, but from 1995 onwards, the ''Astrophysical Journal'' began to publish an on-line edition, soon followed by the other main journals such as ''Astronomy and Astrophysics'' and the ''Monthly Notices of the Royal Astronomical Society''. ADS provided links to these electronic editions from their first appearance. Since about 1995, the number of ADS users has doubled roughly every two years. ADS now has agreements with almost all astronomical journals, which supply abstracts. Scanned articles from as far back as the early 19th century are available via the service, which now contains over eight million documents.

The service is distributed worldwide, with twelve mirror sites in twelve countries on five continents. The database is synchronized by means of weekly updates using rsync, a mirroring utility which allows updates to only the portions of the database which have changed. All updates are triggered centrally, but they initiate scripts at the mirror sites which "pull" updated data from the main ADS servers.
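
The "pull" arrangement can be illustrated with a minimal Python sketch of what a mirror-side script might assemble. This is not the ADS mirroring script: the host name, paths and function name are hypothetical, and the choice of rsync options is an assumption. The sketch only builds the command, so it can be inspected without contacting any server.

```python
import shlex


def build_pull_command(source, destination, dry_run=False):
    """Build an rsync argument list that pulls `source` into `destination`.

    rsync transfers only the parts of files that differ, which is what
    makes frequent mirror updates cheap.
    """
    command = [
        "rsync",
        "--archive",   # preserve permissions, timestamps and directory tree
        "--compress",  # compress data in transit
        "--delete",    # remove files that no longer exist on the main server
    ]
    if dry_run:
        command.append("--dry-run")  # report what would change, copy nothing
    command += [source, destination]
    return command


if __name__ == "__main__":
    # Hypothetical main-server module and local mirror directory.
    cmd = build_pull_command(
        "rsync://ads-main.example.org/abstracts/",
        "/srv/ads-mirror/abstracts/",
        dry_run=True,
    )
    print(shlex.join(cmd))
```

In this arrangement the central site only needs to signal each mirror to run its script; the transfer itself is initiated from the mirror, so the main servers never need write access to the mirror hosts.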


Data in the system

Papers are indexed within the database by their bibliographic record, containing the details of the journal they were published in and various associated metadata, such as author lists, references and citations. Originally this data was stored in ASCII format, but eventually the limitations of this encouraged the database maintainers to migrate all records to an XML (Extensible Markup Language) format in 2000. Bibliographic records are now stored as an XML element, with sub-elements for the various metadata.

Since the advent of online editions of journals, abstracts are loaded into the ADS on or before the publication date of articles, with the full journal text available to subscribers. Older articles have been scanned, and an abstract is created using optical character recognition software. Scanned articles from before about 1995 are usually available free, by agreement with the journal publishers.

Scanned articles are stored in TIFF format, at both medium and high resolution. The TIFF files are converted on demand into GIF files for on-screen viewing, and PDF or PostScript files for printing. The generated files are then cached to eliminate needlessly frequent regenerations for popular articles. As of 2000, ADS contained 250 GB of scans, which consisted of 1,128,955 article pages comprising 138,789 articles. By 2005 this had grown to 650 GB, and was expected to grow further, to about 900 GB by 2007. No further information has been published.

The database initially contained only astronomical references, but has now grown to incorporate three databases, covering astronomy (including planetary sciences and solar physics) references, physics (including instrumentation and geosciences) references, as well as preprints of scientific papers from arXiv. The astronomy database is by far the most advanced and its use accounts for about 85% of the total ADS usage. Articles are assigned to the different databases according to the subject rather than the journal they are published in, so that articles from any one journal might appear in all three subject databases. The separation of the databases allows searching in each discipline to be tailored, so that words can automatically be given different weight functions in different database searches, depending on how common they are in the relevant field.

Data in the preprint archive is updated daily from the arXiv, the main repository of physics and astronomy preprints. The advent of preprint servers has, like ADS, had a significant impact on the rate of astronomical research, as papers are often made available from preprint servers weeks or months before they are published in the journals. The incorporation of preprints from the arXiv into ADS means that the search engine can return the most current research available, with the caveat that preprints may not have been peer reviewed or proofread to the required standard for publication in the main journals. ADS's database links preprints with subsequently published articles wherever possible, so that citation and reference searches will return links to the journal article where the preprint was cited.
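
As an illustration of the XML storage described in this section, the following Python sketch builds a bibliographic record as a single XML element with one sub-element per piece of metadata, then reads it back. The element and attribute names (`record`, `id`, `title`, `journal`, `author`, `reference`) and the sample values are invented for this example; the actual ADS schema is not given here.

```python
import xml.etree.ElementTree as ET


def build_record(record_id, title, journal, authors, references):
    """Return one bibliographic record as an XML element (hypothetical schema)."""
    record = ET.Element("record", attrib={"id": record_id})
    ET.SubElement(record, "title").text = title
    ET.SubElement(record, "journal").text = journal
    for name in authors:
        ET.SubElement(record, "author").text = name
    for cited_id in references:
        ET.SubElement(record, "reference").text = cited_id
    return record


# Placeholder values, not real bibliographic data.
record = build_record(
    record_id="example-0001",
    title="An example paper",
    journal="Example Journal",
    authors=["Surname, A", "Other, B"],
    references=["example-0000"],
)

# Serialize to text and parse it back, as a storage layer would.
xml_text = ET.tostring(record, encoding="unicode")
parsed = ET.fromstring(xml_text)
authors = [element.text for element in parsed.findall("author")]
```

Compared with fixed-layout ASCII records, repeated sub-elements such as `author` can occur any number of times without changing the record format.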


Software and hardware

The software runs on a system that was written specifically for it, allowing for extensive customization for astronomical needs that would not have been possible with general-purpose database software. The scripts are designed to be as platform independent as possible, given the need to facilitate mirroring on different systems around the world, although the growing use of Linux as the operating system of choice within astronomy has led to increasing optimization of the scripts for installation on that platform.

The main ADS server is located at the Center for Astrophysics Harvard & Smithsonian in Cambridge, Massachusetts, and is a dual 64-bit x86 Intel server with two quad-core 3.0 GHz CPUs and 32 GB of RAM, running the CentOS 5.4 Linux distribution. Mirrors are located in Brazil, China, Chile, France, Germany, India, Indonesia, Japan, Russia, South Korea, the United Kingdom, and Ukraine.


Indexing

ADS currently receives abstracts or tables of contents from almost two hundred journal sources. The service may receive data referring to the same article from multiple sources, and creates one bibliographic reference based on the most accurate data from each source. The common use of TeX and LaTeX by almost all scientific journals greatly facilitates the incorporation of bibliographic data into the system in a standardized format, and importing HTML-coded web-based articles is also simple. ADS utilizes Python and Perl scripts for importing, processing and standardizing bibliographic data.

The apparently mundane task of converting author names into a standard ''Surname, Initial'' format is actually one of the more difficult to automate, due to the wide variety of naming conventions around the world and the possibility that a name such as Davis could be a first name, middle name or surname. The accurate conversion of names requires a detailed knowledge of the names of authors active in astronomy, and ADS maintains an extensive database of author names, which is also used in searching the database (see below).

For electronic articles, a list of the references given at the end of the article is easily extracted. For scanned articles, reference extraction relies on OCR. The reference database can then be "inverted" to list the citations for each paper in the database. Citation lists have been used in the past to identify popular articles missing from the database; mostly these were from before 1975 and have now been added to the system.
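
The "inversion" of the reference database can be sketched in a few lines of Python. The sketch assumes references are held as a mapping from each paper's identifier to the identifiers it cites; that layout, the function names and the `min_citations` threshold are illustrative choices, not the ADS implementation.

```python
from collections import defaultdict


def invert_references(references):
    """Turn {paper: [papers it cites]} into {paper: [papers citing it]}."""
    citations = defaultdict(list)
    for citing_paper, cited_papers in references.items():
        for cited in cited_papers:
            citations[cited].append(citing_paper)
    # Sort each list so the result is deterministic.
    return {paper: sorted(citing) for paper, citing in citations.items()}


def missing_but_cited(references, min_citations=2):
    """List cited papers that have no record of their own in the database."""
    citations = invert_references(references)
    return sorted(
        paper
        for paper, citing in citations.items()
        if paper not in references and len(citing) >= min_citations
    )
```

For example, with `{"A": ["B", "C"], "D": ["B"], "B": []}` the inverted result lists A and D as citing B, and A as citing C. Paper C is cited but has no record of its own, which is how frequently cited papers missing from the database can be spotted.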


Coverage

The database now contains over eight million articles. In the cases of the major journals of astronomy (''Astrophysical Journal'', ''Astronomical Journal'', ''Astronomy and Astrophysics'', ''Publications of the Astronomical Society of the Pacific'' and the ''Monthly Notices of the Royal Astronomical Society''), coverage is complete, with all issues indexed from number 1 to the present. These journals account for about two-thirds of the papers in the database, with the rest consisting of papers published in over 100 other journals from around the world, as well as in conference proceedings.

While the database contains the complete contents of all the major journals and many minor ones as well, its coverage of references and citations is much less complete. References in and citations of articles in the major journals are fairly complete, but references such as "private communication", "in press" or "in preparation" cannot be matched, and author errors in reference listings also introduce potential errors. Astronomical papers may cite and be cited by articles in journals which fall outside the scope of ADS, such as chemistry, mathematics or biology journals.


Search engine

Since its inception, the ADS has developed a highly complex search engine to query the abstract and object databases. The search engine is tailor-made for searching astronomical abstracts, and the engine and its user interface assume that the user is well-versed in astronomy and able to interpret search results which are designed to return more than just the most relevant papers. The database can be queried for author names, astronomical object names, title words, and words in the abstract text, and results can be filtered according to a number of criteria. It works by first gathering synonyms and simplifying search terms as described below, and then generating an "inverted file", which is a list of all the documents matching each search term. The user-selected logic and filters are then applied to this inverted list to generate the final search results.
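
The inverted-file step can be sketched as follows. This is a simplified illustration rather than the ADS engine: it indexes whole lower-cased words from a small in-memory collection, and the function names, the `logic` parameter and the `doc_filter` callback are assumptions made for the example.

```python
def build_inverted_file(documents):
    """Map each word to the set of document identifiers that contain it."""
    inverted = {}
    for doc_id, text in documents.items():
        for word in set(text.lower().split()):
            inverted.setdefault(word, set()).add(doc_id)
    return inverted


def search(inverted, terms, logic="AND", doc_filter=None):
    """Combine the per-term document lists, then apply an optional filter."""
    postings = [inverted.get(term.lower(), set()) for term in terms]
    if not postings:
        return []
    if logic == "AND":
        matches = set.intersection(*postings)  # documents matching every term
    else:
        matches = set.union(*postings)         # documents matching any term
    if doc_filter is not None:
        matches = {doc_id for doc_id in matches if doc_filter(doc_id)}
    return sorted(matches)
```

Because each term is resolved to a list of matching documents first, changing the combining logic or adding a filter only requires set operations on those lists, not another pass over the document text.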


Author name queries

The system indexes author names by surname and initials, and accounts for the possible variations in spelling of names using a list of variations. Such variations are common for names that include diacritics such as umlauts, and for transliterations from Arabic or Cyrillic script. An example of an entry in the author synonym list is:

AFANASJEV, V
AFANAS’EV, V
AFANAS’IEV, V
AFANASEV, V
AFANASYEV, V
AFANS’IEV, V
AFANSEV, V
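
A synonym list of this kind can be used to expand an author query into every known spelling before the database is searched. The Python sketch below uses the entry quoted above; the data layout and function names are assumptions for illustration, not the ADS implementation.

```python
# One synonym group per author; this group is the example entry quoted above.
AUTHOR_SYNONYMS = [
    {
        "AFANASJEV, V",
        "AFANAS’EV, V",
        "AFANAS’IEV, V",
        "AFANASEV, V",
        "AFANASYEV, V",
        "AFANS’IEV, V",
        "AFANSEV, V",
    },
]


def build_lookup(groups):
    """Index every spelling so that it points at its whole synonym group."""
    lookup = {}
    for group in groups:
        for name in group:
            lookup[name.upper()] = frozenset(group)
    return lookup


def expand_author(name, lookup):
    """Return all spellings to search for, given one spelling of a name."""
    key = " ".join(name.upper().split())  # normalize case and spacing
    return sorted(lookup.get(key, {key}))
```

A query for any one spelling, such as `afanasev, v`, then expands to all seven variants, while a name with no entry in the list is searched as typed.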


Object name searches

The capability to search for papers on specific astronomical objects is one of ADS's most powerful tools. The system uses data from SIMBAD, the NASA/IPAC Extragalactic Database, the International Astronomical Union Circulars and the Lunar and Planetary Institute to identify papers referring to a given object, and can also search by object position, listing papers which concern objects within a 10 arcminute radius of a given right ascension and declination. These databases combine the many catalog designations an object might have, so that a search for the Pleiades will also find papers which list the famous open cluster in Taurus under any of its other catalog designations or popular names, such as M45, the Seven Sisters or Melotte 22.
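
The positional search amounts to selecting objects whose angular separation from the requested coordinates is at most 10 arcminutes. The Python sketch below computes that separation with the haversine formula, which stays accurate at small angles. The catalog layout (object name mapped to right ascension and declination in degrees) and the function names are assumptions for illustration.

```python
import math


def angular_separation_arcmin(ra1_deg, dec1_deg, ra2_deg, dec2_deg):
    """Great-circle separation of two sky positions, in arcminutes."""
    ra1, dec1, ra2, dec2 = map(
        math.radians, (ra1_deg, dec1_deg, ra2_deg, dec2_deg)
    )
    # Haversine formula on the celestial sphere.
    h = (
        math.sin((dec2 - dec1) / 2) ** 2
        + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2
    )
    separation = 2 * math.asin(min(1.0, math.sqrt(h)))
    return math.degrees(separation) * 60


def objects_within(catalog, ra_deg, dec_deg, radius_arcmin=10.0):
    """Names of catalog objects inside the search radius, sorted."""
    return sorted(
        name
        for name, (ra, dec) in catalog.items()
        if angular_separation_arcmin(ra_deg, dec_deg, ra, dec) <= radius_arcmin
    )
```

Note that a difference of 0.1 degrees in right ascension corresponds to less than 6 arcminutes on the sky at a declination of 20 degrees, because lines of right ascension converge towards the poles; a simple comparison of coordinate differences would therefore give wrong results away from the celestial equator.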


Title and abstract searches

The search engine first filters search terms in several ways. An M followed by a space or hyphen has the space or hyphen removed, which simplifies searching for Messier catalogue objects: a user input of M45, M 45 or M-45 results in the same query being executed. Similarly, NGC designations and common search terms such as Shoemaker Levy and T Tauri are stripped of spaces. Unimportant words such as AT, OR and TO are stripped out, although in some cases case sensitivity is maintained, so that while and is ignored, And is converted to "Andromedae", and Her is converted to "Herculis", but her is ignored.
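
The filtering rules described above can be sketched in a few lines of Python. This is only an illustration of the behaviour described in this section, not the actual ADS implementation: the function name `preprocess`, the three lookup tables and the lower-case joined forms are all hypothetical, and treating stop words as case-insensitive apart from the listed exceptions is an assumption.

```python
import re

# Hypothetical tables for illustration; the real ADS lists are not given here.
STOP_WORDS = {"at", "or", "to", "and", "her"}
CASE_SENSITIVE = {"And": "Andromedae", "Her": "Herculis"}
JOINED_PHRASES = {"shoemaker levy": "shoemakerlevy", "t tauri": "ttauri"}


def preprocess(query: str) -> list[str]:
    """Return the list of search terms after filtering a raw query."""
    # Join catalogue prefixes to their numbers: "M 45" and "M-45" become "M45".
    query = re.sub(r"\b(M|NGC)[ -](\d+)\b", r"\1\2", query)
    # Strip the spaces from known multi-word terms (whole words only).
    for phrase, joined in JOINED_PHRASES.items():
        pattern = r"\b" + re.escape(phrase) + r"\b"
        query = re.sub(pattern, joined, query, flags=re.IGNORECASE)
    terms = []
    for word in query.split():
        if word in CASE_SENSITIVE:
            # Exact-case match: a constellation abbreviation, not a stop word.
            terms.append(CASE_SENSITIVE[word])
        elif word.lower() in STOP_WORDS:
            continue  # unimportant word, dropped
        else:
            terms.append(word)
    return terms
```

With these tables, `preprocess("M 45")`, `preprocess("M-45")` and `preprocess("M45")` all yield the same single term, while `preprocess("R And")` keeps the constellation and `preprocess("stars and her")` drops both lower-case words.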


Synonym replacement

Once search terms have been pre-processed, the database is queried with the revised search term, as well as synonyms for it. In addition to simple synonym replacement, such as searching for both plural and singular forms, ADS searches for a large number of specifically astronomical synonyms. For example, spectrograph and spectroscope have essentially the same meaning, and in an astronomical context metallicity and abundance are also synonymous. ADS's synonym list was created manually, by grouping the list of words in the database according to similar meanings. As well as English language synonyms, ADS also searches for English translations of foreign search terms and vice versa, so that a search for the French word 'soleil' retrieves references to Sun, and papers in languages other than English can be returned by English search terms. Synonym replacement can be disabled if required, so that a rare term which is a synonym of a much more common term (such as 'dateline' rather than 'date') can be searched for specifically.
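
As a rough sketch of how synonym expansion with an off switch might work, the snippet below uses a small hand-made table. The groups are taken from the examples in this section; the names `SYNONYM_GROUPS` and `expand`, and the `use_synonyms` flag, are hypothetical and do not describe the real ADS data structures.

```python
# Hypothetical synonym groups built from the examples in the text.
SYNONYM_GROUPS = [
    {"spectrograph", "spectroscope"},
    {"metallicity", "abundance"},
    {"sun", "soleil"},        # translation treated as a synonym
    {"date", "dateline"},
]


def expand(term: str, use_synonyms: bool = True) -> set[str]:
    """Return the set of terms actually sent to the database."""
    term = term.lower()
    if not use_synonyms:
        # Synonym replacement disabled: search for the exact term only.
        return {term}
    expanded = {term}
    for group in SYNONYM_GROUPS:
        if term in group:
            expanded |= group
    return expanded
```

Here `expand("soleil")` also searches for "sun", whereas `expand("dateline", use_synonyms=False)` searches for the rare term alone.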


Selection logic

The search engine allows selection logic both within fields and between fields. Search terms in each field can be combined with OR, AND, simple logic or Boolean logic, and the user can specify which fields must be matched in the search results. This allows complex searches to be built; for example, the user could search for papers concerning NGC 6543 OR NGC 7009, with the paper titles containing (radius OR velocity) AND NOT (abundance OR temperature).
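
The example query can be written out as an explicit Boolean test. The sketch below is illustrative only: the `Paper` record and the `matches` function are hypothetical, and matching on whole lower-cased title words is a simplifying assumption.

```python
from dataclasses import dataclass


@dataclass
class Paper:
    title: str
    objects: frozenset[str]  # object names the paper is indexed under


def matches(paper: Paper) -> bool:
    """(NGC 6543 OR NGC 7009) AND title has (radius OR velocity)
    AND NOT (abundance OR temperature)."""
    words = set(paper.title.lower().split())
    about_object = bool(paper.objects & {"NGC 6543", "NGC 7009"})
    wanted = bool(words & {"radius", "velocity"})
    unwanted = bool(words & {"abundance", "temperature"})
    # Logic between fields (object, title) and within the title field.
    return about_object and wanted and not unwanted
```

A paper on NGC 6543 titled "Expansion velocity of the halo" passes, while one titled "Radius and temperature of the nebula" is rejected by the AND NOT clause.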


Result filtering

Search results can be filtered according to a number of criteria, including a range of years such as '1945 to 1975', '2000 to the present day' or 'before 1900', and the type of journal in which the article appears: non-peer reviewed articles such as conference proceedings can be excluded or specifically searched for, and specific journals can be included in or excluded from the search.
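
A filter of this kind might look like the following sketch. The record fields (`year`, `refereed`, `journal`) and the parameter names are assumptions made for illustration; open-ended ranges are expressed by leaving `start` or `end` unset.

```python
from typing import Optional


def keep(record: dict,
         start: Optional[int] = None,
         end: Optional[int] = None,
         refereed_only: bool = False,
         exclude_journals: frozenset = frozenset()) -> bool:
    """Return True if a search result passes the year and journal filters."""
    year = record["year"]
    if start is not None and year < start:
        return False
    if end is not None and year > end:
        return False
    if refereed_only and not record["refereed"]:
        return False  # e.g. drop conference proceedings
    if record["journal"] in exclude_journals:
        return False
    return True
```

Under these assumptions '1945 to 1975' is `keep(r, start=1945, end=1975)`, '2000 to the present day' is `keep(r, start=2000)`, and 'before 1900' is `keep(r, end=1899)`.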


Search results

Although it was conceived as a means of accessing abstracts and papers, ADS provides a substantial amount of ancillary information along with search results. For each abstract returned, links are provided to the papers in the database that it references and to those that cite it, and a link is provided to a preprint, where one exists. The system also generates a link to 'also-read' articles, that is, those which have been most commonly accessed by those reading the article. In this way, an ADS user can determine which papers are of most interest to astronomers who are interested in the subject of a given paper. Also returned are links to the SIMBAD and/or NASA Extragalactic Database object name databases, via which a user can quickly find out basic observational data about the objects analyzed in a paper, and find further papers on those objects.
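
The 'also-read' idea can be illustrated with a simple co-readership count. This is a minimal sketch under the assumption that usage data is available as one set of paper identifiers per reader; the function name `also_read` and that data layout are hypothetical, and the text does not say how ADS actually computes the list.

```python
from collections import Counter


def also_read(target: str, reading_lists: list[set[str]], top: int = 3) -> list[str]:
    """Papers most often accessed by readers who also accessed `target`."""
    counts = Counter()
    for papers in reading_lists:
        if target in papers:
            # Count every other paper this reader accessed.
            counts.update(p for p in papers if p != target)
    return [paper for paper, _ in counts.most_common(top)]
```

For example, if three readers of paper "A" also read "B" and two of them also read "C", then "B" ranks ahead of "C" in the also-read list for "A".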


Impact on astronomy

ADS is almost universally used as a research tool among astronomers, and several studies have estimated quantitatively how much more efficient ADS has made astronomy; one estimated that ADS increased the efficiency of astronomical research by 333 full-time equivalent research years per year, and another found that in 2002 its effect was equivalent to 736 full-time researchers, or all the astronomical research done in France. ADS has allowed literature searches that would previously have taken days or weeks to be completed in seconds, and it is estimated that ADS has increased the readership and use of the astronomical literature by a factor of about three since its inception. In monetary terms, this increase in efficiency represents a considerable amount. There are about 12,000 active astronomical researchers worldwide, so ADS is the equivalent of about 5% of the working population of astronomers. The global astronomical research budget is estimated at between US$4,000 million and US$5,000 million, so the value of ADS to astronomy would be about US$200–250 million annually. Its operating budget is a small fraction of this amount. The great importance of ADS to astronomers has been recognized by the United Nations, the General Assembly of which has commended ADS on its work and success, particularly noting its importance to astronomers in the developing world, in reports of the United Nations Committee on the Peaceful Uses of Outer Space. A 2002 report by a visiting committee to the Center for Astrophysics, meanwhile, said that the service had "revolutionized the use of the astronomical literature", and was "probably the most valuable single contribution to astronomy research that the CfA has made in its lifetime".
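
The monetary estimate above is simple arithmetic, reproduced here as a short check. All figures come from the paragraph above; the variable and function names are illustrative only.

```python
RESEARCHERS_WORLDWIDE = 12_000
ADS_SHARE_PERCENT = 5                 # "about 5%" of working astronomers
BUDGET_RANGE_MUSD = (4_000, 5_000)    # global budget, millions of US dollars


def ads_value_musd(budget_musd: int) -> float:
    """Value of ADS in millions of US dollars for a given global budget."""
    return budget_musd * ADS_SHARE_PERCENT / 100


low, high = (ads_value_musd(b) for b in BUDGET_RANGE_MUSD)

# The two study figures expressed as a share of all researchers, in percent.
share_333 = round(100 * 333 / RESEARCHERS_WORLDWIDE, 1)
share_736 = round(100 * 736 / RESEARCHERS_WORLDWIDE, 1)
```

The two budget bounds give the 200 to 250 million range quoted above, and the two study figures correspond to roughly 2.8% and 6.1% of the 12,000 researchers, on either side of the rounded 5%.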


Sociological studies using ADS

Because it is used almost universally by astronomers, ADS can reveal much about how astronomical research is distributed around the world. Most users access the system from institutes of higher education, whose IP addresses can easily be used to determine the user's geographical location. Studies reveal that the highest per-capita users of ADS are astronomers based in France and the Netherlands, and while more developed countries (measured by GDP per capita) use the system more than less developed countries, the relationship between GDP per capita and ADS use is not linear. The range of ADS usage per capita far exceeds the range of GDPs per capita, and basic research carried out in a country, as measured by ADS usage, has been found to be proportional to the square of the country's GDP divided by its population. ADS usage statistics also suggest that astronomers in more developed countries tend to be more productive than those in less developed countries. The amount of basic research carried out is proportional to the number of astronomers in a country multiplied by the GDP per capita. Statistics also imply that astronomers in European cultures carry out about three times as much research as those in Asian cultures, perhaps suggesting cultural differences in the importance attached to astronomical research. ADS has also been used to show that the fraction of single-author astronomy papers has decreased substantially since 1975 and that astronomical papers with more than 50 authors have become more common since 1990.
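
The two proportionalities quoted above can be compared numerically. The sketch below is illustrative only: units and constants of proportionality are arbitrary, the function names are hypothetical, and the final check relies on an assumption that is not stated in the text, namely that the number of astronomers in a country scales with its GDP.

```python
def research_index(gdp: float, population: float) -> float:
    """Basic research proportional to GDP squared divided by population."""
    return gdp ** 2 / population


def research_from_astronomers(n_astronomers: float, gdp: float,
                              population: float) -> float:
    """Basic research proportional to astronomers times GDP per capita."""
    return n_astronomers * (gdp / population)


# Doubling GDP at fixed population quadruples the index,
# which is why usage per capita spans a wider range than GDP per capita.
ratio = research_index(2.0, 1.0) / research_index(1.0, 1.0)

# ASSUMPTION (not from the text): astronomers proportional to GDP, constant k.
k = 0.5
gdp, population = 8.0, 4.0
consistent = (research_from_astronomers(k * gdp, gdp, population)
              == k * research_index(gdp, population))
```

Under that assumption the two statements describe the same relationship, since astronomers multiplied by GDP per capita then reduces to a constant times GDP squared over population.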


See also

* List of academic databases and search engines
* Bibcode
* INSPIRE-HEP
* NASA/IPAC Extragalactic Database (NED)
* NASA Planetary Data System (PDS)
* PubMed
* SIMBAD
* Michael J. Kurtz


External links

* NASA ADS: Query Form (start your article search here)
* ADS help pages