HOME

TheInfoList



OR:

Gerard A. "Gerry" Salton (8 March 1927 – 28 August 1995) was a professor of
computer science Computer science is the study of computation, information, and automation. Computer science spans Theoretical computer science, theoretical disciplines (such as algorithms, theory of computation, and information theory) to Applied science, ...
at
Cornell University Cornell University is a Private university, private Ivy League research university based in Ithaca, New York, United States. The university was co-founded by American philanthropist Ezra Cornell and historian and educator Andrew Dickson W ...
. Salton was perhaps the leading computer scientist working in the field of
information retrieval Information retrieval (IR) in computing and information science is the task of identifying and retrieving information system resources that are relevant to an Information needs, information need. The information need can be specified in the form ...
during his time, and "the father of Information Retrieval". His group at Cornell developed the SMART Information Retrieval System, which he initiated when he was at
Harvard Harvard University is a private Ivy League research university in Cambridge, Massachusetts, United States. Founded in 1636 and named for its first benefactor, the Puritan clergyman John Harvard, it is the oldest institution of higher lear ...
. It was the first system to use the now popular
vector space In mathematics and physics, a vector space (also called a linear space) is a set (mathematics), set whose elements, often called vector (mathematics and physics), ''vectors'', can be added together and multiplied ("scaled") by numbers called sc ...
model for information retrieval.


Education and career

Salton was born Gerhard Anton Sahlmann in Nuremberg, Germany. He came to the United States in 1947 and was naturalized in 1952. He received a Bachelor's (1950) and Master's (1952) degree in mathematics from
Brooklyn College Brooklyn College is a public university in Brooklyn in New York City, United States. It is part of the City University of New York system and enrolls nearly 14,000 students on a campus in the Midwood and Flatbush sections of Brooklyn as of fall ...
, and a Ph.D. from
Harvard Harvard University is a private Ivy League research university in Cambridge, Massachusetts, United States. Founded in 1636 and named for its first benefactor, the Puritan clergyman John Harvard, it is the oldest institution of higher lear ...
in
applied mathematics Applied mathematics is the application of mathematics, mathematical methods by different fields such as physics, engineering, medicine, biology, finance, business, computer science, and Industrial sector, industry. Thus, applied mathematics is a ...
in 1958, the last of
Howard Aiken Howard Hathaway Aiken (March 8, 1900 – March 14, 1973) was an American physicist and a list of pioneers in computer science, pioneer in computing. He was the original conceptual designer behind IBM's Harvard Mark I, the United States' first C ...
's doctoral students, and taught there until 1965, when he joined
Cornell University Cornell University is a Private university, private Ivy League research university based in Ithaca, New York, United States. The university was co-founded by American philanthropist Ezra Cornell and historian and educator Andrew Dickson W ...
and co-founded its department of Computer Science. Salton was perhaps most well known for developing the now widely used
vector space model Vector space model or term vector model is an algebraic model for representing text documents (or more generally, items) as vector space, vectors such that the distance between vectors represents the relevance between the documents. It is used in i ...
for Information Retrieval. In this model, both documents and queries are represented as vectors of term counts, and the similarity between a document and a query is given by the cosine between the term vector and the document vector. In this paper, he also introduced TF-IDF, or term-frequency-inverse-document frequency, a model in which the score of a term in a document is the ratio of the number of terms in that document divided by the frequency of the number of documents in which that term occurs. (The concept of inverse document frequency, a measure of specificity, had been introduced in 1972 by Karen Sparck-Jones.) Later in life, he became interested in automatic text summarization and analysis, as well as automatic hypertext generation. He published over 150 research articles and 5 books during his life.


Honors and awards

Salton was editor-in-chief of the
Communications of the ACM ''Communications of the ACM'' (''CACM'') is the monthly journal of the Association for Computing Machinery (ACM). History It was established in 1958, with Saul Rosen as its first managing editor. It is sent to all ACM members. Articles are i ...
and the
Journal of the ACM The ''Journal of the ACM'' (''JACM'') is a peer-reviewed scientific journal covering computer science in general, especially theoretical aspects. It is an official journal of the Association for Computing Machinery. Its current editor-in-chief is ...
, and chaired
Special Interest Group on Information Retrieval SIGIR is the Association for Computing Machinery's Special Interest Group on Information Retrieval. The scope of the group's specialty is the theory and application of computers to the acquisition, organization, storage, retrieval and distribut ...
(SIGIR). He was an associate editor of the ACM Transactions on Information Systems. He was an
ACM Fellow ACM Fellowship is an award and fellowship that recognises outstanding members of the Association for Computing Machinery (ACM). The title of ACM Fellow A fellow is a title and form of address for distinguished, learned, or skilled individuals ...
(elected 1995), received the Award of Merit from the American Society for Information Science (1989), and was the first recipient of the SIGIR Award for outstanding contributions to study of Information Retrieval (1983) -- now called the Gerard Salton Award.


Bibliography

*Salton, ''Automatic Information Organization and Retrieval'', 1968. * *--- and Michael J. McGill, ''Introduction to modern Information Retrieval'', 1983. * * * G. Salton, A. Wong, and C. S. Yang (1975),
A Vector Space Model for Automatic Indexing
" ''Communications of the ACM'', vol. 18, nr. 11, pages 613–620. ''(Article in which a vector space model was presented)'' *G. Salton. (1980). 'Toward a dynamic library." In F. Wilfrid Lancaster, ed.''The Role of the Library in an Electronic Society: Clinic on Library Applications of Data Processing. '' Urbana-Champaign: University of Illinois Graduate School of Library Science.


See also

*
List of pioneers in computer science This is a list of people who made transformative breakthroughs in the creation, development and imagining of what computers could do. Pioneers ~ Items marked with a tilde are circa dates. See also * Computer Pioneer Award * IEEE John von ...


References


External links


In Memoriam


* "The Most Influential Paper Gerard Salton Never Wrote." Dubin D. This 2004 ''Library Trends'' paper (2004;52(4):748-764) by David Dubin serves as a historical review of the metamorphosis of the term discrimination value model (TDV) into the vector space model as an information retrieval model (VSM as an IR model). This paper calls into question what the Information Retrieval research community believed Salton's vector space model was originally intended to model. What much later became an information retrieval model was originally a data-centric mathematical–computational model used as an explanatory device. In addition, Dubin's paper points out that a 1975 Salton paper oft cited does not exist but is probably a combination of two other papers, neither of which actually refers to the VSM as an IR model. {{DEFAULTSORT:Salton, Gerard 1927 births 1995 deaths American computer scientists Harvard John A. Paulson School of Engineering and Applied Sciences alumni Harvard University faculty Cornell University faculty 1995 fellows of the Association for Computing Machinery Information retrieval researchers Brooklyn College alumni Scientists from Nuremberg Deaths from lung cancer in New York (state)