Martin Kay (1935 – 8 August 2021) was a computer scientist, known especially for his work in computational linguistics. Born and raised in the United Kingdom, he received his M.A. from Trinity College, Cambridge, in 1961. In 1958 he started to work at the Cambridge Language Research Unit, one of the earliest centres for research in what is now known as computational linguistics. In 1961, he moved to the RAND Corporation in Santa Monica, California, US, where he eventually became head of research in linguistics and machine translation. He left RAND in 1972 to become Chair of the Department of Computer Science at the University of California, Irvine. In 1974, he moved to the Xerox Palo Alto Research Center as a Research Fellow. In 1985, while retaining his position at Xerox PARC, he joined the faculty of Stanford University half-time. He was most recently Professor of Linguistics at Stanford University and Honorary Professor of Computational Linguistics at Saarland University.

Life

Kay was born in Edgware (Middlesex, Great Britain) in 1935 and studied linguistics and computational linguistics at Trinity College, Cambridge. His main interests were translation, both by people and by machines, and computational linguistic algorithms, especially in the fields of morphology and syntax.

Work

Kay began his career at the Cambridge Language Research Unit in Cambridge, England, under Margaret Masterman. In 1961 David G. Hays hired him to work for the RAND Corporation; he subsequently worked for the University of California, Irvine and Xerox PARC. Kay was one of the pioneers of computational linguistics and machine translation. He was responsible for introducing the notion of chart parsing in computational linguistics, and the notion of unification in linguistics generally. With Ron Kaplan, he pioneered research and application development in finite-state morphology. He was a longtime contributor to, and critic of, work on machine translation. In his seminal paper "The Proper Place of Men and Machines in Language Translation", Kay argued for MT systems that were tightly integrated in the human translation process. He was a reviewer and critic of EUROTRA, Verbmobil, and many other MT projects.

Kay served as Chair of the Association for Computational Linguistics and as President of the International Committee on Computational Linguistics. He was a Research Fellow at Xerox PARC until 2002. He held an honorary doctorate from Gothenburg University. In 2005 Kay received the Lifetime Achievement Award of the Association for Computational Linguistics for his sustained role as an intellectual leader of NLP research.

Achievements and honours

* His achievements included the development of chart parsing and functional unification grammar, and major contributions to the application of finite state automata in computational phonology and morphology. He was also regarded as a leading authority on machine translation.
* His honours included an honorary Doctor of Philosophy from Gothenburg University and the 2005 Association for Computational Linguistics Lifetime Achievement Award. He was the permanent chairman of the International Committee on Computational Linguistics.

Contributions

1. Martin Kay's "proper" paper

The ALPAC report of 1966 concluded: "There is no immediate or predictable prospect of useful MT producing useful translation of general scientific texts." As a result, the field of machine translation entered a dark period; for almost ten years, from 1966 to 1976, little research was done. In the 1980s, however, a renaissance began, and "The Proper Place of Men and Machines in Language Translation" drew renewed attention to machine translation. The paper offered new thinking about the relationship between machine translation and human translation. At that time, as computers became cheaper and machine translation was applied in more domains, high-quality output was badly needed. Yet Fully Automatic High Quality Translation remained only an ideal for machine translation after the criticisms made by Bar-Hillel in his 1960s review of MT progress: "The goal of MT should not be the fully automatic high quality translation (FAHQT) that can replace human translators. Instead, MT should adopt less ambitious goals, e.g. more cost-effective human-machine interaction and aim at enhancement of human translation productivity." The usefulness of human translation was thus raised to a new level. Building on this thought, Kay proposed a more practical view of the relationship between human and machine in the translation process, which he called the "translator's amanuensis".

1.1 Two arguments against the usefulness of machine translation

Because this idea involves the human and the machine at the same time, both computer scientists and linguists have responsibilities towards MT. But "they should never be asked to provide an engineering solution to a problem that they only dimly understand." Instead, they should proceed "by doing only what can be done with absolute surety and reliability …can be virtually guaranteed to all concerned."
From the point of view of linguistics and computer science, there are two related arguments against the plausibility of machine translation as an industrial enterprise. Two arguments are commonly made for ad hoc solutions to the problems of machine translation. According to the first, "Ad hoc solutions tend to be based on case-by-case analyses of what linguists call surface phenomena, essentially strings of words, and on real or imagined statistical properties of particular styles of writing and domains of discourse." This is a simple statistical claim that can be dismissed. The second argument, which only alludes to understanding the second language by reading text, is called the sorcerer's apprentice argument, because "this kind of argument is to the effect that the kind of incomplete theory that linguists and computer scientists have been able to provide is often a worse base on which to build practical devices than no theory at all because the theory does not know when to stop." "The main problem with the sorcerer's-apprentice argument is that the decision that a sentence could be translated without analysis can only be made after the fact. Example sentence shows that there is more than one interpretation of a sentence at some level and further analysis shows that there is a single translation that is compatible with each of them. In short, the algorithm required to decide when analysis is required would have to use the results of the very analysis it is designed to avoid."

1.2 The Translator's Amanuensis and translation memory

This is the main part of the paper. To illustrate what the translator's amanuensis is, the author discussed three aspects: text editing, translation aids, and machine translation. "Suppose that the translators are provided with a terminal consisting of a keyboard, a screen, and some way of pointing at individual words and letters. The display on the screen is divided into two windows.
The text to be translated appears in the upper window and the translation will be composed in the bottom one." This is the form of the translator's amanuensis, which is not a real device and never will be. "Both windows behave in the same way. Using the pointing device, the translator can select a letter, word, sentence, line, or paragraph and, by pressing the appropriate key, cause some operation to be visited upon it." Two figures in the paper show the translation process from the initial display to selection. The device is not as simple as these two figures suggest; it can offer the translator more specialised services.

On translation aids, the author showed a third figure: "A relatively trivial addition would be a dictionary. The translator selects a word or sequence of words and gives a command to cause them to be looked up…This new window gives the effect of overlaying some portion of the windows already present. In this case, the new window contains a deceptively simple dictionary entry for the selected word." The device also has many other features. For example, simple dictionary entries, syntactic and semantic information about a word shown when the translator points to a symbol, modifiable dictionary entries, and temporary amendments all make the device more practical.

Machine translation is then explained. "One of the options that should be offered to a user of the hypothetical system I have been describing, at a fairly early stage, be a command that will direct the program to translate the currently selected unit. What will happen when this command is given will be different at different stages of the system's development. But a user of the system will always be empowered to intervene in the translation process to the extent that he himself specifies. If he elects not to intervene at all, a piece of text purporting to translate the current unit will be displayed in the lower window of his screen.
He will be able to edit this in any way he likes, just as post-editors have done in the past. Alternatively, he may ask to be consulted whenever the program is confronted with a decision of a specified type, when certain kinds of ambiguities are detected, or whatever. On these occasions, the system will put a question to the human translator. He may, for example, ask to be consulted on questions of pronominal reference."

In this part, the idea of translation memory is presented as a dictionary operation. "Suppose, for example, that a word is put in the local store – that part of the dictionary that persists only as long as this document is being worked on – if it occurs in the text significantly more frequently than statistics stored in the main dictionary indicate. A phrase will be noted if it occurs two or three times but is not recognized as an idiom or set phrase by the dictionary. By examining the contents of this store before embarking on the translation, a user may hope to get a preview of the difficulties ahead and to make some decisions in advance about how to treat them. These decisions, of course, will be recorded in the store itself. In the course of doing this or, indeed, for any reason whatever, the translator can call for a display of all the units in the text that contain a certain word, phrase, string of characters, or whatever. After all, the most important reference to have when translating a text is the text itself. If the piece of text to be translated next is anything but entirely straightforward, the translator might start by issuing a command causing the system to display anything in the store that might be relevant to it. This will bring to his attention decisions he made before the actual translation started, statistically significant words and phrases, and a record of anything that had attracted attention when it occurred before. Before going on, he can examine past and future fragments of text that contain similar material."
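
The local store described in the passage above can be illustrated with a small sketch. This is not Kay's design, since the paper describes a hypothetical system rather than code. The function names `build_local_store` and `units_containing`, the thresholds `ratio`, `floor` and `min_count`, the restriction to two-word phrases, and the toy frequency table are all assumptions made purely for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_local_store(text, main_freq, set_phrases,
                      ratio=3.0, floor=0.001, min_count=2):
    """Collect words and phrases worth the translator's attention.

    main_freq   -- hypothetical main dictionary: word -> relative frequency
    set_phrases -- idioms / set phrases the main dictionary already knows
    ratio       -- assumed meaning of "significantly more frequently"
    floor       -- assumed relative frequency for words missing from main_freq
    """
    tokens = tokenize(text)
    total = len(tokens)
    word_counts = Counter(tokens)

    # A word is stored if it recurs and is far more frequent in this
    # document than the main dictionary's statistics would predict.
    words = {
        w for w, c in word_counts.items()
        if c >= min_count and c / total > ratio * main_freq.get(w, floor)
    }

    # A phrase (here simply a pair of adjacent words, ignoring sentence
    # boundaries) is noted if it recurs but is not a known set phrase.
    pair_counts = Counter(zip(tokens, tokens[1:]))
    phrases = {
        " ".join(pair) for pair, c in pair_counts.items()
        if c >= min_count and " ".join(pair) not in set_phrases
    }
    return {"words": words, "phrases": phrases}


def units_containing(text, query):
    """Return the sentences of `text` that contain `query`, ignoring case."""
    units = [u.strip() for u in re.split(r"(?<=[.!?])\s+", text) if u.strip()]
    return [u for u in units if query.lower() in u.lower()]


if __name__ == "__main__":
    # Toy document and made-up dictionary statistics.
    text = ("The valve seat must be clean. Check each valve seat of course. "
            "Replace a valve of course.")
    main_freq = {"of": 0.05, "course": 0.05, "valve": 0.0001, "seat": 0.0005}
    store = build_local_store(text, main_freq, set_phrases={"of course"})
    print(sorted(store["words"]))    # ['seat', 'valve']
    print(sorted(store["phrases"]))  # ['valve seat']
    print(units_containing(text, "valve seat"))
```

In this toy run, "of course" recurs but is skipped because the dictionary already lists it as a set phrase, while "valve seat" is noted for the translator. The `units_containing` helper corresponds to the command that displays all units in the text containing a given word or phrase.
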
1.3 Expectation of the better performance of the translator's amanuensis

At the end of the paper, Kay gave some reasons to expect better performance from this device. First, the system is in a position to draw its human collaborator's attention to the matters most likely to need it. Second, the decisions that have to be made in the course of translating a passage are rarely independent. Third, one of the most important facilities in the system is the one that keeps track of words and phrases that are used in some special way in the current text.

* "A Life of Language". A speech given in acknowledgement of the Lifetime Achievement Award at the 43rd Annual Meeting of the Association for Computational Linguistics, Ann Arbor, Michigan, 27 June 2005. http://www.stanford.edu/~mjkay/LifeOfLanguage.pdf
* Substring Alignment Using Suffix Trees. A paper about the possible use of suffix trees for aligning texts and their translations. http://www.stanford.edu/~mjkay/CYCLING.pdf
* Some unfinished musings on the nature of translation. http://www.stanford.edu/~mjkay/CurrentState.pdf
* Some half-baked thoughts on language models in statistical NLP on which I need some help. http://www.stanford.edu/~mjkay/language_models.pdf
* His 1994 paper with Ronald Kaplan, "Regular Models of Phonological Rule Systems". Computational Linguistics 20(3):331–378. http://www.stanford.edu/~mjkay/Kaplan%26Kay.pdf

Books

* ''Linguistics and Information Science'' (with Karen Spärck Jones), Academic Press, 1973.
* ''Natural Language in Information Science'' (edited with D. E. Walker and Hans Karlgren), Skriptor, Stockholm, 1977.
* ''Verbmobil: A Translation System for Face-to-Face Dialog'' (with Jean Mark Gawron and Peter Norvig), CSLI, Stanford, California, 1994.
* ''An Introduction to Machine Translation''. W. John Hutchins and Harold L. Somers. London: Academic Press, 1992.
* ''Handbook of Computational Linguistics''. Ruslan Mitkov (ed.). Oxford University Press, 2003. (Introduction.)

Selected papers

* "Rules of Interpretation: An Approach to the Problem of Computation in the Semantics of Natural Language", in Proceedings of the Second International Congress of the International Federation for Information Processing, 1962.
* "A Parsing Procedure", Proceedings of the Second International Congress of the International Federation for Information Processing, 1962.
* "A General Procedure for Rewriting Strings", paper presented at the annual meeting of the Association for Machine Translation and Computational Linguistics, Bloomington, Indiana, 1964.
* The Logic of Cognate Recognition in Historical Linguistics, RM-4224-PR, Santa Monica, The RAND Corporation, July 1964.
* A Parsing Program for Categorial Grammars, RM-4283-PR, Santa Monica, The RAND Corporation, August 1964.
* The Tabular Parser: A Parsing Program for Phrase-Structure and Dependency, RM-4933-PR, Santa Monica, The RAND Corporation, July 1966.
* The Computer System to Aid the Linguistic Field Worker, P-4095, Santa Monica, The RAND Corporation, May 1969.
* The MIND System: The Morphological Analysis Program (with Gary R. Martins), RM-6265/2-PR, Santa Monica, The RAND Corporation, April 1970.
* "Automatic Translation of Natural Languages", in Language as a Human Problem: Daedalus, 1973.
* "Functional Unification Grammar: A Formalism for Machine Translation", in Proceedings of the International Conference on Computational Linguistics (COLING 84), The Association for Computational Linguistics, 1984.
* "Parsing in Free Word Order Languages" (with Lauri Karttunen), in Dowty, David R., Lauri Karttunen, and Arnold M. Zwicky, Natural Language Parsing, Cambridge University Press, 1985.
* "Unification in Grammar", in Dahl, V., and P. Saint-Dizier, Natural Language Understanding and Logic Programming, North Holland, 1985.
* "Theoretical Issues in the Design of a Translator's Work Station", Proceedings of the IBM workshop on Computers and Translation, Copenhagen.
* "Regular Models of Phonological Rule Systems" (with R. M. Kaplan), Computational Linguistics 20:3, September 1994.
* "Substring Alignment Using Suffix Trees", Computational Linguistics and Intelligent Text Processing, Springer, Lecture Notes in Computer Science, 2004.

Course readings

* Disjunctive Unification http://www.stanford.edu/~mjkay/DisjunctiveUnification.pdf
* Functional Uncertainty http://www.stanford.edu/~mjkay/FunctionalUncertainty.pdf
* HPSG1 http://www.stanford.edu/~mjkay/pollard-foundations.pdf
* HPSG2 http://www.stanford.edu/~mjkay/levine03.pdf
* HPSG Generation http://www.stanford.edu/~mjkay/Shieber.pdf
* CCG http://www.stanford.edu/~mjkay/Steedman%26Baldridge.pdf
* Typed Features http://www.stanford.edu/~mjkay/Copestake.pdf
* Dependency http://www.stanford.edu/~mjkay/covington.pdf

Awards

* He held an honorary professorship at the University of the Saarland and honorary doctorates from the universities of Gothenburg and Geneva.
* He also won the 2005 ACL Lifetime Achievement Award. His acceptance speech was entitled "A Life of Language".

References

External links

* Stanford home page
* University of Saarland home page
* ACL Lifetime Achievement Award citation
* "A Life of Language": ACL Lifetime Award Acceptance Speech, in which Martin Kay outlines his work in Computational Linguistics (13 pages)
* Lecture announcement with biographical note
* An interview (video and audio) with Martin Kay at the Oxford Internet Institute, June 18, 2009