Jean-Paul Benzécri
   HOME

TheInfoList



OR:

Jean-Paul Benzécri was a
French French may refer to: * Something of, from, or related to France ** French language, which originated in France ** French people, a nation and ethnic group ** French cuisine, cooking traditions and practices Arts and media * The French (band), ...
mathematician A mathematician is someone who uses an extensive knowledge of mathematics in their work, typically to solve mathematical problems. Mathematicians are concerned with numbers, data, quantity, mathematical structure, structure, space, Mathematica ...
and
statistician A statistician is a person who works with Theory, theoretical or applied statistics. The profession exists in both the private sector, private and public sectors. It is common to combine statistical knowledge with expertise in other subjects, a ...
. He studied at
École Normale Supérieure École or Ecole may refer to: * an elementary school in the French educational stages normally followed by Secondary education in France, secondary education establishments (collège and lycée) * École (river), a tributary of the Seine flowing i ...
and was professor at Université de Rennes and later for most of his career at the Paris Institute of Statistics (l'Institut de Statistique de l'Université de Paris),
Université Pierre-et-Marie-Curie Pierre and Marie Curie University ( , UPMC), also known as Paris VI, was a public research university in Paris, France, from 1971 to 2017. The university was located on the Jussieu Campus in the Latin Quarter of the 5th arrondissement of Paris, ...
in
Paris Paris () is the Capital city, capital and List of communes in France with over 20,000 inhabitants, largest city of France. With an estimated population of 2,048,472 residents in January 2025 in an area of more than , Paris is the List of ci ...
. He is most known for his specific inductive approach to
data analysis Data analysis is the process of inspecting, Data cleansing, cleansing, Data transformation, transforming, and Data modeling, modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making. Da ...
which led to the creation of
Correspondence analysis Correspondence analysis (CA) is a multivariate statistical technique proposed by Herman Otto Hartley (Hirschfeld) and later developed by Jean-Paul Benzécri. It is conceptually similar to principal component analysis, but applies to categorical ...
, a statistical technique for analyzing
contingency table In statistics, a contingency table (also known as a cross tabulation or crosstab) is a type of table in a matrix format that displays the multivariate frequency distribution of the variables. They are heavily used in survey research, business int ...
s and for the invention of the
nearest-neighbor chain algorithm In the theory of cluster analysis, the nearest-neighbor chain algorithm is an algorithm that can speed up several methods for agglomerative hierarchical clustering. These are methods that take a collection of points as input, and create a hierarch ...
for
agglomerative hierarchical clustering In data mining and statistics, hierarchical clustering (also called hierarchical cluster analysis or HCA) is a method of cluster analysis that seeks to build a hierarchy of clusters. Strategies for hierarchical clustering generally fall into two ...
.


Early life

Jean-Paul Benzécri was born in
Oran, Algeria Oran () is a major coastal city located in the northwest of Algeria. It is considered the second most important city of Algeria, after the capital, Algiers, because of its population and commercial, industrial and cultural importance. It is w ...
, in 1932, where his father was a doctor. He attended high school in Lycée Lamoricière, Oran and Lycée Bugeaud, Alger. In 1950, he was first in the entrance examination to the ENS (
École Normale Supérieure École or Ecole may refer to: * an elementary school in the French educational stages normally followed by Secondary education in France, secondary education establishments (collège and lycée) * École (river), a tributary of the Seine flowing i ...
) in Paris and again in 1953 to the "Agrégation de Mathématiques", a national teacher's diploma examination. He then did some science research in mathematics. Leaving for the United States in 1955 for
Princeton University Princeton University is a private university, private Ivy League research university in Princeton, New Jersey, United States. Founded in 1746 in Elizabeth, New Jersey, Elizabeth as the College of New Jersey, Princeton is the List of Colonial ...
, after a 4 months study he submitted a Ph.D. thesis in differential geometry entitled ''Variété localement plates'' under the supervision of
Henri Cartan Henri Paul Cartan (; 8 July 1904 – 13 August 2008) was a French mathematician who made substantial contributions to algebraic topology. He was the son of the mathematician Élie Cartan, nephew of mathematician Anna Cartan, oldest brother of c ...
. From 1959 until 1960 he did conscripted military service in the Operational Research Group of the
French Navy The French Navy (, , ), informally (, ), is the Navy, maritime arm of the French Armed Forces and one of the four military service branches of History of France, France. It is among the largest and most powerful List of navies, naval forces i ...
where he practiced multidimensional data modeling by traditional analytical methods without the use of a computer. In 1960 he delivered a "Doctorat" at Sorbonne, Paris entitled ''Sur les variétés localement affines et localement projectives'' again under the supervision of
Henri Cartan Henri Paul Cartan (; 8 July 1904 – 13 August 2008) was a French mathematician who made substantial contributions to algebraic topology. He was the son of the mathematician Élie Cartan, nephew of mathematician Anna Cartan, oldest brother of c ...
.


Career

Benzécri's teaching career began in 1963 as an assistant professor at the Faculty of Sciences in
Rennes Rennes (; ; Gallo language, Gallo: ''Resnn''; ) is a city in the east of Brittany in Northwestern France at the confluence of the rivers Ille and Vilaine. Rennes is the prefecture of the Brittany (administrative region), Brittany Regions of F ...
where he created a course in
mathematical linguistics Mathematical linguistics is the application of mathematics to model phenomena and solve problems in general linguistics and theoretical linguistics. Mathematical linguistics has a significant amount of overlap with computational linguistic ...
. One of his first students was Brigitte Escofier-Cordier who published in 1965 a dissertation entitled ''Analyse Factorielle des Correspondances (
Correspondence analysis Correspondence analysis (CA) is a multivariate statistical technique proposed by Herman Otto Hartley (Hirschfeld) and later developed by Jean-Paul Benzécri. It is conceptually similar to principal component analysis, but applies to categorical ...
)'' with application to textual data. In 1965, Benzécri became professor at the Sorbonne and founded the Laboratoire de Statistique inside the Paris Institute of Statistics. His initial course in "Analyse des Données" evolved into a full scale MS-PhD program which was the basis of his research activity.


Research

Since his early work in 1963 on
Natural Language Processing Natural language processing (NLP) is a subfield of computer science and especially artificial intelligence. It is primarily concerned with providing computers with the ability to process data encoded in natural language and is thus closely related ...
(NLP), Benzécri got the intuition that electronic computing was going to be the ''Novius Organum'' (i.e., the new tool) enabling to solve the problem cooperatively between mathematics, logic and linguistics. Inspired by the pionneering works of
Louis Guttman Louis Guttman (; February 10, 1916 – October 25, 1987) was an American sociologist and Professor of Social and Psychological Assessment at the Hebrew University of Jerusalem, known primarily for his work in social statistics. Biography Louis ( ...
and Chikio Hayashi as well as by the distributional methodology of
Zellig Harris Zellig Sabbettai Harris (; October 23, 1909 – May 22, 1992) was an influential American linguist, mathematical syntactician, and methodologist of science. Originally a Semiticist, he is best known for his work in structural linguistics and di ...
, he devised a geometric equivalence to these approaches by searching the principal axes of inertia of a weighted cloud of points. These algorithms were the primary building blocks of a method which he later called "
Correspondence analysis Correspondence analysis (CA) is a multivariate statistical technique proposed by Herman Otto Hartley (Hirschfeld) and later developed by Jean-Paul Benzécri. It is conceptually similar to principal component analysis, but applies to categorical ...
". Developing correspondence analysis with the systematic supplement of clustering techniques, his interest went to analysing both large contingency and binary tables and some other kinds of data arrays after suitable transformation including lexical tables derived from raw texts. Favouring induction over hypothesis testing, much of his approach lies in describing and understanding how a multidimensional dataset diverges from the hypothesis of independence of its rows and columns through the interpretation of patterns often revealed by point cloud graphic displays. But he was also opened to reintroduce a new statistical framework into this purely exploratory process by deriving an ''a posteriori'' projection of supplementary variables (i.e. columns) and individuals (i.e. rows). His early familiarity with computers and their programming languages lead him to adopt tensor notations and quasi
ALGOL ALGOL (; short for "Algorithmic Language") is a family of imperative computer programming languages originally developed in 1958. ALGOL heavily influenced many other languages and was the standard method for algorithm description used by the ...
-like algorithmic formulas in his course texts as early as 1967. This facilitated the transcription of his concepts by his fellow colleagues and students to computer programs in a wide range of languages, the latest being a wide variety on implementations in R language such as FactoMineR. Benzecri's tensor notations were precursors to the latest developments of
tensor calculus In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects associated with a vector space. Tensors may map between different objects such as vectors, scalars, and even other ...
for machine learning (for example,
TensorFlow TensorFlow is a Library (computing), software library for machine learning and artificial intelligence. It can be used across a range of tasks, but is used mainly for Types of artificial neural networks#Training, training and Statistical infer ...
). In the field of clustering methods, Benzécri (1982) also proposed a new algorithm (nearest-neighbor chain algorithm) for agglomerative hierarchical clustering.


Selected publications

* ''L'Analyse des données''. Tome 1 : ''La Taxinomie'', Dunod, 1973, 615 p. * ''L'Analyse des données''. Tome 2 : ''L'Analyse des correspondances'', Dunod, 1973, 619 p. * ''Histoire et préhistoire de l'analyse des données'', Dunod, 1982, 159 p. * ''L'Analyse des données / leçons sur l'analyse factorielle et la reconnaissance des formes et travaux'', ** Vol. 1 : ''L'Analyse des correspondances'', Dunod, 1982, 635 p. ** Vol. 2 : ''La Taxinomie'', Dunod, 1982, 632 p. * ''Pratique de l'analyse des données'', ** Tome I : ''Analyse des correspondances, exposé élémentaire'', Dunod, 1980, ** Tome II : ''Abrégé théorique, études de cas modèles'', Dunod, 1980, 466 p. ** Tome III : ''Linguistique et lexicologie'', Dunod, 1981, 565 p. ** Tome IV : ''En médecine, pharmacologie, physiologie clinique'', Statmatic, Paris, 199, 532 p. ** Tome V : ''Pratique de l'analyse des données en économie'', Dunod, 1987, 533 p. * ''Les cahiers de l'analyse des données'', Gauthier-Villars, Dunod, 1976–1997 * ''Linguistique et lexicologie'', Dunod, 2007 é-édition Only one manual was published in English under the direct supervision of Benzécri near the end of his university career. * ''Correspondence analysis handbook'',
Marcel Dekker Marcel Dekker was a journal and encyclopedia publishing company with editorial boards found in New York City. Dekker encyclopedias are now published by CRC Press, part of the Taylor and Francis publishing group. History Initially a textbook publ ...
(1992), 665 p.


References


External links

*

-
Library of Congress The Library of Congress (LOC) is a research library in Washington, D.C., serving as the library and research service for the United States Congress and the ''de facto'' national library of the United States. It also administers Copyright law o ...


-
WorldCat WorldCat is a union catalog that itemizes the collections of tens of thousands of institutions (mostly libraries), in many countries, that are current or past members of the OCLC global cooperative. It is operated by OCLC, Inc. Many of the O ...
ID

-
OCLC OCLC, Inc. See also: is an American nonprofit cooperative organization "that provides shared technology services, original research, and community programs for its membership and the library community at large". It was founded in 1967 as the ...
- VIAF (Virtual International Authority File) {{DEFAULTSORT:Benzecri, Jean-Paul 1932 births 2019 deaths French statisticians 20th-century French mathematicians 21st-century French mathematicians People from Oran École Normale Supérieure alumni Academic staff of the University of Rennes Academic staff of Pierre and Marie Curie University Paris-Sorbonne University alumni Computational linguistics researchers Data miners