Statistical graphics, also known as statistical graphical techniques, are graphics used in the field of statistics for data visualization.
Overview
Whereas statistics and data analysis procedures generally yield their output in numeric or tabular form, graphical techniques allow such results to be displayed in pictorial form. They include plots such as scatter plots, histograms, probability plots, spaghetti plots, residual plots, box plots, block plots and biplots.
Exploratory data analysis (EDA) relies heavily on such techniques. They can also provide insight into a data set to help with testing assumptions, model selection and regression model validation, estimator selection, relationship identification, factor effect determination, and outlier detection. In addition, the choice of appropriate statistical graphics can provide a convincing means of communicating the underlying message that is present in the data to others.
Graphical statistical methods have four objectives:
* Exploring the content of a data set
* Finding structure in data
* Checking assumptions in statistical models
* Communicating the results of an analysis
Without statistical graphics, an analyst forfeits insight into one or more aspects of the underlying structure of the data.
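As a rough illustration of the numbers that sit behind two of the plots named above, the sketch below computes the bin counts a histogram would draw and the quartiles and outlier fences a box plot would draw. It is a minimal sketch using only the Python standard library. The function names `histogram_counts` and `box_plot_summary`, the sample data, the equal-width binning, and the 1.5 × IQR fence rule are assumptions chosen for illustration; they are not taken from the article, and plotting libraries may bin or interpolate quartiles differently.

```python
from statistics import quantiles


def histogram_counts(values, bins):
    """Count how many values fall into each of `bins` equal-width intervals.

    Hypothetical helper: equal-width bins spanning min(values)..max(values),
    with the maximum value placed in the last bin.
    """
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        if width == 0:
            idx = 0  # all values identical: everything lands in the first bin
        else:
            idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return counts


def box_plot_summary(values):
    """Return the quartiles and outliers that a box plot would display.

    Assumption: outliers are points beyond 1.5 * IQR from the quartiles,
    one common convention among several.
    """
    q1, median, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    outliers = [v for v in values if v < lower_fence or v > upper_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "lower_fence": lower_fence,
        "upper_fence": upper_fence,
        "outliers": outliers,
    }


if __name__ == "__main__":
    # Made-up sample with one value far from the rest.
    data = [2, 3, 3, 4, 5, 5, 6, 7, 8, 30]
    print(histogram_counts(data, bins=4))
    print(box_plot_summary(data))
```

In this made-up sample, the value 30 sits alone in the last histogram bin and falls outside the upper fence, which is the kind of structure that a table of summary statistics can hide and a plot makes obvious.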
History
Statistical graphics have been central to the development of science and date to the earliest attempts to analyse data. Many familiar forms, including bivariate plots, statistical maps, bar charts, and coordinate paper, were used in the 18th century. Statistical graphics developed through attention to four problems:
[James R. Beniger and Dorothy L. Robyn (1978). "Quantitative graphics in statistics: A brief history". The American Statistician. 32: pp. 1–11.]
* Spatial organization in the 17th and 18th centuries
* Discrete comparison in the 18th and early 19th centuries
* Continuous distribution in the 19th century
* Multivariate distribution and correlation in the late 19th and 20th centuries
Since the 1970s statistical graphics have been re-emerging as an important analytic tool with the revitalisation of computer graphics and related technologies.
Examples

Famous graphics were designed by:
* William Playfair, who produced what could be called the first line, bar, pie, and area charts. For example, in 1786 he published the well-known diagram that depicts the evolution of England's imports and exports,
* James Watt and his employee John Southern, who around 1790 invented the steam indicator, a device for plotting pressure variations within a steam engine cylinder through its stroke,
* Florence Nightingale, who used statistical graphics to persuade the British Government to improve army hygiene,
* John Snow, who plotted deaths from cholera in London in 1854 to detect the source of the disease, and
* Charles Joseph Minard, who designed a large portfolio of maps, of which the one depicting Napoleon's campaign in Russia is the best known.
See the plots page for many more examples of statistical graphics.
See also
* Data Presentation Architecture
* List of graphical methods
* Visual inspection
* Chart
* List of charting software
External links
* Trend Compass
* DataScope, a website devoted to data visualization and statistical graphics