Statistical Geography
   HOME

TheInfoList



OR:

Statistical geography is the study and practice of collecting, analysing and presenting data that has a geographic or areal dimension, such as census or demographics data. It uses techniques from
spatial analysis Spatial analysis or spatial statistics includes any of the formal techniques which studies entities using their topological, geometric, or geographic properties. Spatial analysis includes a variety of techniques, many still in their early deve ...
, but also encompasses geographical activities such as the defining and naming of geographical regions for statistical purposes. For example, for the purposes of statistical geography, the
Australian Bureau of Statistics The Australian Bureau of Statistics (ABS) is the independent statutory agency of the Australian Government responsible for statistical collection and analysis and for giving evidence-based advice to federal, state and territory governments ...
uses the Australian Standard Geographical Classification, a hierarchical regionalisation that divides
Australia Australia, officially the Commonwealth of Australia, is a Sovereign state, sovereign country comprising the mainland of the Australia (continent), Australian continent, the island of Tasmania, and numerous List of islands of Australia, sma ...
up into states and territories, then statistical divisions, statistical subdivisions, statistical local areas, and finally census collection districts.


Background

Geographer A geographer is a physical scientist, social scientist or humanist whose area of study is geography, the study of Earth's natural environment and human society, including how society and nature interacts. The Greek prefix "geo" means "earth" a ...
s study how and why elements differ from place to place, as well as how spatial patterns change through time. Geographers begin with the question 'Where?', exploring how features are distributed on a physical or cultural landscape, observing spatial patterns and the variation of phenomena. Contemporary geographical analysis has shifted to 'Why?', determining why a specific spatial pattern exists, what spatial or ecological processes may have affected a pattern, and why such processes operate. Only by approaching the 'why?' questions can social scientists begin to appreciate the mechanisms of change, which are infinite in their complexity.


Role of statistics in geography

Statistical techniques and procedures are applied in all fields of academic research; wherever data are collected and summarized or wherever any numerical information is analyzed or research is conducted, statistics are needed for sound analysis and interpretation of results. Geographers use statistics in numerous ways: * To describe and summarize spatial data. * To make generalizations concerning complex spatial patterns. * To estimate the probability of outcomes for an event at a given location. * To use samples of geographic data to infer characteristics for a larger set of geographic data (population). * To determine if the magnitude or frequency of some phenomenon differs from one location to another. * To learn whether an actual spatial pattern matches some expected pattern.


Spatial data and descriptive statistics

There are several potential difficulties associated with the analysis of spatial data, among these are boundary delineation, modifiable areal units, and the level of spatial aggregation or scale. In each of these cases, the absolute descriptive statistics of an area - the mean, median, mode, standard deviation, and variation - are changed through the manipulation of these spatial problems.


Boundary delineation

The location of a study area boundary and the positioning of internal boundaries affect various descriptive statistics. With respect to measures such as the mean or standard deviation, the study area size alone may have large implications; consider a study of per capita income within a city, if confined to the inner city, income levels are likely to be lower because of a less affluent population, if expanded to include the suburbs or surrounding communities, income levels will become greater with the influence of homeowner populations. Because of this problem, absolute descriptive statistics such as the mean, standard deviation, and variance should be evaluated comparatively only in relation to a particular study area. In the determination of internal boundaries this is also true, as these statistics may only have valid interpretations for the area and subarea configuration over which they are calculated.


Modifiable areal units

In many cases the subdivision of spatial data has already been determined, this is evident in demographic datasets, as the available information will be grouped into their respective counties or municipalities. For this type of data, analysts must use the same county or municipal boundaries delineated in the collected data for their subsequent analysis. When alternate boundaries are possible, an analyst must take into account that any new subdivision model may create different results.


Spatial aggregation/scale problem

Socio-economic data may be available at a variety of scales, for example: municipalities, regional districts, census tracts, enumeration districts, or at the provincial/state level. When this data is aggregated at different scales, the resulting descriptive statistics may exhibit variations, either in a systematic, predictable way, or in a more uncertain fashion. If we are observing economic data, we may notice a distinct reduction in manufacturing productivity for a country (the USA) over a certain period; since this is a general model, individual states may experience these effects differently. The result of this aggregation is that the standard deviation of the data in question is increased due to the variability among states.


Descriptive spatial statistics

For summarizing point pattern analysis, a set of descriptive spatial statistics has been developed that are areal equivalents to nonspatial measures. Since geographers are particularly concerned with the analysis of locational data, these descriptive spatial statistics (geostatistics) are often applied to summarize point patterns and to describe the degree of spatial variability of some phenomena.


Spatial measures of central tendency

An example here is the idea of a
center of population In demographics, the center of population (or population center) of a region is a geographical point that describes a centerpoint of the region's population. There are several ways of defining such a "center point", leading to different geogr ...
, of which a particular example is the
mean center of U.S. population The mean center of the United States population is determined by the United States Census Bureau from the results of each national census. The Bureau defines it as follows: After moving roughly west by south during the 19th century, the sh ...
. Several different ways of defining a center are available: *Mean center: The mean is an important measure of central tendency, which when extended to a set of points, located on a
Cartesian coordinate system A Cartesian coordinate system (, ) in a plane is a coordinate system that specifies each point uniquely by a pair of numerical coordinates, which are the signed distances to the point from two fixed perpendicular oriented lines, measured in t ...
, the average location,
centroid In mathematics and physics, the centroid, also known as geometric center or center of figure, of a plane figure or solid figure is the arithmetic mean position of all the points in the surface of the figure. The same definition extends to any ob ...
or mean center, can be determined. *The weighted mean center is analogous to frequencies in the calculation of grouped statistics, such as the weighted mean. A point may represent a retail outlet, while its frequency will represent the volume of sales within the particular store. *Median center or Euclidean center and in the
median center of United States population The median center of U.S. population is determined by the United States Census Bureau from the results of each census. The Bureau defines it to be: As of the 2020 U.S. census, this places roughly 165.7 million Americans living on each side of a ...
. This is related to the
Manhattan distance A taxicab geometry or a Manhattan geometry is a geometry whose usual distance function or Metric (mathematics), metric of Euclidean geometry is replaced by a new metric in which the distance between two points is the sum of the absolute differences ...
.


Spatial measures of dispersion

*Standard distance: Just as the
standard deviation In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the set, while ...
indicates how closely the values in a data set are clustered around the mean, so standard distance in a spatial distribution indicates how closely the points are clustered around the mean centre. *Relative distance


Topology

The motivating insight behind topology is that some geometric problems depend not on the exact shape of the objects involved, but rather on the "way they are connected together". One of the first papers in topology was the demonstration, by
Leonhard Euler Leonhard Euler ( , ; 15 April 170718 September 1783) was a Swiss mathematician, physicist, astronomer, geographer, logician and engineer who founded the studies of graph theory and topology and made pioneering and influential discoveries in ma ...
, that it was impossible to find a route through the town of Königsberg (now
Kaliningrad Kaliningrad ( ; rus, Калининград, p=kəlʲɪnʲɪnˈɡrat, links=y), until 1946 known as Königsberg (; rus, Кёнигсберг, Kyonigsberg, ˈkʲɵnʲɪɡzbɛrk; rus, Короле́вец, Korolevets), is the largest city and ...
) that would cross each of its seven bridges exactly once. This result did not depend on the lengths of the bridges, nor on their distance from one another, but only on connectivity properties: which bridges are connected to which islands or riverbanks. This problem, the ''
Seven Bridges of Königsberg The Seven Bridges of Königsberg is a historically notable problem in mathematics. Its negative resolution by Leonhard Euler in 1736 laid the foundations of graph theory and prefigured the idea of topology. The city of Königsberg in Prussia (n ...
'', is now a famous problem in introductory mathematics, and led to the branch of mathematics known as
graph theory In mathematics, graph theory is the study of ''graphs'', which are mathematical structures used to model pairwise relations between objects. A graph in this context is made up of '' vertices'' (also called ''nodes'' or ''points'') which are conne ...
.


Topology rules

Topology rules are particularly important within
GIS A geographic information system (GIS) is a type of database containing Geographic data and information, geographic data (that is, descriptions of phenomena for which location is relevant), combined with Geographic information system software, sof ...
, and are used for a variety of correction and analytical procedures. The primary shapes in GIS are the
point Point or points may refer to: Places * Point, Lewis, a peninsula in the Outer Hebrides, Scotland * Point, Texas, a city in Rains County, Texas, United States * Point, the NE tip and a ferry terminal of Lismore, Inner Hebrides, Scotland * Point ...
,
line Line most often refers to: * Line (geometry), object with zero thickness and curvature that stretches to infinity * Telephone line, a single-user circuit on a telephone communication system Line, lines, The Line, or LINE may also refer to: Arts ...
, and
polygon In geometry, a polygon () is a plane figure that is described by a finite number of straight line segments connected to form a closed ''polygonal chain'' (or ''polygonal circuit''). The bounded plane region, the bounding circuit, or the two toge ...
, each of which implies different spatial characteristics; for instance, the only shape which has a distinguishable inside and outside is the polygon. Principles of connectivity associated with topology lead to applications in
hydrology Hydrology () is the scientific study of the movement, distribution, and management of water on Earth and other planets, including the water cycle, water resources, and environmental watershed sustainability. A practitioner of hydrology is calle ...
,
urban planning Urban planning, also known as town planning, city planning, regional planning, or rural planning, is a technical and political process that is focused on the development and design of land use and the built environment, including air, water, ...
, and
logistics Logistics is generally the detailed organization and implementation of a complex operation. In a general business sense, logistics manages the flow of goods between the point of origin and the point of consumption to meet the requirements of ...
, as well as other fields; as such, topological analyses offer unique modelling capabilities, defining the vector nature of topological features and correcting spatial data errors from digitizing.


National examples


United Kingdom

Due to the devolved nature of the United Kingdom, responsibility for managing statistical geographies often falls to the National Statistical Institute with jurisdiction for that devolved administration. For England and Wales this is the
Office for National Statistics The Office for National Statistics (ONS; cy, Swyddfa Ystadegau Gwladol) is the executive office of the UK Statistics Authority, a non-ministerial department which reports directly to the UK Parliament. Overview The ONS is responsible for th ...
, for Scotland
National Records of Scotland National Records of Scotland ( gd, Clàran Nàiseanta na h-Alba) is a non-ministerial department of the Scottish Government. It is responsible for Civil registry, civil registration, the census in Scotland, demography and statistics, family histor ...
and for Northern Ireland the
Northern Ireland Statistics and Research Agency The Northern Ireland Statistics and Research Agency (NISRA, ga, Gníomhaireacht Thuaisceart Éireann um Staitisticí agus Taighde, links=no) is an executive agency within the Department of Finance (Northern Ireland), Department of Finance in No ...
.


England and Wales

The lowest form of statistical geography in England and Wales is the
Output Area ONS codes are geocodes maintained by the United Kingdom's Office for National Statistics to represent a wide range of geographical areas of the UK, for use in tabulating census and other statistical data. These codes are also known as GSS codes, wh ...
. These are small geographies of approximately 300 people and 100 households for which Census data is published. By containing roughly the same number of people and households it is possible to compare statistics for any two Output Areas in the country, and know that this is being done in a consistent way (unlike comparing statistics for Administrative geographies). The Output Areas form the smallest part of a hierarchy that consists of Output Areas, Lower Layer Super Output Areas and Middle Layer Super Output Areas. England and Wales also have a statistical geography designed specifically for the publication of workplace statistics. This is because Output Areas are built around residential populations and make analysing workplace statistics difficult. Workplace Zones have been released as part of the 2011 Census.


Scotland

Like England and Wales, the lowest level of statistical geography in Scotland is the Output Area. Scottish OAs are smaller than those for England and Wales because smaller thresholds are applied, but the methodology for their creation is broadly similar to that used by ONS. The higher levels are again similar to England and Wales but operate as Data Zones and Intermediate Zones rather than Lower and Middle Layer Super Output Areas. There are no Workplace Zones for Scotland.


See also

*
Geostatistics Geostatistics is a branch of statistics focusing on spatial or spatiotemporal datasets. Developed originally to predict probability distributions of ore grades for mining operations, it is currently applied in diverse disciplines including petro ...
*
Quantitative revolution The quantitative revolution (QR) was a paradigm shift that sought to develop a more rigorous and systematic methodology for the discipline of geography. It came as a response to the inadequacy of regional geography to explain general spatial dynam ...
*
Spatial analysis Spatial analysis or spatial statistics includes any of the formal techniques which studies entities using their topological, geometric, or geographic properties. Spatial analysis includes a variety of techniques, many still in their early deve ...


References

* * {{cite book , year = 1973 , last = Dickinson , first=G.C., title = Statistical mapping and the presentation of statistics, publisher = Edward Arnold , isbn= 0-7131-5641-4 Human geography Applied statistics Spatial analysis