In
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, a unit of observation is the unit described by the
data
Data ( , ) are a collection of discrete or continuous values that convey information, describing the quantity, quality, fact, statistics, other basic units of meaning, or simply sequences of symbols that may be further interpreted for ...
that one analyzes. A study may treat groups as a unit of observation with a country as the unit of analysis, drawing conclusions on group characteristics from data
collected at the national level. For example, in a study of the
demand for money, the unit of observation might be chosen as the individual, with different observations (data points) for a given point in time differing as to which individual they refer to; or the unit of observation might be the country, with different observations differing only in regard to the country they refer to.
Unit of observation vs unit of analysis
The unit of observation should not be confused with the
unit of analysis. A study may have a differing unit of observation and unit of analysis: for example, in
community
A community is a social unit (a group of people) with a shared socially-significant characteristic, such as place, set of norms, culture, religion, values, customs, or identity. Communities may share a sense of place situated in a given g ...
research, the
research design
Research design refers to the overall strategy utilized to answer research questions. A research design typically outlines the theories and models underlying a project; the research question(s) of a project; a strategy for gathering data and info ...
may collect data at the individual level of observation but the
level of analysis
Level of analysis is used in the social sciences to point to the location, size, or scale of a research target. It is distinct from unit of observation in that the former refers to a more or less integrated set of relationships while the latter re ...
might be at the neighborhood level, drawing conclusions on neighborhood characteristics from data collected from individuals. Together, the unit of observation and the
level of analysis
Level of analysis is used in the social sciences to point to the location, size, or scale of a research target. It is distinct from unit of observation in that the former refers to a more or less integrated set of relationships while the latter re ...
define the
population
Population is a set of humans or other organisms in a given region or area. Governments conduct a census to quantify the resident population size within a given jurisdiction. The term is also applied to non-human animals, microorganisms, and pl ...
of a research enterprise.
Data point
A data point or observation is a set of one or more
measurement
Measurement is the quantification of attributes of an object or event, which can be used to compare with other objects or events.
In other words, measurement is a process of determining how large or small a physical quantity is as compared to ...
s on a single member of the unit of observation. For example, in a study of the determinants of
money demand with the unit of observation being the individual, a data point might be the values of income, wealth, age of individual, and number of dependents.
Statistical inference
Statistical inference is the process of using data analysis to infer properties of an underlying probability distribution.Upton, G., Cook, I. (2008) ''Oxford Dictionary of Statistics'', OUP. . Inferential statistical analysis infers properties of ...
about the population would be conducted using a
statistical sample
In this statistics, quality assurance, and survey methodology, sampling is the selection of a subset or a statistical sample (termed sample for short) of individuals from within a statistical population to estimate characteristics of the whole ...
consisting of various such data points.
In addition, in
statistical graphics
Statistical graphics, also known as statistical graphical techniques, are graphics used in the field of statistics for data visualization.
Overview
Whereas statistics and data analysis procedures generally yield their output in numeric or tabul ...
, a "data point" may be an individual item with a statistical display; such points may relate to either a single member of a population or to a
summary statistic calculated for a given subpopulation.
Types of data
The measurements contained in a unit of observation are formally ''typed'', where here ''type'' is used in a way compatible with
datatype
In computer science and computer programming, a data type (or simply type) is a collection or grouping of data values, usually specified by a set of possible values, a set of allowed operations on these values, and/or a representation of these ...
in
computing
Computing is any goal-oriented activity requiring, benefiting from, or creating computer, computing machinery. It includes the study and experimentation of algorithmic processes, and the development of both computer hardware, hardware and softw ...
; so that the type of measurement can specify whether the measurement results in a
Boolean value from , an
integer
An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...
or
real number
In mathematics, a real number is a number that can be used to measure a continuous one- dimensional quantity such as a duration or temperature. Here, ''continuous'' means that pairs of values can have arbitrarily small differences. Every re ...
, the identity of some
category
Category, plural categories, may refer to:
General uses
*Classification, the general act of allocating things to classes/categories Philosophy
* Category of being
* ''Categories'' (Aristotle)
* Category (Kant)
* Categories (Peirce)
* Category ( ...
, or some
vector
Vector most often refers to:
* Euclidean vector, a quantity with a magnitude and a direction
* Disease vector, an agent that carries and transmits an infectious pathogen into another living organism
Vector may also refer to:
Mathematics a ...
or
array.
The implication of ''point'' is often that the data may be plotted in a graphic display, but in many cases the data are
processed numerically before that is done. In the context of
statistical graphics
Statistical graphics, also known as statistical graphical techniques, are graphics used in the field of statistics for data visualization.
Overview
Whereas statistics and data analysis procedures generally yield their output in numeric or tabul ...
, measured values for individuals or summary statistics for different subpopulations are displayed as separate symbols within a display; since such symbols can differ by shape, size and colour, a single ''data point'' within a display can convey multiple aspects of the set of measurements for an individual or subpopulation.
See also
*
Observation error
*
Sample point
References
{{DEFAULTSORT:Unit Of Observation
Statistical data types
Social research