HOME

TheInfoList



OR:

Social statistics is the use of
statistical Statistics (from German: ''Statistik'', "description of a state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a scientific, industria ...
measurement systems to study
human behavior Human behavior is the potential and expressed capacity ( mentally, physically, and socially) of human individuals or groups to respond to internal and external stimuli throughout their life. Kagan, Jerome, Marc H. Bornstein, and Richard M. L ...
in a social environment. This can be accomplished through
polling Poll, polled, or polling may refer to: Figurative head counts * Poll, a formal election ** Election verification exit poll, a survey taken to verify election counts ** Polling, voting to make decisions or determine opinions ** Polling places o ...
a group of people, evaluating a subset of data obtained about a group of people, or by observation and statistical analysis of a set of data that relates to people and their behaviors.


Statistics in the social sciences


History

Adolph Quetelet was a proponent of
social physics Social physics or sociophysics is a field of science which uses mathematical tools inspired by physics to understand the behavior of human crowds. In a modern commercial use, it can also refer to the analysis of social phenomena with big data. Soc ...
. In his book ''Physique sociale'' he presents distributions of human
height Height is measure of vertical distance, either vertical extent (how "tall" something or someone is) or vertical position (how "high" a point is). For example, "The height of that building is 50 m" or "The height of an airplane in-flight is abou ...
s,
age of marriage Marriageable age (or marriage age) is the general age, as a legal age or as the minimum age subject to parental, religious or other forms of social acceptance, social approval, at which a person is legitimately allowed for marriage. Age and othe ...
, time of birth and death,
time series In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. Exa ...
of human marriages, births and deaths, a survival density for humans and curve describing
fecundity Fecundity is defined in two ways; in human demography, it is the potential for reproduction of a recorded population as opposed to a sole organism, while in population biology, it is considered similar to fertility, the natural capability to pr ...
as a function of age. He also developed the
Quetelet Index Body mass index (BMI) is a value derived from the mass (Mass versus weight, weight) and height of a person. The BMI is defined as the human body weight, body mass divided by the square (algebra), square of the human height, body height, and is ...
.
Francis Ysidro Edgeworth Francis Ysidro Edgeworth (8 February 1845 – 13 February 1926) was an Anglo-Irish philosopher and political economist who made significant contributions to the methods of statistics during the 1880s. From 1891 onward, he was appointed the ...
published "On Methods of Ascertaining Variations in the Rate of Births, Deaths, and Marriages" in 1885 which uses squares of differences for studying fluctuations and
George Udny Yule George Udny Yule FRS (18 February 1871 – 26 June 1951), usually known as Udny Yule, was a British statistician, particularly known for the Yule distribution. Personal life Yule was born at Beech Hill, a house in Morham near Haddington, ...
published "On the
Correlation In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statistics ...
of total
Pauperism Pauperism (Lat. ''pauper'', poor) is poverty or generally the state of being poor, or particularly the condition of being a "pauper", i.e. receiving relief administered under the English Poor Laws. From this, pauperism can also be more generally ...
with Proportion of Out-Relief" in 1895. A numerical
calibration In measurement technology and metrology, calibration is the comparison of measurement values delivered by a device under test with those of a calibration standard of known accuracy. Such a standard could be another measurement device of know ...
for the fertility curve was given by
Karl Pearson Karl Pearson (; born Carl Pearson; 27 March 1857 – 27 April 1936) was an English mathematician and biostatistician. He has been credited with establishing the discipline of mathematical statistics. He founded the world's first university st ...
in 1897 in his "The Chances of Death, and Other Studies in Evolution" In this book Pearson also uses
standard deviation In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the set, while ...
,
correlation In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statistics ...
and
skewness In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined. For a unimodal d ...
for studying humans.
Vilfredo Pareto Vilfredo Federico Damaso Pareto ( , , , ; born Wilfried Fritz Pareto; 15 July 1848 – 19 August 1923) was an Italian polymath (civil engineer, sociologist, economist, political scientist, and philosopher). He made several important contribut ...
published his analysis of the
distribution of income In economics, income distribution covers how a country's total GDP is distributed amongst its population. Economic theory and economic policy have long seen income and its distribution as a central concern. Unequal distribution of income causes eco ...
in
Great Britain Great Britain is an island in the North Atlantic Ocean off the northwest coast of continental Europe. With an area of , it is the largest of the British Isles, the largest European island and the ninth-largest island in the world. It is ...
and
Ireland Ireland ( ; ga, Éire ; Ulster Scots dialect, Ulster-Scots: ) is an island in the Atlantic Ocean, North Atlantic Ocean, in Northwestern Europe, north-western Europe. It is separated from Great Britain to its east by the North Channel (Grea ...
in 1897, this is now known as the
Pareto principle The Pareto principle states that for many outcomes, roughly 80% of consequences come from 20% of causes (the "vital few"). Other names for this principle are the 80/20 rule, the law of the vital few, or the principle of factor sparsity. Manage ...
.
Louis Guttman Louis (Eliyahu) Guttman (February 10, 1916 – October 25, 1987; he, לואיס (אליהו) גוטמן) was an American sociologist and Professor of Social and Psychological Assessment at the Hebrew University of Jerusalem, known primarily for ...
proposed that the values of
ordinal variable Ordinal data is a categorical, statistical data type where the variables have natural, ordered categories and the distances between the categories are not known. These data exist on an ordinal scale, one of four levels of measurement described b ...
s can be represented by a Guttman scale, which is useful if the number of variables is large and allows the use of techniques such as
ordinary least squares In statistics, ordinary least squares (OLS) is a type of linear least squares method for choosing the unknown parameters in a linear regression model (with fixed level-one effects of a linear function of a set of explanatory variables) by the prin ...
.
Macroeconomic Macroeconomics (from the Greek prefix ''makro-'' meaning "large" + ''economics'') is a branch of economics dealing with performance, structure, behavior, and decision-making of an economy as a whole. For example, using interest rates, taxes, and ...
statistical research has provided
stylized facts In social sciences, especially economics, a stylized fact is a simplified presentation of an empirical finding. Stylized facts are broad tendencies that aim to summarize the data, offering essential truths while ignoring individual details. A prom ...
, which include: *
Bowley's law Bowley's law, also known as the law of the constant wage share, is a stylized fact of economics which states that the wage share of a country, i.e., the share of a country's economic output that is given to employees as compensation for their work ...
(1937) regarding the proportion between
wages A wage is payment made by an employer to an employee for work done in a specific period of time. Some examples of wage payments include compensatory payments such as ''minimum wage'', ''prevailing wage'', and ''yearly bonuses,'' and remuner ...
and national output * The
Phillips curve The Phillips curve is an economic model, named after William Phillips hypothesizing a correlation between reduction in unemployment and increased rates of wage rises within an economy. While Phillips himself did not state a linked relationship ...
(1958) regarding the relation between
wages A wage is payment made by an employer to an employee for work done in a specific period of time. Some examples of wage payments include compensatory payments such as ''minimum wage'', ''prevailing wage'', and ''yearly bonuses,'' and remuner ...
and
unemployment Unemployment, according to the OECD (Organisation for Economic Co-operation and Development), is people above a specified age (usually 15) not being in paid employment or self-employment but currently available for Work (human activity), w ...
Statistics and statistical analyses have become a key feature of social science: statistics is employed in
economics Economics () is the social science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and intera ...
,
psychology Psychology is the scientific study of mind and behavior. Psychology includes the study of conscious and unconscious phenomena, including feelings and thoughts. It is an academic discipline of immense scope, crossing the boundaries betwe ...
,
political science Political science is the scientific study of politics. It is a social science dealing with systems of governance and power, and the analysis of political activities, political thought, political behavior, and associated constitutions and la ...
,
sociology Sociology is a social science that focuses on society, human social behavior, patterns of Interpersonal ties, social relationships, social interaction, and aspects of culture associated with everyday life. It uses various methods of Empirical ...
and
anthropology Anthropology is the scientific study of humanity, concerned with human behavior, human biology, cultures, societies, and linguistics, in both the present and past, including past human species. Social anthropology studies patterns of behavi ...
.


Statistical methods in social sciences

Methods and concepts used in quantitative social sciences include: *
Research design Research design refers to the overall strategy utilized to carry out research that defines a succinct and logical plan to tackle established research question(s) through the collection, interpretation, analysis, and discussion of data. Incorporat ...
,
survey methodology Survey methodology is "the study of survey methods". As a field of applied statistics concentrating on human-research surveys, survey methodology studies the sampling of individual units from a population and associated techniques of survey da ...
and
survey sampling In statistics, survey sampling describes the process of selecting a sample of elements from a target population to conduct a survey. The term "survey" may refer to many different types or techniques of observation. In survey sampling it most often ...
*
Delphi method } The Delphi method or Delphi technique ( ; also known as Estimate-Talk-Estimate or ETE) is a structured communication technique or method, originally developed as a systematic, interactive forecasting method which relies on a panel of experts. The ...
Statistical techniques include:


Covariance based methods

*
Regression analysis In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships between a dependent variable (often called the 'outcome' or 'response' variable, or a 'label' in machine learning parlance) and one ...
*
Canonical correlation In statistics, canonical-correlation analysis (CCA), also called canonical variates analysis, is a way of inferring information from cross-covariance matrices. If we have two vectors ''X'' = (''X''1, ..., ''X'n'') and ''Y'' ...
*
Causal analysis Causal analysis is the field of experimental design and statistics pertaining to establishing cause and effect. Typically it involves establishing four elements: correlation, sequence in time (that is, causes must occur before their proposed effec ...
*
Multilevel models Multilevel models (also known as hierarchical linear models, linear mixed-effect model, mixed models, nested data models, random coefficient, random-effects models, random parameter models, or split-plot designs) are statistical models of param ...
*
Factor analysis Factor analysis is a statistical method used to describe variability among observed, correlated variables in terms of a potentially lower number of unobserved variables called factors. For example, it is possible that variations in six observed ...
*
Linear discriminant analysis Linear discriminant analysis (LDA), normal discriminant analysis (NDA), or discriminant function analysis is a generalization of Fisher's linear discriminant, a method used in statistics and other fields, to find a linear combination of features ...
* Path analysis * Structural Equation Modeling


Probability based methods

*
Probit In probability theory and statistics, the probit function is the quantile function associated with the standard normal distribution. It has applications in data analysis and machine learning, in particular exploratory statistical graphics and s ...
and
logit In statistics, the logit ( ) function is the quantile function associated with the standard logistic distribution. It has many uses in data analysis and machine learning, especially in data transformations. Mathematically, the logit is the ...
*
Item response theory In psychometrics, item response theory (IRT) (also known as latent trait theory, strong true score theory, or modern mental test theory) is a paradigm for the design, analysis, and scoring of tests, questionnaires, and similar instruments measuring ...
*
Bayesian statistics Bayesian statistics is a theory in the field of statistics based on the Bayesian interpretation of probability where probability expresses a ''degree of belief'' in an event. The degree of belief may be based on prior knowledge about the event, ...
*
Stochastic process In probability theory and related fields, a stochastic () or random process is a mathematical object usually defined as a family of random variables. Stochastic processes are widely used as mathematical models of systems and phenomena that appea ...
*
Latent class model In statistics, a latent class model (LCM) relates a set of observed (usually discrete) multivariate variables to a set of latent variables. It is a type of latent variable model. It is called a latent class model because the latent variable is dis ...


Distance based methods

*
Cluster analysis Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). It is a main task of ...
*
Multidimensional scaling Multidimensional scaling (MDS) is a means of visualizing the level of similarity of individual cases of a dataset. MDS is used to translate "information about the pairwise 'distances' among a set of n objects or individuals" into a configurati ...


Methods for categorical data

* Classification analysis *
Cohort analysis Cohort analysis is a kind of behavioral analytics that breaks the data in a data set into related groups before analysis. These groups, or cohorts, usually share common characteristics or experiences within a defined time-span. Cohort analysis allo ...


Usage and applications

Social scientists use social statistics for many purposes, including: * the
evaluation Evaluation is a systematic determination and assessment of a subject's merit, worth and significance, using criteria governed by a set of standards. It can assist an organization, program, design, project or any other intervention or initiative ...
of the quality of services available to a group or organization, * analyzing behaviors of groups of people in their environment and special situations, * determining the wants of people through statistical sampling * evaluation of wage expenditures and savings * preventing industrial diseases * prevention of industrial accidents *
labour dispute A labor dispute is a disagreement between an employer and employees regarding the terms of employment. This could include disputes regarding conditions of employment, fringe benefits, hours of work, tenure, and wages to be negotiated during ...
s, such as supporting the Anthracite Coal Strike Commission of 1902-1903 * supporting governments in times of peace and war


Reliability

The use of statistics has become so widespread in the social sciences that many universities such as
Harvard Harvard University is a private Ivy League research university in Cambridge, Massachusetts. Founded in 1636 as Harvard College and named for its first benefactor, the Puritan clergyman John Harvard, it is the oldest institution of higher le ...
, have developed institutes focusing on "quantitative social science." Harvard's Institute for Quantitative Social Science focuses mainly on fields like
political science Political science is the scientific study of politics. It is a social science dealing with systems of governance and power, and the analysis of political activities, political thought, political behavior, and associated constitutions and la ...
that incorporate the advanced causal statistical models that
Bayesian method Bayesian inference is a method of statistical inference in which Bayes' theorem is used to update the probability for a hypothesis as more evidence or information becomes available. Bayesian inference is an important technique in statistics, and e ...
s provide. However, some experts in causality feel that these claims of causal statistics are overstated.J. Pearl, Bayesianism and causality, or, why I am only a half-bayesian http://ftp.cs.ucla.edu/pub/stat_ser/r284-reprint.pdf There is a debate regarding the uses and value of statistical methods in social science, especially in
political science Political science is the scientific study of politics. It is a social science dealing with systems of governance and power, and the analysis of political activities, political thought, political behavior, and associated constitutions and la ...
, with some statisticians questioning practices such as
data dredging Data dredging (also known as data snooping or ''p''-hacking) is the misuse of data analysis to find patterns in data that can be presented as statistically significant, thus dramatically increasing and understating the risk of false positives. ...
that can lead to unreliable policy conclusions of political partisans who overestimate the interpretive power that non-robust statistical methods such as simple and multiple
linear regression In statistics, linear regression is a linear approach for modelling the relationship between a scalar response and one or more explanatory variables (also known as dependent and independent variables). The case of one explanatory variable is call ...
allow. Indeed, an important axiom that social scientists cite, but often forget, is that "
correlation does not imply causation The phrase "correlation does not imply causation" refers to the inability to legitimately deduce a cause-and-effect relationship between two events or variables solely on the basis of an observed association or correlation between them. The id ...
." For example, it appears widely accepted that the lower numbers of women in decision making positions in politics, business and science is good evidence of
gender discrimination Sexism is prejudice or discrimination based on one's sex or gender. Sexism can affect anyone, but it primarily affects women and girls.There is a clear and broad consensus among academic scholars in multiple fields that sexism refers primaril ...
. But where men suffer adverse statistical indicators such as greater imprisonment rates or a higher suicide rate, that is not usually accepted as evidence of
gender bias Sexism is prejudice or discrimination based on one's sex or gender. Sexism can affect anyone, but it primarily affects women and girls.There is a clear and broad consensus among academic scholars in multiple fields that sexism refers primaril ...
acting against them.


Further reading

* * * *Irvine, John, Miles, Ian, Evans, Jeff, (editors), "Demystifying Social Statistics ", London : Pluto Press, 1979. * *


References


External links

* ;Social science statistics centers
Center for Statistics and Social Sciences, University of WashingtonCenter for the Promotion of Research Involving Innovative Statistical Methodology, New York University, NYCentre for Research Methods, Faculty of Social Sciences, University of Helsinki, FinlandCornell Institute for Social and Economic ResearchHarvard Institute for Quantitative Social ScienceInter-University Consortium for Political and Social ResearchNational Centre for Research Methods, UKSocial Statistics Department, University of ManchesterSocial Statistics Division, School of Social Sciences, University of Southampton, UK
;Statistical databases for social science
Inter-University Consortium for Political and Social ResearchUN Statistics Division- Demographic and Social Statistics
* ttp://www.bls.gov US Bureau of Labor Statisticsbr>International Labour Organisation- LABORSTAUnionstats.com
{{Authority control