Survey Sampling
   HOME

TheInfoList



OR:

In
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
, survey sampling describes the process of selecting a sample of elements from a target
population Population typically refers to the number of people in a single area, whether it be a city or town, region, country, continent, or the world. Governments typically quantify the size of the resident population within their jurisdiction using a ...
to conduct a survey. The term "
survey Survey may refer to: Statistics and human research * Statistical survey, a method for collecting quantitative information about items in a population * Survey (human research), including opinion polls Spatial measurement * Surveying, the techniq ...
" may refer to many different types or techniques of observation. In survey sampling it most often involves a questionnaire used to measure the characteristics and/or attitudes of people. Different ways of contacting members of a sample once they have been selected is the subject of
survey data collection With the application of probability sampling in the 1930s, surveys became a standard tool for empirical research in social sciences, marketing, and official statistics. The methods involved in survey data collection are any of a number of ways in ...
. The purpose of sampling is to reduce the cost and/or the amount of work that it would take to survey the entire target population. A survey that measures the entire target population is called a
census A census is the procedure of systematically acquiring, recording and calculating information about the members of a given population. This term is used mostly in connection with national population and housing censuses; other common censuses incl ...
. A sample refers to a group or section of a
population Population typically refers to the number of people in a single area, whether it be a city or town, region, country, continent, or the world. Governments typically quantify the size of the resident population within their jurisdiction using a ...
from which information is to be obtained Survey samples can be broadly divided into two types: probability samples and super samples. Probability-based samples implement a sampling plan with specified probabilities (perhaps adapted probabilities specified by an adaptive procedure). Probability-based sampling allows design-based inference about the target population. The inferences are based on a known objective probability distribution that was specified in the study protocol. Inferences from probability-based surveys may still suffer from many types of bias. Surveys that are not based on probability sampling have greater difficulty measuring their bias or
sampling error In statistics, sampling errors are incurred when the statistical characteristics of a population are estimated from a subset, or sample, of that population. Since the sample does not include all members of the population, statistics of the sample ( ...
. Surveys based on non-probability samples often fail to represent the people in the target population. In academic and government survey research, probability sampling is a standard procedure. In the United States, the
Office of Management and Budget The Office of Management and Budget (OMB) is the largest office within the Executive Office of the President of the United States (EOP). OMB's most prominent function is to produce the president's budget, but it also examines agency programs, pol ...
's "List of Standards for Statistical Surveys" states that federally funded surveys must be performed:
selecting samples using generally accepted statistical methods (e.g., probabilistic methods that can provide estimates of sampling error). Any use of nonprobability sampling methods (e.g., cut-off or model-based samples) must be justified statistically and be able to measure estimation error.
Random sampling and design-based inference are supplemented by other statistical methods, such as model-assisted sampling and model-based sampling. For example, many surveys have substantial amounts of nonresponse. Even though the units are initially chosen with known probabilities, the nonresponse mechanisms are unknown. For surveys with substantial nonresponse, statisticians have proposed statistical models with which the data sets are analyzed. Issues related to survey sampling are discussed in several sources, including Salant and Dillman (1994).


Probability sampling

In a probability sample (also called "scientific" or "random" sample) each member of the target population has a known and non-zero probability of inclusion in the sample. A survey based on a probability sample can in theory produce statistical measurements of the target population that are: *
unbiased Bias is a disproportionate weight ''in favor of'' or ''against'' an idea or thing, usually in a way that is closed-minded, prejudicial, or unfair. Biases can be innate or learned. People may develop biases for or against an individual, a group, ...
, the expected value of the sample mean is equal to the population mean E(ȳ)=μ, and * have a measurable sampling error, which can be expressed as a
confidence interval In frequentist statistics, a confidence interval (CI) is a range of estimates for an unknown parameter. A confidence interval is computed at a designated ''confidence level''; the 95% confidence level is most common, but other levels, such as 9 ...
, or
margin of error The margin of error is a statistic expressing the amount of random sampling error in the results of a survey. The larger the margin of error, the less confidence one should have that a poll result would reflect the result of a census of the e ...
. A probability-based survey sample is created by constructing a list of the target population, called the
sampling frame In statistics, a sampling frame is the source material or device from which a sample is drawn. It is a list of all those within a population who can be sampled, and may include individuals, households or institutions. Importance of the sampling fra ...
, a randomized process for selecting units from the sample frame, called a selection procedure, and a method of contacting selected units to enable them to complete the survey, called a data collection method or mode. For some target populations this process may be easy; for example, sampling the employees of a company by using payroll lists. However, in large, disorganized populations simply constructing a suitable sample frame is often a complex and expensive task. Common methods of conducting a probability sample of the household population in the United States are Area Probability Sampling, Random Digit Dial telephone sampling, and more recently, Address-Based Sampling. Within probability sampling, there are specialized techniques such as
stratified sampling In statistics, stratified sampling is a method of sampling from a population which can be partitioned into subpopulations. In statistical surveys, when subpopulations within an overall population vary, it could be advantageous to sample each s ...
and
cluster sampling In statistics, cluster sampling is a sampling plan used when mutually homogeneous yet internally heterogeneous groupings are evident in a statistical population. It is often used in marketing research. In this sampling plan, the total populat ...
that improve the precision or efficiency of the sampling process without altering the fundamental principles of probability sampling. Stratification is the process of dividing members of the population into homogeneous subgroups before sampling, based on auxiliary information about each sample unit. The strata should be mutually exclusive: every element in the population must be assigned to only one stratum. The strata should also be collectively exhaustive: no population element can be excluded. Then methods such as
simple random sampling In statistics, a simple random sample (or SRS) is a subset of individuals (a sample (statistics), sample) chosen from a larger Set (mathematics), set (a statistical population, population) in which a subset of individuals are chosen randomization, ...
or
systematic sampling In survey methodology, systematic sampling is a statistical method involving the selection of elements from an ordered sampling frame. The most common form of systematic sampling is an equiprobability method. In this approach, progression through ...
can be applied within each stratum. Stratification often improves the representativeness of the sample by reducing sampling error.


Bias in probability sampling

Bias in surveys is undesirable, but often unavoidable. The major types of bias that may occur in the sampling process are: *
Non-response bias Participation bias or non-response bias is a phenomenon in which the results of elections, studies, polls, etc. become non-representative because the participants disproportionately possess certain traits which affect the outcome. These traits mea ...
: When individuals or households selected in the survey sample cannot or will not complete the survey there is the potential for bias to result from this non-response. Nonresponse bias occurs when the observed value deviates from the population parameter due to differences between respondents and nonrespondents. *
Response bias Response bias is a general term for a wide range of tendencies for participants to respond inaccurately or falsely to questions. These biases are prevalent in research involving participant self-report, such as structured interviews or surveys. R ...
: This is not the opposite of non-response bias, but instead relates to a possible tendency of respondents to give inaccurate or untruthful answers for various reasons. * Selection Bias: Selection bias occurs when some units have a differing probability of selection that is unaccounted for by the researcher. For example, some households have multiple phone numbers making them more likely to be selected in a telephone survey than households with only one phone number. This selection bias would be corrected by applying a survey weight equal to /(# of phone numbers)to each household. *
Self-selection bias In statistics, self-selection bias arises in any situation in which individuals select themselves into a group, causing a biased sample with nonprobability sampling. It is commonly used to describe situations where the characteristics of the peop ...
: A type of bias in which individuals voluntarily select themselves into a group, thereby potentially biasing the response of that group. *
Participation bias Participation bias or non-response bias is a phenomenon in which the results of elections, studies, polls, etc. become non-representative because the participants disproportionately possess certain traits which affect the outcome. These traits me ...
: Bias that arises due to the characteristics of those who choose to participate in a survey or poll. * Coverage bias: Coverage bias can occur when population members do not appear in the sample frame (undercoverage). Coverage bias occurs when the observed value deviates from the population parameter due to differences between covered and non-covered units. Telephone surveys suffer from a well known source of coverage bias because they cannot include households without telephones.


Non-probability sampling

Many surveys are not based on probability samples, but rather on finding a suitable collection of respondents to complete the survey. Some common examples of non-probability sampling are: * Judgement Samples: A researcher decides which population members to include in the sample based on his or her judgement. The researcher may provide some alternative justification for the representativeness of the sample. The underlying assumption is that the investigator will select units that are characteristic of the population. This method can be subjected to researcher's biases and perception. * Snowball Samples: Often used when a target population is rare. Members of the target population recruit other members of the population for the survey. * Quota Samples: The sample is designed to include a designated number of people with certain specified characteristics. For example, 100 coffee drinkers. This type of sampling is common in non-probability market research surveys. * Convenience Samples: The sample is composed of whatever persons can be most easily accessed to fill out the survey. In non-probability samples the relationship between the target population and the survey sample is immeasurable and potential bias is unknowable. Sophisticated users of non-probability survey samples tend to view the survey as an experimental condition, rather than a tool for population measurement, and examine the results for internally consistent relationships.


See also

*
Sample size determination Sample size determination is the act of choosing the number of observations or replicates to include in a statistical sample. The sample size is an important feature of any empirical study in which the goal is to make inferences about a populati ...
*
Sampling (statistics) In statistics, quality assurance, and survey methodology, sampling is the selection of a subset (a statistical sample) of individuals from within a statistical population to estimate characteristics of the whole population. Statisticians attempt ...
*
Total survey error In survey sampling, total survey error includes all forms of survey error including sampling variability, interviewer effects, frame errors, response bias, and non-response bias. Total survey error is discussed in detail in many sources including ...


References


Further reading

The textbook by Groves et alia provides an overview of survey methodology, including recent literature on questionnaire development (informed by
cognitive psychology Cognitive psychology is the scientific study of mental processes such as attention, language use, memory, perception, problem solving, creativity, and reasoning. Cognitive psychology originated in the 1960s in a break from behaviorism, which ...
) : *
Robert Groves Robert Martin Groves (born September 27, 1948) is an American sociologist and expert in survey methodology who has served as the Provost of Georgetown University in Washington, D.C. since August 2012. He also served as the Director of the Unit ...
, et alia. ''Survey methodology'' (2010) Second edition of the (2004) first edition . The other books focus on the
statistical theory The theory of statistics provides a basis for the whole range of techniques, in both study design and data analysis, that are used within applications of statistics. The theory covers approaches to statistical-decision problems and to statistica ...
of survey sampling and require some knowledge of basic statistics, as discussed in the following textbooks: *
David S. Moore David Sheldon Moore is an American statistician, who is known for his leadership of statistics education for many decades. Biography David S. Moore received his A.B. from Princeton University and the Ph.D. from Cornell University in mathematics ...
and George P. McCabe (February 2005). "''Introduction to the practice of statistics''" (5th edition). W.H. Freeman & Company. . * The elementary book by Scheaffer et alia uses quadratic equations from high-school algebra: * Scheaffer, Richard L., William Mendenhal and R. Lyman Ott. ''Elementary survey sampling'', Fifth Edition. Belmont: Duxbury Press, 1996. More mathematical statistics is required for Lohr, for Särndal et alia, and for Cochran (classic): * * * The historically important books by Deming and Kish remain valuable for insights for social scientists (particularly about the U.S. census and the
Institute for Social Research The Institute for Social Research (german: Institut für Sozialforschung, IfS) is a research organization for sociology and continental philosophy, best known as the institutional home of the Frankfurt School and critical theory. Currently a pa ...
at the
University of Michigan , mottoeng = "Arts, Knowledge, Truth" , former_names = Catholepistemiad, or University of Michigania (1817–1821) , budget = $10.3 billion (2021) , endowment = $17 billion (2021)As o ...
): * * Kish, Leslie (1995) ''Survey Sampling'', Wiley,


External links


CRAN Task View Survey MethodologyWhat is a Survey?
Booklet published by National Opinion Research Center and The American Statistical Association
''Journal of Information Technology Learning and Performance'' article Organizational Research: Determining Sample Size in Survey ResearchSample Design and Confidence IntervalsSurvey Sampling Methods
{{Social surveys Sampling techniques Public opinion Survey methodology Mathematical and quantitative methods (economics)