A cohort study is a particular form of
longitudinal study
A longitudinal study (or longitudinal survey, or panel study) is a research design that involves repeated observations of the same variables (e.g., people) over short or long periods of time (i.e., uses longitudinal data). It is often a type of obs ...
that samples a
cohort
Cohort or cohortes may refer to:
* Cohort (educational group), a group of students working together through the same academic curriculum
* Cohort (floating point), a set of different encodings of the same numerical value
* Cohort (military unit ...
(a group of people who share a defining characteristic, typically those who experienced a common event in a selected period, such as birth or graduation), performing a
cross-section at intervals through time. It is a type of
panel study
A longitudinal study (or longitudinal survey, or panel study) is a research design that involves repeated observations of the same variables (e.g., people) over short or long periods of time (i.e., uses longitudinal data). It is often a type of obs ...
where the individuals in the panel share a common characteristic.
Cohort studies represent one of the fundamental designs of
epidemiology
Epidemiology is the study and analysis of the distribution (who, when, and where), patterns and determinants of health and disease conditions in a defined population.
It is a cornerstone of public health, and shapes policy decisions and evidenc ...
which are used in research in the fields of
medicine
Medicine is the science and practice of caring for a patient, managing the diagnosis, prognosis, prevention, treatment, palliation of their injury or disease, and promoting their health. Medicine encompasses a variety of health care pract ...
,
pharmacy
Pharmacy is the science and practice of discovering, producing, preparing, dispensing, reviewing and monitoring medications, aiming to ensure the safe, effective, and affordable use of medicines. It is a miscellaneous science as it links heal ...
,
nursing
Nursing is a profession within the health care sector focused on the care of individuals, families, and communities so they may attain, maintain, or recover optimal health and quality of life. Nurses may be differentiated from other health ...
,
psychology
Psychology is the scientific study of mind and behavior. Psychology includes the study of conscious and unconscious phenomena, including feelings and thoughts. It is an academic discipline of immense scope, crossing the boundaries betwe ...
,
social science
Social science is one of the branches of science, devoted to the study of societies and the relationships among individuals within those societies. The term was formerly used to refer to the field of sociology, the original "science of soc ...
, and in any field reliant on 'difficult to reach' answers that are based on evidence (
statistics
Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
). In medicine for instance, while clinical trials are used primarily for assessing the safety of newly developed pharmaceuticals before they are approved for sale, epidemiological analysis on how risk factors affect the incidence of diseases is often used to identify the causes of diseases in the first place, and to help provide pre-clinical justification for the plausibility of protective factors (treatments).
Comparison with controlled trials
Cohort studies differ from clinical trials in that no intervention, treatment, or exposure is administered to participants in a cohort design; and no control group is defined. Rather, cohort studies are largely about the life histories of segments of populations and the individual people who constitute these segments. Exposures or protective factors are identified as preexisting characteristics of participants. The study is controlled by including other common characteristics of the cohort in the statistical analysis. Both exposure/treatment and control variables are measured at baseline. Participants are then followed over time to observe the incidence rate of the disease or outcome in question. Regression analysis can then be used to evaluate the extent to which the exposure or treatment variable contributes to the incidence of the disease, while accounting for other variables that may be at play.
Double-blind randomized controlled trials (RCTs) are generally considered superior methodology in the
hierarchy of evidence
A hierarchy of evidence (or levels of evidence) is a heuristic used to rank the relative strength of results obtained from scientific research. There is broad agreement on the relative strength of large-scale, epidemiological studies. More than 80 ...
in treatment, because they allow for the most control over other variables that could affect the outcome, and the randomization and blinding processes reduce bias in the study design. This minimizes the chance that results will be influenced by
confounding variables, particularly ones that are unknown. However, educated hypotheses based on prior research and background knowledge are used to select variables to be included in the regression model for cohort studies, and statistical methods can be used to identify and account potential confounders from these variables. Bias can also be mitigated in a cohort study when selecting participants for the cohort. It is also important to note that RCTs may not be suitable in all cases; such as when the outcome is a negative health effect and the exposure is hypothesized to be a risk factor for the outcome. Ethical standards, and morality, would prevent the use of risk factors in RCTs. The natural or incidental exposure to these risk factors (e.g. time spent in the sun), or self-administered exposure (e.g. smoking), can be measured without subjecting participants to risk factors outside of their individual lifestyles, habits, and choices.
Types
Cohort studies can be
retrospective
A retrospective (from Latin ''retrospectare'', "look back"), generally, is a look back at events that took place, or works that were produced, in the past. As a noun, ''retrospective'' has specific meanings in medicine, software development, popu ...
(looking back in time, thus using existing data such as medical records or claims database) or
prospective (requiring the collection of new data). Retrospective cohort studies restrict the investigators ability to reduce confounding and bias because collected information is restricted to data that already exists. There are advantages to this design however, as retrospective studies are much cheaper and faster because the data has already been collected and stored.
A
cohort
Cohort or cohortes may refer to:
* Cohort (educational group), a group of students working together through the same academic curriculum
* Cohort (floating point), a set of different encodings of the same numerical value
* Cohort (military unit ...
is a group of people who share a common characteristic or experience within a defined period (e.g., are currently living, are exposed to a drug or vaccine or pollutant, or undergo a certain medical procedure). Thus a group of people who were born on a day or in a particular period, say 1948, form a birth cohort. The comparison group may be the general population from which the cohort is drawn, or it may be another cohort of persons thought to have had little or no exposure to the substance under investigation, but otherwise similar. Alternatively, subgroups within the cohort may be compared with each other.
Application
In medicine, a cohort study is often undertaken to obtain evidence to try to refute the existence of a suspected association between cause and effect; failure to refute a hypothesis often strengthens confidence in it. Crucially, the cohort is identified before the appearance of the disease under investigation. The study groups follow a group of people who do not have the disease for a period of time and see who develops the disease (new incidence). The cohort cannot therefore be defined as a group of people who already have the disease. Prospective (longitudinal) cohort studies between exposure and disease strongly aid in studying causal associations, though distinguishing true causality usually requires further corroboration from further experimental trials.
The advantage of prospective cohort study data is that it can help determine risk factors for contracting a new disease because it is a longitudinal observation of the individual through time, and the collection of data at regular intervals, so recall error is reduced. However, cohort studies are expensive to conduct, are sensitive to
attrition
Attrition may refer to
*Attrition warfare, the military strategy of wearing down the enemy by continual losses in personnel and material
**War of Attrition, fought between Egypt and Israel from 1968 to 1970
**War of attrition (game), a model of agg ...
and take a long follow-up time to generate useful data. Nevertheless, the results that are obtained from long-term cohort studies are of substantially superior quality to those obtained from retrospective/cross-sectional studies. Prospective cohort studies are considered to yield the most reliable results in observational epidemiology. They enable a wide range of exposure-disease associations to be studied.
Some cohort studies track groups of children from their birth, and record a wide range of information (exposures) about them. The value of a cohort study depends on the researchers' capacity to stay in touch with all members of the cohort. Some studies have continued for decades.
In a cohort study, the population under investigation consists of individuals who are at risk of developing a specific disease or health outcome.
Examples
An example of an epidemiological question that can be answered using a cohort study is whether exposure to X (say, smoking) associates with outcome Y (say, lung cancer). For example, in 1951, the
British Doctors Study was started. Using a cohort which included both smokers (the exposed group) and non-smokers (the unexposed group). The study continued through 2001. By 1956, the study provided convincing proof of the association between smoking and the incidence of lung cancer. In a cohort study, the groups are ''matched'' in terms of many other variables such as economic status and other health status so that the variable being assessed, the
independent variable (in this case, smoking) can be isolated as the cause of the
dependent variable (in this case, lung cancer). In this example, a
statistically significant increase in the incidence of lung cancer in the smoking group as compared to the non-smoking group is evidence in favor of the hypothesis. However, rare outcomes, such as lung cancer, are generally not studied with the use of a cohort study, but are rather studied with the use of a
case-control study.
Cohort Succession can explain most change in literature, art, intellectualism, political opinions, and phonology.
Shorter term studies are commonly used in medical research as a form of
clinical trial
Clinical trials are prospective biomedical or behavioral research studies on human participants designed to answer specific questions about biomedical or behavioral interventions, including new treatments (such as novel vaccines, drugs, dietar ...
, or means to test a particular hypothesis of clinical importance. Such studies typically follow two groups of patients for a period of time and compare an endpoint or outcome measure between the two groups.
Randomized controlled trial
A randomized controlled trial (or randomized control trial; RCT) is a form of scientific experiment used to control factors not under direct experimental control. Examples of RCTs are clinical trials that compare the effects of drugs, surgical te ...
s, or RCTs, are a superior methodology in the hierarchy of evidence, because they limit the potential for bias by randomly assigning one patient pool to an intervention and another patient pool to non-intervention (or placebo). This minimizes the chance that the incidence of confounding variables will differ between the two groups. Nevertheless, it is sometimes not practical or ethical to perform RCTs to answer a clinical question. To take our example, if we already had reasonable evidence that smoking causes lung cancer then persuading a pool of non-smokers to take up smoking in order to test this hypothesis would generally be considered quite unethical.
Two examples of cohort studies that have been going on for more than 50 years are the
Framingham Heart Study
The Framingham Heart Study is a long-term, ongoing cardiovascular cohort study of residents of the city of Framingham, Massachusetts. The study began in 1948 with 5,209 adult subjects from Framingham, and is now on its third generation of partic ...
and the
National Child Development Study
The National Child Development Study (NCDS) is a continuing, multi-disciplinary longitudinal study which follows the lives of 17,415 people born in England, Scotland and Wales from 17,205 women during the week of 3–9 March 1958. The results from ...
(NCDS), the most widely researched of the
British birth cohort studies Birth cohort studies in Britain include four long-term medical and social studies, carried out over the lives of a group of participants, from birth. Two of these studies have continued for over 50 years.
Principal cohort studies
* National Surve ...
. Key findings of NCDS and a detailed profile of the study appear in the ''International Journal of Epidemiology''.
The
Dunedin Longitudinal Study
The Dunedin Multidisciplinary Health and Development Study (also known as the Dunedin Study) is a detailed study of human health, development and behaviour. Based at the University of Otago in New Zealand, the Dunedin Study has followed the liv ...
, started in 1975, has been studying the thousand people born in
Dunedin
Dunedin ( ; mi, Ōtepoti) is the second-largest city in the South Island of New Zealand (after Christchurch), and the principal city of the Otago region. Its name comes from , the Scottish Gaelic name for Edinburgh, the capital of Scotland. Th ...
, New Zealand, in 1972–1973. The subjects are interviewed regularly, with Phase 45 starting in 2017.
The largest cohort study in women is the
Nurses' Health Study
The Nurses Health Study is a series of prospective studies that examine epidemiology and the long-term effects of nutrition, hormones, environment, and nurses' work-life on health and disease development. The studies have been among the largest in ...
. Started in 1976, it is tracking over 120,000 nurses and has been analyzed for many different conditions and outcomes.
The largest cohort study in Africa is the
Birth to Twenty Study, which began in 1990 and tracks a cohort of over 3,000 children born in the weeks following
Nelson Mandela
Nelson Rolihlahla Mandela (; ; 18 July 1918 – 5 December 2013) was a South African Internal resistance to apartheid, anti-apartheid activist who served as the President of South Africa, first president of South Africa from 1994 to 1 ...
's release from prison.
Other famous examples are the
Grant Study The Grant Study is part of the Study of Adult Development at Harvard Medical School. It is a 75-year longitudinal study that followed 268 Harvard educated men, the majority of whom were members of the undergraduate classes of 1942, 1943 and 1944. It ...
tracking a number of Harvard graduates from ca. 1950.77, the
Whitehall Study
The Whitehall Studies investigated social determinants of health, specifically the cardiovascular disease prevalence and mortality rates among British civil servants. The initial prospective cohort study, the Whitehall I Study, examined over 17 ...
tracking 10,308 British civil servants, and the
Caerphilly Heart Disease Study
The Caerphilly Heart Disease Study, also known as the Caerphilly Prospective Study (CaPS), is an epidemiological prospective cohort, set up in 1979 in a representative population sample drawn from Caerphilly, a typical small town in South Wales ...
, which since 1979 has studied a representative sample of 2,512 men, drawn from the Welsh town of Caerphilly.
The ASPREE-XT study is designed to determine whether there are long-lasting effects of an average of 4–5 years of treatment with daily low-dose aspirin, with outcome measures including cancer mortality. As of September 2018, there were 16,703 ASPREE-XT participants in Australia. It has been proposed that the existing ASPREE-XT study could provide a platform for a future multigenerational research study.
Variations
The diagram indicates the starting point and direction of cohort and case-control studies.
In Case-control studies the analysis proceeds from documented disease and investigations are made to arrive at the possible causes of the disease. In cohort studies the assessments starts with the putative cause of disease, and observations are made of the occurrence of disease relative to the hypothesized causal agent.
Current and historical cohorts
A current cohort study represents a true prospective study where the data concerning exposure are assembled prior to the occurrence of the fact to be studied, for instance a disease. An example of a current cohort study is the Oxford Family Planning Association Study in the United Kingdom, which aimed to provide a balanced view of the beneficial and harmful effects of different methods of contraception. This study has provided a large amount of information on the efficacy and safety of contraceptive methods, and in particular oral contraceptives (OCs), diaphragms and intrauterine device (IUDs).
In a historical cohort study the data concerning exposure and occurrence of a disease, births, a political attitude or any other categorical variable are collected after the events have taken place, and the subjects (those exposed and unexposed to the agent under study) are assembled from existing records or health care registers.
A "
prospective cohort" defines the groups before the study is done, while historical studies, which are sometimes referred to as "
retrospective cohort", defines the grouping after the data is collected. Examples of a
retrospective cohort are ''Long-Term Mortality after Gastric Bypass Surgery''
and ''
The Lothian Birth Cohort Studies''.
Although historical studies are sometimes referred to as retrospective study, it a misnomer as the methodological principles of historical cohort studies and prospective studies are the same.
Nested case-control study
A nested case-control study is a case control nested inside of a cohort study. The procedure begins like a normal cohort study, however, as participants develop the outcome of interest they are selected as cases. Once the cases are identified, controls are selected and matched to each case. The process for selecting and matching cases is identical to a normal case control study. An example of a
nested case-control study
''Nested'' is the seventh studio album by Bronx-born singer, songwriter and pianist Laura Nyro, released in 1978 on Columbia Records.
Following on from her extensive tour to promote 1976's ''Smile'', which resulted in the 1977 live album '' Seas ...
is ''Inflammatory markers and the risk of coronary heart disease in men and women'', which was a case control analyses extracted from the
Framingham Heart Study
The Framingham Heart Study is a long-term, ongoing cardiovascular cohort study of residents of the city of Framingham, Massachusetts. The study began in 1948 with 5,209 adult subjects from Framingham, and is now on its third generation of partic ...
cohort.
Nested case-controls have the advantage of reducing the number of participants that require details follow up or diagnostic testing to assess outcome or exposure status. However, this will also reduce the power of the study, when compared to larger cohort the study population is drawn from.
Household panel survey
Household panel surveys are an important sub-type of cohort study. These draw representative samples of households and survey them, following all individuals through time on a usually annual basis. Examples include the US
Panel Study of Income Dynamics The Panel Study of Income Dynamics (PSID) is a longitudinal panel survey of American families, conducted by the Survey Research Center at the University of Michigan.
The PSID measures economic, social, and health factors over the life course of f ...
(since 1968), the ''German''
Socio-Economic Panel
The ''German'' Socio-Economic Panel (SOEP [], for ''Sozio-oekonomisches Panel'') is a Longitudinal study, longitudinal panel dataset of the population in Germany. It is a household based study which started in 1984 and which reinterviews adul ...
(since 1984), the
British Household Panel Survey
The British Household Panel Survey (BHPS), carried out at the Institute for Social and Economic Research of the University of Essex, is a survey for social and economic research. A sample of British households was drawn and first interviewed in ...
(since 1991), the
Household, Income and Labour Dynamics in Australia Survey
The Household, Income and Labour Dynamics in Australia (HILDA) survey is an Australian household-based panel study which began in 2001. It has been used for examining issues such as the incidence of persistent poverty; assets and income in the t ...
(since 2001) and the
European Community Household Panel
European, or Europeans, or Europeneans, may refer to:
In general
* ''European'', an adjective referring to something of, from, or related to Europe
** Ethnic groups in Europe
** Demographics of Europe
** European cuisine, the cuisines of Europe ...
(1994–2001).
Cohort analysis in business
For an example in business analysis, see
cohort analysis.
See also
*
Cohort (statistics)
In statistics, marketing and demography, a cohort is a group of subjects who share a defining characteristic (typically subjects who experienced a common event in a selected time period, such as birth or graduation).
Cohort data can oftentimes ...
*
Community of position {{Unreferenced, date=December 2009
A community of position is distinguished from a community of practice in that it tends to be more personally focused. Communities of position built around life stages (such as teenage years, university/college stu ...
*
Panel analysis Panel (data) analysis is a statistical method, widely used in social science, epidemiology, and econometrics to analyze two-dimensional (typically cross sectional and longitudinal) panel data. The data are usually collected over time and over the sa ...
*
Panel data
In statistics and econometrics, panel data and longitudinal data are both multi-dimensional data involving measurements over time. Panel data is a subset of longitudinal data where observations are for the same subjects each time.
Time series and ...
References
External links
Prospective cohortsCornell University College of Veterinary Medicine
Birth cohort study timelines (ESDS Longitudinal)Centre for Longitudinal Studies
{{DEFAULTSORT:Cohort Study
Epidemiology
*
*
Research
Nursing research