HOME

TheInfoList



OR:

In
statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, Welch's ''t''-test, or unequal variances ''t''-test, is a two-sample
location test A location test is a statistical hypothesis test that compares the location parameter of a statistical population to a given constant, or that compares the location parameters of two statistical populations to each other. Most commonly, the locat ...
which is used to test the (null) hypothesis that two
populations Population is a set of humans or other organisms in a given region or area. Governments conduct a census to quantify the resident population size within a given jurisdiction. The term is also applied to non-human animals, microorganisms, and pl ...
have equal means. It is named for its creator, Bernard Lewis Welch, and is an adaptation of Student's ''t''-test, and is more reliable when the two samples have unequal variances and possibly unequal sample sizes. These tests are often referred to as "unpaired" or "independent samples" ''t''-tests, as they are typically applied when the statistical units underlying the two samples being compared are non-overlapping. Given that Welch's ''t''-test has been less popular than Student's ''t''-test and may be less familiar to readers, a more informative name is "Welch's unequal variances ''t''-test" — or "unequal variances ''t''-test" for brevity. Sometimes, it is referred as Satterthwaite or Welch–Satterthwaite test.


Assumptions

Student's ''t''-test assumes that the sample means being compared for two populations are normally distributed, and that the populations have equal variances. Welch's ''t''-test is designed for unequal population variances, but the assumption of normality is maintained. Welch's ''t''-test is an approximate solution to the Behrens–Fisher problem.


Calculations

Welch's ''t''-test defines the statistic ''t'' by the following formula: : t = \frac = \frac, : s_ = \frac, where \overline_i and s_ are the i-th
sample mean The sample mean (sample average) or empirical mean (empirical average), and the sample covariance or empirical covariance are statistics computed from a sample of data on one or more random variables. The sample mean is the average value (or me ...
and its
standard error The standard error (SE) of a statistic (usually an estimator of a parameter, like the average or mean) is the standard deviation of its sampling distribution or an estimate of that standard deviation. In other words, it is the standard deviati ...
, with s_i denoting the corrected sample standard deviation, and
sample size Sample size determination or estimation is the act of choosing the number of observations or replicates to include in a statistical sample. The sample size is an important feature of any empirical study in which the goal is to make inferences abo ...
N_i. Unlike in Student's ''t''-test, the denominator is ''not'' based on a pooled variance estimate. The
degrees of freedom In many scientific fields, the degrees of freedom of a system is the number of parameters of the system that may vary independently. For example, a point in the plane has two degrees of freedom for translation: its two coordinates; a non-infinite ...
\nu associated with this variance estimate is approximated using the Welch–Satterthwaite equation: : \nu \approx \frac . This expression can be simplified when N_1 = N_2: : \nu \approx \frac , where \nu_i = N_i - 1 is the degrees of freedom associated with the ''i''-th variance estimate. The statistic is approximately from the ''t''-distribution, since we have an approximation of the
chi-square distribution The term chi-square, chi-squared, or \chi^2 has various uses in statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analys ...
. This approximation is better done when both N_1 and N_2 are larger than 5.


Statistical test

Once ''t'' and ''\nu'' have been computed, these statistics can be used with the ''t''-distribution to test one of two possible null hypotheses: * A two-tailed test, in which the two population means are equal; or * A one-tailed test, in which one of the population means is greater than or equal to the other. The approximate degrees of freedom are
real number In mathematics, a real number is a number that can be used to measure a continuous one- dimensional quantity such as a duration or temperature. Here, ''continuous'' means that pairs of values can have arbitrarily small differences. Every re ...
s \left(\nu\in\mathbb^+\right) and used as such in statistics-oriented software, whereas they are rounded down to the nearest integer in spreadsheets.


Confidence intervals

Based on Welch's t-test, it's possible to also construct a two sided confidence interval for the difference in means (while not having to assume equal variances). This will be by taking: :CI(\mu_1-\mu_2): \overline_1 - \overline_2 \pm \sqrt \times t_ Based on the above definitions of s_ and \nu.


Advantages and limitations

Welch's ''t''-test is more robust than Student's ''t''-test and maintains type I error rates close to nominal for unequal variances and for unequal sample sizes under normality. Furthermore, the power of Welch's ''t''-test comes close to that of Student's ''t''-test, even when the population variances are equal and sample sizes are balanced. Welch's ''t''-test can be generalized to more than 2-samples, which is more robust than
one-way analysis of variance In statistics, one-way analysis of variance (or one-way ANOVA) is a technique to compare whether two or more samples' means are significantly different (using the F distribution). This analysis of variance technique requires a numeric Dependent and ...
(ANOVA). It is ''not recommended'' to pre-test for equal variances and then choose between Student's ''t''-test or Welch's ''t''-test. Rather, Welch's ''t''-test can be applied directly and without any substantial disadvantages to Student's ''t''-test as noted above. Welch's ''t''-test remains robust for skewed distributions and large sample sizes. Reliability decreases for skewed distributions and smaller samples, where one could possibly perform Welch's ''t''-test.


Software implementations


See also

* Student's ''t''-test * ''Z''-test *
Factorial experiment In statistics, a factorial experiment (also known as full factorial experiment) investigates how multiple factors influence a specific outcome, called the response variable. Each factor is tested at distinct values, or levels, and the Experiment ...
*
One-way analysis of variance In statistics, one-way analysis of variance (or one-way ANOVA) is a technique to compare whether two or more samples' means are significantly different (using the F distribution). This analysis of variance technique requires a numeric Dependent and ...
* Hotelling's two-sample T-squared statistic, a multivariate extension of Welch's ''t''-test


References

{{Reflist, 30em Statistical approximations Statistical tests