The Brown–Forsythe test is a
statistical test
A statistical hypothesis test is a method of statistical inference used to decide whether the data provide sufficient evidence to reject a particular hypothesis. A statistical hypothesis test typically involves a calculation of a test statistic. ...
for the equality of group variances based on performing an
Analysis of Variance
Analysis of variance (ANOVA) is a family of statistical methods used to compare the Mean, means of two or more groups by analyzing variance. Specifically, ANOVA compares the amount of variation ''between'' the group means to the amount of variati ...
(ANOVA) on a
transformation of the
response variable. When a
one-way ANOVA is performed, samples are assumed to have been drawn from distributions with equal
variance
In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...
. If this assumption is not valid, the resulting
''F''-test is invalid. The Brown–Forsythe
test statistic
Test statistic is a quantity derived from the sample for statistical hypothesis testing.Berger, R. L.; Casella, G. (2001). ''Statistical Inference'', Duxbury Press, Second Edition (p.374) A hypothesis test is typically specified in terms of a tes ...
is the F statistic resulting from an ordinary one-way analysis of variance on the absolute deviations of the groups or treatments data from their individual medians.
Transformation
The transformed response variable is constructed to measure the
spread in each group. Let
:
where
is the
median
The median of a set of numbers is the value separating the higher half from the lower half of a Sample (statistics), data sample, a statistical population, population, or a probability distribution. For a data set, it may be thought of as the “ ...
of group ''j''. The Brown–Forsythe test statistic is the model ''F'' statistic from a one way ANOVA on ''z
ij'':
:
where ''p'' is the number of groups, ''n
j'' is the number of observations in group ''j'', and ''N'' is the total number of observations. Also
are the group means of the
and
is the overall mean of the
. This ''F''-statistic follows the
''F''-distribution with degrees of freedom
and
under the null hypothesis.
If the variances are indeed heterogeneous, techniques that allow for this (such as the
Welch one-way ANOVA) may be used instead of the usual ANOVA.
Good, noting that the deviations are linearly dependent, has modified the test so as to drop the redundant deviations.
Comparison with Levene's test
Levene's test uses the mean instead of the median. Although the optimal choice depends on the underlying distribution, the definition based on the median is recommended as the choice that provides good
robustness against many types of non-normal data while retaining good
statistical power
In frequentist statistics, power is the probability of detecting a given effect (if that effect actually exists) using a given test in a given context. In typical use, it is a function of the specific test that is used (including the choice of tes ...
.
If one has knowledge of the underlying distribution of the data, this may indicate using one of the other choices. Brown and Forsythe
performed
Monte Carlo
Monte Carlo ( ; ; or colloquially ; , ; ) is an official administrative area of Monaco, specifically the Ward (country subdivision), ward of Monte Carlo/Spélugues, where the Monte Carlo Casino is located. Informally, the name also refers to ...
studies that indicated that using the
trimmed mean performed best when the underlying data followed a
Cauchy distribution (a
heavy-tailed distribution) and the median performed best when the underlying data followed a
χ2 distribution with four degrees of freedom (a sharply
skewed distribution
In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real number, real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined.
For ...
). Using the mean provided the best power for symmetric, moderate-tailed, distributions. O'Brien tested several ways of using the traditional analysis of variance to test heterogeneity of spread in
factorial
In mathematics, the factorial of a non-negative denoted is the Product (mathematics), product of all positive integers less than or equal The factorial also equals the product of n with the next smaller factorial:
\begin
n! &= n \times ...
designs with equal or unequal sample sizes. The jackknife pseudovalues of s
2 and the absolute deviations from the cell median are shown to be robust and relatively powerful.
See also
*
Bartlett's test for unequal variances, which is derived from the
likelihood ratio test under the normal distribution.
References
External links
NIST: Levene Test for Equality of Variances
{{DEFAULTSORT:Brown-Forsythe test
Statistical tests