In the statistical theory of
factorial experiment
In statistics, a factorial experiment (also known as full factorial experiment) investigates how multiple factors influence a specific outcome, called the response variable. Each factor is tested at distinct values, or levels, and the Experiment ...
s, aliasing is the property of
fractional factorial designs that makes some effects "aliased" with each other – that is, indistinguishable from each other. A primary goal of the theory of such designs is the control of aliasing so that important effects are not aliased with each other.
In a "full" factorial experiment, the number of ''treatment combinations'' or ''cells'' (see
below
Below may refer to:
*Earth
*Ground (disambiguation)
*Soil
*Floor
* Bottom (disambiguation)
*Less than
*Temperatures below freezing
*Hell or underworld
People with the surname
* Ernst von Below (1863–1955), German World War I general
* Fred Belo ...
) can be very large. This necessitates limiting observations to a ''fraction'' (subset) of the treatment combinations.
Aliasing is an automatic and unavoidable result of observing such a fraction.
The aliasing properties of a design are often summarized by giving its
''resolution''. This measures the degree to which the design avoids aliasing between main effects and important interactions.
Fractional factorial experiments have long been a basic tool in
agriculture, food technology, industry, medicine and public health, and the social and behavioral sciences.
They are widely used in exploratory research, particularly in screening experiments, which have applications in industry, drug design and genetics. In all such cases, a crucial step in designing such an experiment is deciding on the desired aliasing pattern, or at least the desired resolution.
As noted
below
Below may refer to:
*Earth
*Ground (disambiguation)
*Soil
*Floor
* Bottom (disambiguation)
*Less than
*Temperatures below freezing
*Hell or underworld
People with the surname
* Ernst von Below (1863–1955), German World War I general
* Fred Belo ...
, the concept of aliasing may have influenced the identification of an analogous phenomenon in signal processing theory.
Overview
Associated with a factorial experiment is a collection of ''effects''. Each factor determines a ''main effect'', and each set of two or more factors determines an ''interaction effect'' (or simply an ''interaction'') between those factors. Each effect is defined by a set of relations between ''cell means'', as described below. In a ''fractional'' factorial design, effects are defined by restricting these relations to the cells in the fraction. It is when the restricted relations for two different effects turn out to be the same that the effects are said to be aliased.
The presence or absence of a given effect in a given data set is tested by statistical methods, most commonly
analysis of variance
Analysis of variance (ANOVA) is a family of statistical methods used to compare the Mean, means of two or more groups by analyzing variance. Specifically, ANOVA compares the amount of variation ''between'' the group means to the amount of variati ...
. While aliasing has significant implications for estimation and hypothesis testing, it is fundamentally a combinatorial and algebraic phenomenon. Construction and analysis of fractional designs thus rely heavily on algebraic methods.
The definition of a fractional design is sometimes broadened to allow multiple observations of some or all treatment combinations – a
''multisubset'' of all treatment combinations. A fraction that is a subset (that is, where treatment combinations are not repeated) is called ''simple''. The theory described below applies to simple fractions.
Contrasts and effects
In any design, full or fractional, the
expected value
In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first Moment (mathematics), moment) is a generalization of the weighted average. Informa ...
of an observation in a given treatment combination is called a ''cell mean'', usually denoted using the Greek letter μ. (The term ''cell'' is borrowed from its use in
tables of data.)
A ''
contrast in cell means'' is a
linear combination
In mathematics, a linear combination or superposition is an Expression (mathematics), expression constructed from a Set (mathematics), set of terms by multiplying each term by a constant and adding the results (e.g. a linear combination of ''x'' a ...
of cell means in which the coefficients sum to 0. In the 2 × 3 experiment illustrated here, the expression
:
is a contrast that compares the mean responses of the treatment combinations 11 and 12. (The coefficients here are 1 and –1.)
The effects in a factorial experiment are expressed in terms of contrasts. In the above example, the contrast
:
is said to ''belong to the main effect of factor A'' as it contrasts the responses to the "1" level of factor
with those for the "2" level. The main effect of ''A'' is said to be ''absent'' if this expression equals 0.
[The claim that a contrast equals 0 is a statement that must be assessed by a ]statistical hypothesis test
A statistical hypothesis test is a method of statistical inference used to decide whether the data provide sufficient evidence to reject a particular hypothesis. A statistical hypothesis test typically involves a calculation of a test statistic. T ...
. See the article on factorial experiments for further detail. Similarly,
:
and
:
are contrasts belonging to the main effect of factor ''B''. On the other hand, the contrasts
:
and
:
''belong to the interaction of A and B''; setting them equal to 0 expresses the lack of interaction. These designations, which extend to arbitrary factorial experiments having three or more factors, depend on the pattern of coefficients, as explained
elsewhere.
Since it is the coefficients of these contrasts that carry the essential information, they are often displayed as
column vectors. For the example above, such a table might look like this:
The columns of such a table are called ''contrast vectors'': their components add up to 0. While there are in general many possible choices of columns to represent a given effect, the ''number'' of such columns — the ''degrees of freedom'' of the effect — is fixed and is given by a well-known formula. In the 2 × 3 example above, the degrees of freedom for
, and the
interaction are 1, 2 and 2, respectively.
In a fractional factorial experiment, the contrast vectors belonging to a given effect are restricted to the treatment combinations in the fraction. Thus, in the half-fraction in the 2 × 3 example, the three effects may be represented by the column vectors in the following table:
The consequence of this truncation — aliasing — is described
below
Below may refer to:
*Earth
*Ground (disambiguation)
*Soil
*Floor
* Bottom (disambiguation)
*Less than
*Temperatures below freezing
*Hell or underworld
People with the surname
* Ernst von Below (1863–1955), German World War I general
* Fred Belo ...
.
Definitions
The factors in the design are allowed to have different numbers of levels, as in a
factorial experiment (an ''asymmetric'' or ''mixed-level'' experiment).
Fix a fraction of a full factorial design. Let
be a set of contrast vectors representing an effect (in particular, a main effect or interaction) in the full factorial design, and let
consist of the restrictions of those vectors to the fraction. One says that the effect is
* ''preserved in the fraction'' if
consists of contrast vectors;
* ''completely lost in the fraction'' if
consists of ''constant'' vectors, that is, vectors whose components are equal; and
* ''partly lost'' otherwise.
Similarly, let
and
represent two effects and let
and
be their restrictions to the fraction. The two effects are said to be
* ''unaliased in the fraction'' if each vector in
is
orthogonal
In mathematics, orthogonality (mathematics), orthogonality is the generalization of the geometric notion of ''perpendicularity''. Although many authors use the two terms ''perpendicular'' and ''orthogonal'' interchangeably, the term ''perpendic ...
(perpendicular) to all the vectors in
, and vice versa;
* ''completely aliased in the fraction'' if each vector in
is a
linear combination
In mathematics, a linear combination or superposition is an Expression (mathematics), expression constructed from a Set (mathematics), set of terms by multiplying each term by a constant and adding the results (e.g. a linear combination of ''x'' a ...
of vectors in
, and vice versa; and
*
''partly aliased'' otherwise.
Finney and Bush introduced the terms "lost" and "preserved" in the sense used here. Despite the relatively long history of this topic, though, its terminology is not entirely standardized. The literature often describes lost effects as "not estimable" in a fraction, although estimation is not the only issue at stake. Rao referred to preserved effects as "measurable from" the fraction.
Resolution
The extent of aliasing in a given fractional design is measured by the ''resolution'' of the fraction, a concept first defined by
Box
A box (plural: boxes) is a container with rigid sides used for the storage or transportation of its contents. Most boxes have flat, parallel, rectangular sides (typically rectangular prisms). Boxes can be very small (like a matchbox) or v ...
and Hunter:
: A fractional factorial design is said to have ''resolution''
if every
-factor effect
[A 1-factor effect is the main effect of a single factor. For , a -factor effect is an interaction between factors. The 0-factor effect is the effect of the grand mean, described below.] is unaliased with every effect having fewer than
factors.
For example, a design has resolution
if main effects are unaliased with each other (taking
, though it allows main effects to be aliased with two-factor interactions. This is typically the lowest resolution desired for a fraction. It is not hard to see that a fraction of resolution
also has resolution
, etc., so one usually speaks of the ''maximum'' resolution of a fraction.
The number
in the definition of resolution is usually understood to be a positive integer, but one may consider the ''effect of the grand mean'' to be the (unique) effect with no factors (i.e., with
). This effect sometimes appears in
analysis of variance
Analysis of variance (ANOVA) is a family of statistical methods used to compare the Mean, means of two or more groups by analyzing variance. Specifically, ANOVA compares the amount of variation ''between'' the group means to the amount of variati ...
tables. It has one degree of freedom, and is represented by a single vector, a column of 1's. With this understanding, an effect is
* preserved in a fraction if it is unaliased with the grand mean, and
* completely lost in a fraction if it is completely aliased with the grand mean.
A fraction then has resolution
if all main effects are preserved in the fraction. If it has resolution
then two-factor interactions are also preserved.
Computation
The definitions above require some computations with vectors, illustrated in the examples that follow. For certain fractional designs (the ''regular'' ones), a simple algebraic technique can be used that bypasses these procedures and gives a simple way to determine resolution. This is discussed
below
Below may refer to:
*Earth
*Ground (disambiguation)
*Soil
*Floor
* Bottom (disambiguation)
*Less than
*Temperatures below freezing
*Hell or underworld
People with the surname
* Ernst von Below (1863–1955), German World War I general
* Fred Belo ...
.
Examples
The 2 × 3 experiment
The fraction of this experiment was described
above
Above may refer to:
*Above (artist)
Tavar Zawacki (b. 1981, California) is a Polish, Portuguese - American abstract artist and
internationally recognized visual artist based in Berlin, Germany. From 1996 to 2016, he created work under the ...
along with its restricted vectors. It is repeated here along with the complementary fraction :
In both fractions, the
effect is completely lost (the
column is constant) while the
and interaction effects are preserved (each 3 × 1 column is a contrast vector as its components sum to 0). In addition, the
and interaction effects are completely aliased in each fraction: In the first fraction, the vectors for
are linear combinations of those for
, viz.,
and
;
in the reverse direction, the vectors for
can be written similarly in terms of those representing
. The argument in the second fraction is analogous.
These fractions have maximum resolution 1. The fact that the main effect of
is lost makes both of these fractions undesirable in practice. It turns out that in a 2 × 3 experiment (or in any ''a × b'' experiment in which ''a'' and ''b'' are
relatively prime
In number theory, two integers and are coprime, relatively prime or mutually prime if the only positive integer that is a divisor of both of them is 1. Consequently, any prime number that divides does not divide , and vice versa. This is equiv ...
) there is no fraction that preserves both main effects -- that is, no fraction has resolution 2.
The 2 × 2 × 2 (or 2³) experiment
This is a "two-level" experiment with factors
and
. In such experiments the factor levels are often denoted by 0 and 1, for reasons explained
below
Below may refer to:
*Earth
*Ground (disambiguation)
*Soil
*Floor
* Bottom (disambiguation)
*Less than
*Temperatures below freezing
*Hell or underworld
People with the surname
* Ernst von Below (1863–1955), German World War I general
* Fred Belo ...
. A treatment combination is then denoted by an
ordered triple
In mathematics, a tuple is a finite sequence or ''ordered list'' of numbers or, more generally, mathematical objects, which are called the ''elements'' of the tuple. An -tuple is a tuple of elements, where is a non-negative integer. There is on ...
such as 101 (more formally, (1, 0, 1), denoting the cell in which
and
are at level "1" and
is at level "0"). The following table lists the eight cells of the full 2 × 2 × 2 factorial experiment, along with a contrast vector representing each effect, including a three-factor interaction:
Suppose that only the fraction consisting of the cells 000, 011, 101, and 110 is observed. The original contrast vectors, when restricted to these cells, are now 4 × 1, and can be seen by looking at just those four rows of the table. (Sorting the table on
will bring these rows together and make the restricted contrast vectors easier to see. Sorting twice puts them at the top.) The following can be observed concerning these ''restricted'' vectors:
* The
column consists just of the constant 1 repeated four times.
* The other columns are contrast vectors, having two 1's and two −1s.
* The columns for
and
are equal. The same holds for
and
, and for
and
.
* All other pairs of columns are orthogonal. For example, the column for
is orthogonal to that for
, for
, for
, and for
, as one can see by computing
dot products
In mathematics, the dot product or scalar productThe term ''scalar product'' means literally "product with a scalar as a result". It is also used for other symmetric bilinear forms, for example in a pseudo-Euclidean space. Not to be confused wit ...
.
Thus
* the
interaction is completely lost in the fraction;
* the other effects are preserved in the fraction;
* the effects
and
are completely aliased with each other, as are
and
, and
and
.
* all other pairs of effects are unaliased. For example,
is unaliased with both
and
and with the
and
interactions.
Now suppose instead that the complementary fraction is observed. The same effects as before are lost or preserved, and the same pairs of effects as before are mutually unaliased. Moreover,
and
are still aliased in this fraction since the
and
vectors are negatives of each other, and similarly for
and
and for
and
. Both of these fractions thus have maximum resolution 3.
Aliasing in regular fractions
The two half-fractions of a
factorial experiment described above are of a special kind: Each is the solution set of a linear equation using ''
modular arithmetic
In mathematics, modular arithmetic is a system of arithmetic operations for integers, other than the usual ones from elementary arithmetic, where numbers "wrap around" when reaching a certain value, called the modulus. The modern approach to mo ...
''. More exactly:
* The fraction
is the solution set of the equation
. For example,
is a solution because
.
* Similarly, the fraction
is the solution set to
Such fractions are said to be ''regular''. This idea applies to fractions of "classical"
designs, that is,
(or "symmetric") factorial designs in which the number of levels,
, of each of the
factors is a prime or the power of a prime.
: A fractional factorial design is ''regular'' if it is the solution set of a system of one or more equations of the form
::
: where the equation is modulo
if
is prime, and is in the
finite field
In mathematics, a finite field or Galois field (so-named in honor of Évariste Galois) is a field (mathematics), field that contains a finite number of Element (mathematics), elements. As with any field, a finite field is a Set (mathematics), s ...
if
is a power of a prime.
[The case that is prime is mentioned separately only for clarity, since the set of integers modulo is itself a finite field, though often denoted rather than .] Such equations are called ''defining equations'' of the fraction. When the defining equation or equations are
''homogeneous'', the fraction is said to be ''principal''.
One defining equation yields a fraction of size
, two independent equations a fraction of size
and so on. Such fractions are generally denoted as
designs. The half-fractions described above are
designs. The notation often includes the resolution as a subscript, in Roman numerals; the above fractions are thus
designs.
Associated to each expression
is another, namely
, which rewrites the coefficients as exponents. Such expressions are called "''words''", a term borrowed from
group theory
In abstract algebra, group theory studies the algebraic structures known as group (mathematics), groups.
The concept of a group is central to abstract algebra: other well-known algebraic structures, such as ring (mathematics), rings, field ( ...
. (In a particular example where
is a specific number, the letters
are used, rather than
.) These words can be multiplied and raised to powers, where the word
acts as a multiplicative identity, and they thus form an ''
abelian group
In mathematics, an abelian group, also called a commutative group, is a group in which the result of applying the group operation to two group elements does not depend on the order in which they are written. That is, the group operation is commu ...
''
, known as the ''effects group''.
When
is prime, one has
for every element (word)
; something similar holds in the prime-power case.
In
factorial experiments, each element of
represents a main effect or interaction. In
experiments with
, each one-letter word represents the main effect of that factor, while longer words represent ''components of interaction''. An example below illustrates this with
.
To each ''defining'' expression (the left-hand side of a defining equation) corresponds a ''defining word''. The defining words
generate a subgroup of
that is variously called the ''alias subgroup'',
the ''defining contrast subgroup'', or simply the ''defining subgroup'' of the fraction. Each element of
is a defining word since it corresponds to a defining equation, as one can show. The effects represented by the defining words are completely lost in the fraction while all other effects are preserved. If
, say, then the equation
:
is called the ''defining relation'' of the fraction. This relation is used to determine the aliasing structure of the fraction: If a given effect is represented by the word
, then its aliases are computed by multiplying the defining relation by
, viz.,
:
where the products
are then simplified. This relation indicates complete (not partial) aliasing, and W is unaliased with all other effects listed in
.
Example 1
In either of the
fractions described above, the defining word is
, since the exponents on these letters are the coefficients of
. The
effect is completely lost in the fraction, and the defining subgroup
is simply
, since squaring does not generate new elements
. The defining relation is thus
:
,
and multiplying both sides by
gives
; which simplifies to
:
the alias relation seen earlier. Similarly,
and
. Note that multiplying both sides of the defining relation by
and
does not give any new alias relations.
For comparison, the
fraction with defining equation
has the defining word
(i.e.,
). The effect
is completely lost, and the defining relation is
. Multiplying this by
, by
, and by
gives the alias relations
,
, and
among the six remaining effects. This fraction only has resolution 2 since all effects (except
) are preserved but two main effects are aliased. Finally, solving the defining equation
yields the fraction . One may verify all of this by sorting the table above on column
.
The use of arithmetic modulo 2 explains why the factor levels in such designs are labeled 0 and 1.
Example 2
In a 3-level design, factor levels are denoted 0, 1 and 2, and arithmetic is modulo 3. If there are four factors, say
and
, the effects group
will have the relations
:
From these it follows, for example, that
and
.
A defining equation such as
would produce a regular 1/3-fraction of the 81 (=
) treatment combinations, and the corresponding defining word would be
. Since its powers are
:
and
,
the defining subgroup
would be
, and so the fraction would have defining relation
:
Multiplying by
, for example, yields the aliases
:
For reasons explained elsewhere, though, all powers of a defining word represent the same effect, and the convention is to choose that power whose leading exponent is 1. Squaring the latter two expressions does the trick and gives the alias relations
:
Twelve other sets of three aliased effects are given by Wu and Hamada. Examining all of these reveals that, like
, main effects are unaliased with each other and with two-factor effects, although some two-factor effects are aliased with each other. This means that this fraction has maximum resolution 4, and so is of type
.
The effect
is one of 4 components of the
interaction, while
is one of 8 components of the
interaction. In a 3-level design, each component of interaction carries 2 degrees of freedom.
Example 3
A
design (
of a
design) may be created by solving ''two'' equations in 5 unknowns, say
:
modulo 2. The fraction has eight treatment combinations, such as 10000, 00110 and 11111, and is displayed in the article on
fractional factorial designs.
[That article uses alternate notation for treatment combinations; for example, 10000, 00110 and 11111 are expressed as and .] Here the coefficients in the two defining equations give defining words
and
. Setting
and multiplying through by
gives the alias relation
. The second defining word similarly gives
. The article uses these two aliases to describe an alternate method of construction of the fraction.
The defining subgroup
has one more element, namely the product
, making use of the fact that
. The extra defining word
is known as the ''generalized interaction'' of
and
, and corresponds to the equation
, which is also satisfied by the fraction. With this word included, the full defining relation is
:
(these are the four elements of the defining subgroup), from which all the alias relations of this fraction can be derived – for example, multiplying through by
yields
:
.
Continuing this process yields six more alias sets, each containing four effects. An examination of these sets reveals that main effects are not aliased with each other, but are aliased with two-factor interactions. This means that this fraction has maximum resolution 3. A quicker way to determine the resolution of a regular fraction is given below.
It is notable that the alias relations of the fraction depend only on the left-hand side of the defining equations, not on their constant terms. For this reason, some authors will restrict attention to principal fractions "
without loss of generality
''Without loss of generality'' (often abbreviated to WOLOG, WLOG or w.l.o.g.; less commonly stated as ''without any loss of generality'' or ''with no loss of generality'') is a frequently used expression in mathematics. The term is used to indicat ...
", although the reduction to the principal case often requires verification.
Determining the resolution of a regular fraction
The ''length of a word'' in the effects group is defined to be the number of letters in its name, not counting repetition. For example, the length of the word
is 3.
[This differs from the definition used in ]group theory
In abstract algebra, group theory studies the algebraic structures known as group (mathematics), groups.
The concept of a group is central to abstract algebra: other well-known algebraic structures, such as ring (mathematics), rings, field ( ...
, which counts repetitions. According to the latter view, the length of is 4.
Using this result, one immediately gets the resolution of the preceding examples without computing alias relations:
*In the
fraction with defining word
, the maximum resolution is 3 (the length of that word), while the fraction with defining word
has maximum resolution 2.
*The defining words of the
fraction were
and
, both of length 4, so that the fraction has maximum resolution 4, as indicated.
*In the
fraction with defining words
and
, the maximum resolution is 3, which is the shortest "wordlength".
:One could also construct a
fraction from the defining words
and
, but the defining subgroup
will also include
, their product, and so the fraction will only have resolution 2 (the length of
). This is true starting with any two words of length 4. Thus resolution 3 is the best one can hope for in a fraction of type
.
As these examples indicate, one must consider ''all'' the elements of the defining subgroup in applying the theorem above. This theorem is often taken to be a definition of resolution, but the Box-Hunter definition given
earlier applies to arbitrary fractional designs and so is more general.
Aliasing in general fractions
Nonregular fractions are common, and have certain advantages. For example, they are not restricted to having size a power of
, where
is a prime or prime power. While some methods have been developed to deal with aliasing in particular nonregular designs, no overall algebraic scheme has emerged.
There is a universal ''combinatorial'' approach, however, going back to Rao. If the treatment combinations of the fraction are written as rows of a table, that table is an ''
orthogonal array
In mathematics, an orthogonal array (more specifically, a fixed-level orthogonal array) is a "table" (array) whose entries come from a fixed finite set of symbols (for example, ), arranged in such a way that there is an integer ''t'' so that for ev ...
''. These rows are often referred to as "runs". The columns will correspond to the
factors, and the entries of the table will simply be the symbols used for factor levels, and need not be numbers. The number of levels need not be prime or prime-powered, and they may vary from factor to factor, so that the table may be a ''mixed-level'' array. In this section fractional designs are allowed to be mixed-level unless explicitly restricted.
A key parameter of an orthogonal array is its ''strength'', the definition of which is given in the
article on orthogonal arrays. One may thus refer to the ''strength'' of a fractional design. Two important facts flow immediately from its definition:
*If an array (or fraction) has strength
then it also has strength
for every