Goodman And Kruskal's Gamma
   HOME

TheInfoList



OR:

In
statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, Goodman and Kruskal's gamma is a measure of
rank correlation In statistics, a rank correlation is any of several statistics that measure an ordinal association — the relationship between rankings of different ordinal data, ordinal variables or different rankings of the same variable, where a "ranking" is t ...
, i.e., the similarity of the orderings of the data when ranked by each of the quantities. It measures the strength of association of the cross tabulated data when both variables are measured at the ordinal level. It makes no adjustment for either table size or ties. Values range from −1 (100% negative association, or perfect inversion) to +1 (100% positive association, or perfect agreement). A value of zero indicates the absence of association. This statistic (which is distinct from
Goodman and Kruskal's lambda In probability theory and statistics, Goodman & Kruskal's lambda (\lambda) is a measure of proportional reduction in error in cross tabulation analysis. For any sample with a nominal independent variable and dependent variable (or ones that can b ...
) is named after Leo Goodman and William Kruskal, who proposed it in a series of papers from 1954 to 1972.


Definition

The estimate of gamma, ''G'', depends on two quantities: :*''Ns'', the number of pairs of cases ranked in the same order on both variables (number of
concordant pair In statistics, a concordant pair is a pair of observations, each on two variables, (X''1'',Y''1'') and (X''2'',Y''2''), having the property that : \sgn (X_2 - X_1)\ = \sgn (Y_2 - Y_1), where "sgn" refers to whether a number is positive, zero, o ...
s), :*''Nd'', the number of pairs of cases ranked in reversed order on both variables (number of reversed pairs), where "ties" (cases where either of the two variables in the pair are equal) are dropped. Then :G=\frac\ . This statistic can be regarded as the
maximum likelihood estimator In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed stati ...
for the theoretical quantity \gamma, where :\gamma=\frac\ , and where ''P''''s'' and ''P''''d'' are the probabilities that a randomly selected pair of observations will place in the same or opposite order respectively, when ranked by both variables. Critical values for the gamma statistic are sometimes found by using an approximation, whereby a transformed value, ''t'' of the statistic is referred to Student t distribution, where : t \approx G \sqrt\ , and where ''n'' is the number of observations (not the number of pairs): : n \ne N_s+N_d. \,


Yule's Q

A special case of Goodman and Kruskal's gamma is Yule's Q, also known as the Yule coefficient of association, which is specific to 2×2 matrices. Consider the following
contingency table In statistics, a contingency table (also known as a cross tabulation or crosstab) is a type of table in a matrix format that displays the multivariate frequency distribution of the variables. They are heavily used in survey research, business int ...
of events, where each value is a count of an event's frequency: Yule's Q is given by: :Q=\frac\ . Although computed in the same fashion as Goodman and Kruskal's gamma, it has a slightly broader interpretation because the distinction between nominal and ordinal scales becomes a matter of arbitrary labeling for dichotomous distinctions. Thus, whether Q is positive or negative depends merely on which pairings the analyst considers to be concordant, but is otherwise symmetric. ''Q'' varies from −1 to +1. −1 reflects total negative association, +1 reflects perfect positive association and 0 reflects no association at all. The sign depends on which pairings the analyst initially considered to be concordant, but this choice does not affect the magnitude. In term of the
odds ratio An odds ratio (OR) is a statistic that quantifies the strength of the association between two events, A and B. The odds ratio is defined as the ratio of the odds of event A taking place in the presence of B, and the odds of A in the absence of B ...
OR, Yule's ''Q'' is given by :Q= \frac\ . and so Yule's ''Q'' and Yule's ''Y'' are related by :Q = \frac\ , :Y = \frac\ .


See also

*
Kendall tau rank correlation coefficient In statistics, the Kendall rank correlation coefficient, commonly referred to as Kendall's τ coefficient (after the Greek letter τ, tau), is a statistic used to measure the ordinal association between two measured quantities. A τ test is a ...
*
Goodman and Kruskal's lambda In probability theory and statistics, Goodman & Kruskal's lambda (\lambda) is a measure of proportional reduction in error in cross tabulation analysis. For any sample with a nominal independent variable and dependent variable (or ones that can b ...
* Yule's Y, also known as the coefficient of colligation


References


Further reading

*Sheskin, D.J. (2007) ''The Handbook of Parametric and Nonparametric Statistical Procedures''. Chapman & Hall/CRC, {{DEFAULTSORT:Gamma Test (Statistics) Rankings Statistical tests Summary statistics for contingency tables