Exact Statistics
   HOME

TheInfoList



OR:

Exact statistics, such as that described in
exact test In statistics, an exact (significance) test is a test such that if the null hypothesis is true, then all assumptions made during the derivation of the distribution of the test statistic are met. Using an exact test provides a significance test th ...
, is a branch of
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
that was developed to provide more accurate results pertaining to
statistical test A statistical hypothesis test is a method of statistical inference used to decide whether the data at hand sufficiently support a particular hypothesis. Hypothesis testing allows us to make probabilistic statements about population parameters. ...
ing and
interval estimation In statistics, interval estimation is the use of sample data to estimate an '' interval'' of plausible values of a parameter of interest. This is in contrast to point estimation, which gives a single value. The most prevalent forms of interval es ...
by eliminating procedures based on
asymptotic In analytic geometry, an asymptote () of a curve is a line such that the distance between the curve and the line approaches zero as one or both of the ''x'' or ''y'' coordinates tends to infinity. In projective geometry and related contexts, ...
and approximate statistical methods. The main characteristic of exact methods is that statistical tests and
confidence interval In frequentist statistics, a confidence interval (CI) is a range of estimates for an unknown parameter. A confidence interval is computed at a designated ''confidence level''; the 95% confidence level is most common, but other levels, such as 9 ...
s are based on exact probability statements that are valid for any
sample size Sample size determination is the act of choosing the number of observations or Replication (statistics), replicates to include in a statistical sample. The sample size is an important feature of any empirical study in which the goal is to make stat ...
. Exact statistical methods help avoid some of the unreasonable assumptions of traditional statistical methods, such as the assumption of equal variances in classical
ANOVA Analysis of variance (ANOVA) is a collection of statistical models and their associated estimation procedures (such as the "variation" among and between groups) used to analyze the differences among means. ANOVA was developed by the statistician ...
. They also allow exact inference on
variance component In statistics, a random effects model, also called a variance components model, is a statistical model where the model parameters are random variables. It is a kind of hierarchical linear model, which assumes that the data being analysed are ...
s of
mixed model A mixed model, mixed-effects model or mixed error-component model is a statistical model containing both fixed effects and random effects. These models are useful in a wide variety of disciplines in the physical, biological and social sciences. ...
s. When exact ''p''-values and confidence intervals are computed under a certain distribution, such as the normal distribution, then the underlying methods are referred to as exact parametric methods. The exact methods that do not make any distributional assumptions are referred to as exact nonparametric methods. The latter has the advantage of making fewer assumptions whereas, the former tend to yield more powerful tests when the distributional assumption is reasonable. For advanced methods such as higher-way ANOVA
regression analysis In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships between a dependent variable (often called the 'outcome' or 'response' variable, or a 'label' in machine learning parlance) and one ...
, and mixed models, only exact parametric methods are available. When the sample size is small, asymptotic results given by some traditional methods may not be valid. In such situations, the asymptotic ''p''-values may differ substantially from the exact ''p''-values. Hence asymptotic and other approximate results may lead to unreliable and misleading conclusions.


The approach

All classical statistical procedures are constructed using statistics which depend only on observable random vectors, whereas generalized estimators, tests, and confidence intervals used in exact statistics take advantage of the observable random vectors and the observed values both, as in the Bayesian approach but without having to treat constant parameters as random variables. For example, in sampling from a normal population with mean \mu and variance \sigma ^2, suppose \overline and S ^2 are the sample mean and the sample variance. Then, defining Z and U thus: : Z = \sqrt(\overline - \mu)/ \sigma \sim N(0,1) and that :U = n S^2 / \sigma^2 \sim \chi^2 _ . Now suppose the parameter of interest is the coefficient of variation, \rho = \mu /\sigma . Then, we can easily perform exact tests and exact confidence intervals for \rho based on the generalized statistic :R = \frac - \frac = \frac \frac ~-~ \frac , where \overline is the observed value of \overline and S is the observed value of s. Exact inferences on \rho based on probabilities and expected values of R are possible because its distribution and the observed value are both free of nuisance parameters.


Generalized ''p''-values

Classical statistical methods do not provide exact tests to many statistical problems such as testing Variance Components and ANOVA under unequal variances. To rectify this situation, the generalized ''p''-values are defined as an extension of the classical ''p''-values so that one can perform tests based on exact probability statements valid for any sample size.


See also

*
Fisher's exact test Fisher's exact test is a statistical significance test used in the analysis of contingency tables. Although in practice it is employed when sample sizes are small, it is valid for all sample sizes. It is named after its inventor, Ronald Fisher, ...
*
Optimal discriminant analysis Optimal Discriminant Analysis (ODA)Provider: John Wiley & Sons, Ltd Content:text/plain; charset="UTF-8" TY - JOUR AU - Yarnold, Paul R. AU - Soltysik, Robert C. TI - Theoretical Distributions of Optima for Univariate Discrimination of Random ...
*
Classification tree analysis Classification is a process related to categorization, the process in which ideas and objects are recognized, differentiated and understood. Classification is the grouping of related facts into classes. It may also refer to: Business, organiza ...


References

* Fisher, R. A. 1954. ''
Statistical Methods for Research Workers ''Statistical Methods for Research Workers'' is a classic book on statistics, written by the statistician R. A. Fisher. It is considered by some to be one of the 20th century's most influential books on statistical methods, together with his ''The ...
''. Oliver and Boyd. * Mehta, C. R. 1995. SPSS 6.1 Exact test for Windows.
Prentice Hall Prentice Hall was an American major educational publisher owned by Savvas Learning Company. Prentice Hall publishes print and digital content for the 6–12 and higher-education market, and distributes its technical titles through the Safari B ...
. * Mehta CR and Patel NR. 1983. A network algorithm for performing Fisher’s exact test in rxc contingency tables. ''
Journal of the American Statistical Association The ''Journal of the American Statistical Association (JASA)'' is the primary journal published by the American Statistical Association, the main professional body for statisticians in the United States. It is published four times a year in March, ...
'', 78(382): 427-434. * Mehta CR and Patel NR. 1995
Exact logistic regression: theory and examples
''Statistics in Medicine'', 14: 2143-2160. * Mehta CR, Patel NR and Gray R. 1985. On computing an exact confidence interval for the common odds ratio in several 2 x 2 contingency tables. ''
Journal of the American Statistical Association The ''Journal of the American Statistical Association (JASA)'' is the primary journal published by the American Statistical Association, the main professional body for statisticians in the United States. It is published four times a year in March, ...
'', 80(392): 969-973. * Weerahandi, S. 1995.
Exact Statistical Method for Data Analysis
'.
Springer-Verlag Springer Science+Business Media, commonly known as Springer, is a German multinational publishing company of books, e-books and peer-reviewed journals in science, humanities, technical and medical (STM) publishing. Originally founded in 1842 in ...
. * Weerahandi, S. 2004.
Generalized Inference in Repeated Measures: Exact Methods in MANOVA and Mixed Models
'.
John Wiley & Sons John Wiley & Sons, Inc., commonly known as Wiley (), is an American multinational publishing company founded in 1807 that focuses on academic publishing and instructional materials. The company produces books, journals, and encyclopedias, in p ...
.


External links


LogXact, StatXact
Commercial software packages for exact parametric statistics
XPro
Free software package for exact parametric statistics {{DEFAULTSORT:Exact Statistics Statistical inference