False Coverage Rate
   HOME

TheInfoList



OR:

In
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
, a false coverage rate (FCR) is the average rate of false
coverage Coverage may refer to: Filmmaking * Coverage (lens), the size of the image a lens can produce * Camera coverage, the amount of footage shot and different camera setups used in filming a scene * Script coverage, a short summary of a script, wri ...
, i.e. not covering the true parameters, among the selected intervals. The FCR gives a simultaneous coverage at a (1 − ''α'')×100% level for all of the parameters considered in the problem. The FCR has a strong connection to the
false discovery rate In statistics, the false discovery rate (FDR) is a method of conceptualizing the rate of type I errors in null hypothesis testing when conducting multiple comparisons. FDR-controlling procedures are designed to control the FDR, which is the expe ...
(FDR). Both methods address the
problem of multiple comparisons In statistics, the multiple comparisons, multiplicity or multiple testing problem occurs when one considers a set of statistical inferences simultaneously or infers a subset of parameters selected based on the observed values. The more inferenc ...
, FCR from
confidence interval In frequentist statistics, a confidence interval (CI) is a range of estimates for an unknown parameter. A confidence interval is computed at a designated ''confidence level''; the 95% confidence level is most common, but other levels, such as 9 ...
s (CIs) and FDR from P-value's point of view. FCR was needed because of dangers caused by selective inference. Researchers and scientists tend to report or highlight only the portion of data that is considered significant without clearly indicating the various hypothesis that were considered. It is therefore necessary to understand how the data is falsely covered. There are many FCR procedures which can be used depending on the length of the CI – Bonferroni-selected–Bonferroni-adjusted, Adjusted BH-Selected CIs (Benjamini and Yekutieli 2005 ). The incentive of choosing one procedure over another is to ensure that the CI is as narrow as possible and to keep the FCR. For
microarray A microarray is a multiplex lab-on-a-chip. Its purpose is to simultaneously detect the expression of thousands of genes from a sample (e.g. from a tissue). It is a two-dimensional array on a solid substrate—usually a glass slide or silicon t ...
experiments and other modern applications, there are a huge number of
parameters A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...
, often tens of thousands or more and it is very important to choose the most powerful procedure. The FCR was first introduced by
Daniel Yekutieli Daniel is a masculine given name and a surname of Hebrew origin. It means "God is my judge"Hanks, Hardcastle and Hodges, ''Oxford Dictionary of First Names'', Oxford University Press, 2nd edition, , p. 68. (cf. Gabriel—"God is my strength" ...
in his PhD thesis in 2001.Theoretical Results Needed for Applying the False Discovery Rate in Statistical Problems
April, 2001 (Section 3.2, Page 51)


Definitions

Not keeping the FCR means \text>q when q= \frac = \frac , where m_0 is the number of true null hypotheses, R is the number of rejected hypothesis, V is the number of false positives, and \alpha is the significance level. Intervals with simultaneous coverage probability 1-q can control the FCR to be bounded by q.


Classification of multiple hypothesis tests


The problems addressed by FCR


Selection

Selection Selection may refer to: Science * Selection (biology), also called natural selection, selection in evolution ** Sex selection, in genetics ** Mate selection, in mating ** Sexual selection in humans, in human sexuality ** Human mating strategie ...
causes reduced average coverage. Selection can be presented as conditioning on an event defined by the data and may affect the coverage probability of a CI for a single
parameter A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...
. Equivalently, the problem of selection changes the basic sense of
P-values In null-hypothesis significance testing, the ''p''-value is the probability of obtaining test results at least as extreme as the result actually observed, under the assumption that the null hypothesis is correct. A very small ''p''-value means ...
. FCR procedures consider that the goal of conditional coverage following any selection rule for any set of (unknown) values for the parameters is impossible to achieve. A weaker property when it comes to selective CIs is possible and will avoid false coverage statements. FCR is a measure of interval coverage following selection. Therefore, even though a 1 − ''α'' CI does not offer selective ( conditional) coverage, the probability of constructing a no covering CI is at most ''α'', where : \Pr theta \not\in \mathrm,\ \text\leq \Pr theta \not\in \mathrm\leq \alpha


Selection and multiplicity

When facing both multiplicity (inference about multiple parameters) and
selection Selection may refer to: Science * Selection (biology), also called natural selection, selection in evolution ** Sex selection, in genetics ** Mate selection, in mating ** Sexual selection in humans, in human sexuality ** Human mating strategie ...
, not only is the expected proportion of coverage over selected parameters at 1−α not equivalent to the expected proportion of no coverage at α, but also the latter can no longer be ensured by constructing marginal CIs for each selected parameter. FCR procedures solve this by taking the expected proportion of parameters not covered by their CIs among the selected parameters, where the proportion is 0 if no parameter is selected. This false coverage-statement rate (FCR) is a property of any procedure that is defined by the way in which parameters are selected and the way in which the multiple intervals are constructed.


Controlling procedures


Bonferroni procedure (Bonferroni-selected–Bonferroni-adjusted) for simultaneous CI

Simultaneous CIs with Bonferroni procedure when we have m parameters, each marginal CI constructed at the 1 − α/m level. Without selection, these CIs offer simultaneous coverage, in the sense that the probability that all CIs cover their respective parameters is at least 1 − α. unfortunately, even such a strong property does not ensure the conditional confidence property following selection.


FCR for Bonferroni-selected–Bonferroni-adjusted simultaneous CI

The Bonferroni–Bonferroni procedure cannot offer conditional coverage, however it does control the FCR at <α In fact it does so too well, in the sense that the FCR is much too close to 0 for large values of θ. Intervals selection is based on Bonferroni testing, and Bonferroni CIs are then constructed. The FCR is estimated as, the proportion of intervals failing to cover their respective parameters among the constructed CIs is calculated (setting the proportion to 0 when none are selected). Where selection is based on unadjusted individual testing and unadjusted CIs are constructed.


FCR-adjusted BH-selected CIs

In BH procedure for FDR after sorting the ''p'' values ''P''(1) ≤ • • • ≤ ''P''(''m'') and calculating ''R'' = max, the ''R'' null hypotheses for which ''P''(''i'') ≤ ''R'' • ''q''/''m'' are rejected. If testing is done using the Bonferroni procedure, then the lower bound of the FCR may drop well below the desired level ''q'', implying that the intervals are too long. In contrast, applying the following procedure, which combines the general procedure with the FDR controlling testing in the BH procedure, also yields a lower bound for the FCR, ''q''/2 ≤ FCR. This procedure is sharp in the sense that for some configurations, the FCR approaches ''q''. 1. Sort the p values used for testing the m hypotheses regarding the parameters, ''P''(1) ≤ • • • ≤''P''(''m''). 2. Calculate ''R'' = max. 3. Select the ''R'' parameters for which ''P''(''i'') ≤ ''R'' • ''q''/''m'', corresponding to the rejected hypotheses. 4. Construct a 1 − ''R'' • ''q''/''m'' CI for each parameter selected.


See also

*
False positive rate In statistics, when performing multiple comparisons, a false positive ratio (also known as fall-out or false alarm ratio) is the probability of falsely rejecting the null hypothesis for a particular test. The false positive rate is calculated as th ...
*
Post-hoc analysis In a scientific study, post hoc analysis (from Latin '' post hoc'', "after this") consists of statistical analyses that were specified after the data were seen. They are usually used to uncover specific differences between three or more group mea ...


References

Footnotes Other Sources * {{dead link, date=December 2016 , bot=InternetArchiveBot , fix-attempted=yes Statistical hypothesis testing Multiple comparisons