- Source: Standardized mean of a contrast variable
In statistics, the standardized mean of a contrast variable (SMCV or SMC) is a parameter for assessing effect size. The SMCV is defined as the mean divided by the standard deviation of a contrast variable.
The SMCV was first proposed for one-way ANOVA cases and was then extended to multi-factor ANOVA cases.
Background
Consistent interpretations for the strength of group comparison, as represented by a contrast, are important.
When there are only two groups involved in a comparison, SMCV is the same as the strictly standardized mean difference (SSMD). SSMD belongs to a popular type of effect-size measure called "standardized mean differences", which includes Cohen's $d$ and Glass's $\delta$.
In ANOVA, a similar parameter for measuring the strength of group comparison is standardized effect size (SES). One issue with SES is that its values are incomparable for contrasts with different coefficients. SMCV does not have such an issue.
Concept
Suppose the random values in $t$ groups, represented by random variables $G_{1},G_{2},\ldots ,G_{t}$, have means $\mu _{1},\mu _{2},\ldots ,\mu _{t}$ and variances $\sigma _{1}^{2},\sigma _{2}^{2},\ldots ,\sigma _{t}^{2}$, respectively. A contrast variable $V$ is defined by
$$V=\sum _{i=1}^{t}c_{i}G_{i},$$
where the $c_{i}$'s are a set of coefficients representing a comparison of interest and satisfy $\sum _{i=1}^{t}c_{i}=0$. The SMCV of contrast variable $V$, denoted by $\lambda$, is defined as
$$\lambda ={\frac {\operatorname {E} (V)}{\operatorname {stdev} (V)}}={\frac {\sum _{i=1}^{t}c_{i}\mu _{i}}{\sqrt {\operatorname {Var} \left(\sum _{i=1}^{t}c_{i}G_{i}\right)}}}={\frac {\sum _{i=1}^{t}c_{i}\mu _{i}}{\sqrt {\sum _{i=1}^{t}c_{i}^{2}\sigma _{i}^{2}+2\sum _{i=1}^{t-1}\sum _{j=i+1}^{t}c_{i}c_{j}\sigma _{ij}}}}$$
where $\sigma _{ij}$ is the covariance of $G_{i}$ and $G_{j}$. When $G_{1},G_{2},\ldots ,G_{t}$ are independent,
$$\lambda ={\frac {\sum _{i=1}^{t}c_{i}\mu _{i}}{\sqrt {\sum _{i=1}^{t}c_{i}^{2}\sigma _{i}^{2}}}}.$$
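The definition above can be checked numerically. The following is a minimal sketch, not a reference implementation: the function name `smcv` and the example means and covariance matrix are hypothetical, chosen only for illustration. It evaluates $\operatorname{Var}(V)$ as the full double sum over the covariance matrix, which equals the variance terms plus twice the sum over pairs $i<j$.

```python
import math

def smcv(coefs, means, cov):
    """Population SMCV, E(V) / stdev(V), for V = sum_i c_i * G_i.

    cov is the full t x t covariance matrix (variances on the diagonal).
    """
    if abs(sum(coefs)) > 1e-12:
        raise ValueError("contrast coefficients must sum to zero")
    t = len(coefs)
    mean_v = sum(c * m for c, m in zip(coefs, means))
    # Var(V) = sum_i sum_j c_i c_j sigma_ij
    #        = sum_i c_i^2 sigma_i^2 + 2 * sum_{i<j} c_i c_j sigma_ij
    var_v = sum(coefs[i] * coefs[j] * cov[i][j]
                for i in range(t) for j in range(t))
    return mean_v / math.sqrt(var_v)

# Hypothetical example: three independent groups with common variance 4,
# comparing group 1 with the average of groups 2 and 3.
means = [10.0, 8.0, 6.0]
cov = [[4.0, 0.0, 0.0],
       [0.0, 4.0, 0.0],
       [0.0, 0.0, 4.0]]
lam = smcv([1.0, -0.5, -0.5], means, cov)
# The same comparison written with doubled coefficients.
lam_scaled = smcv([2.0, -1.0, -1.0], means, cov)
```

In this example `lam` and `lam_scaled` coincide, because multiplying every coefficient by the same positive constant scales the mean and the standard deviation of $V$ equally.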
Classifying rule for the strength of group comparisons
The population value (denoted by $\lambda$) of SMCV can be used to classify the strength of a comparison represented by a contrast variable, as shown in the following table. This classifying rule has a probabilistic basis due to the link between SMCV and c+-probability.
Statistical estimation and inference
The estimation and inference of SMCV presented below is for one-factor experiments. Estimation and inference of SMCV for multi-factor experiments has also been discussed.
The estimation of SMCV relies on how samples are obtained in a study. When the groups are correlated, it is usually difficult to estimate the covariance among groups. In such a case, a good strategy is to obtain matched or paired samples (or subjects) and to conduct contrast analysis based on the matched samples. A simple example of matched contrast analysis is the analysis of paired difference of drug effects after and before taking a drug in the same patients. By contrast, another strategy is to not match or pair the samples and to conduct contrast analysis based on the unmatched or unpaired samples. A simple example of unmatched contrast analysis is the comparison of efficacy between a new drug taken by some patients and a standard drug taken by other patients. Methods of estimation for SMCV and c+-probability in matched contrast analysis may differ from those used in unmatched contrast analysis.
Unmatched samples
Consider an independent sample of size $n_{i}$,
$$Y_{i}=\left(Y_{i1},Y_{i2},\ldots ,Y_{in_{i}}\right),$$
from the $i$th group $G_{i}$ ($i=1,2,\ldots ,t$). The $Y_{i}$'s are independent. Let
$${\bar {Y}}_{i}={\frac {1}{n_{i}}}\sum _{j=1}^{n_{i}}Y_{ij},\qquad s_{i}^{2}={\frac {1}{n_{i}-1}}\sum _{j=1}^{n_{i}}\left(Y_{ij}-{\bar {Y}}_{i}\right)^{2},\qquad N=\sum _{i=1}^{t}n_{i}$$
and
$${\text{MSE}}={\frac {1}{N-t}}\sum _{i=1}^{t}\left(n_{i}-1\right)s_{i}^{2}.$$
When the $t$ groups have unequal variances, the maximum likelihood estimate (MLE) and the method-of-moments estimate (MM) of SMCV ($\lambda$) are, respectively,
$${\hat {\lambda }}_{\text{MLE}}={\frac {\sum _{i=1}^{t}c_{i}{\bar {Y}}_{i}}{\sqrt {\sum _{i=1}^{t}{\frac {n_{i}-1}{n_{i}}}c_{i}^{2}s_{i}^{2}}}}$$
and
$${\hat {\lambda }}_{\text{MM}}={\frac {\sum _{i=1}^{t}c_{i}{\bar {Y}}_{i}}{\sqrt {\sum _{i=1}^{t}c_{i}^{2}s_{i}^{2}}}}.$$
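The two estimates differ only in how each group variance is weighted in the denominator. The following minimal sketch makes that concrete; the function name `smcv_mle_mm` and the toy data are hypothetical, and `statistics.variance` is used because it applies the $n_i-1$ denominator that defines $s_i^2$.

```python
import math
from statistics import mean, variance

def smcv_mle_mm(coefs, samples):
    """Return (MLE, MM) estimates of SMCV for independent, unmatched groups."""
    # Estimated contrast mean: sum_i c_i * Ybar_i
    num = sum(c * mean(y) for c, y in zip(coefs, samples))
    # MM denominator uses s_i^2 directly.
    mm_var = sum(c ** 2 * variance(y) for c, y in zip(coefs, samples))
    # MLE denominator rescales each s_i^2 by (n_i - 1) / n_i.
    mle_var = sum((len(y) - 1) / len(y) * c ** 2 * variance(y)
                  for c, y in zip(coefs, samples))
    return num / math.sqrt(mle_var), num / math.sqrt(mm_var)

# Hypothetical toy data: two groups of three observations each.
samples = [[1.0, 2.0, 3.0], [0.0, 1.0, 2.0]]
mle, mm = smcv_mle_mm([1.0, -1.0], samples)
```

Since $(n_i-1)/n_i<1$, the MLE denominator is smaller, so the MLE is never closer to zero than the MM estimate for the same data.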
When the $t$ groups have equal variance, under the normality assumption, the uniformly minimum-variance unbiased estimate (UMVUE) of SMCV ($\lambda$) is
$${\hat {\lambda }}_{\text{UMVUE}}={\sqrt {\frac {K}{N-t}}}\,{\frac {\sum _{i=1}^{t}c_{i}{\bar {Y}}_{i}}{\sqrt {\sum _{i=1}^{t}{\text{MSE}}\,c_{i}^{2}}}}$$
where
$$K={\frac {2\left(\Gamma \left({\frac {N-t}{2}}\right)\right)^{2}}{\left(\Gamma \left({\frac {N-t-1}{2}}\right)\right)^{2}}}.$$
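The gamma-function ratio in $K$ overflows quickly if the two gamma values are computed separately, so a numerical version is usually written with log-gamma. The following is a minimal sketch under that assumption; the function name `smcv_umvue_unmatched` and the toy data are hypothetical.

```python
import math
from statistics import mean, variance

def smcv_umvue_unmatched(coefs, samples):
    """UMVUE of SMCV for unmatched groups, assuming equal variances and normality."""
    t = len(samples)
    df = sum(len(y) for y in samples) - t          # N - t
    if df < 2:
        raise ValueError("need N - t >= 2 so that Gamma((N - t - 1) / 2) is finite")
    # Pooled variance estimate: MSE = sum_i (n_i - 1) s_i^2 / (N - t)
    mse = sum((len(y) - 1) * variance(y) for y in samples) / df
    # K = 2 * Gamma(df / 2)^2 / Gamma((df - 1) / 2)^2, computed on the log scale
    k = 2.0 * math.exp(2.0 * (math.lgamma(df / 2) - math.lgamma((df - 1) / 2)))
    num = sum(c * mean(y) for c, y in zip(coefs, samples))
    return math.sqrt(k / df) * num / math.sqrt(mse * sum(c ** 2 for c in coefs))

# Hypothetical toy data: N = 6, t = 2, so N - t = 4.
samples = [[1.0, 2.0, 3.0], [0.0, 1.0, 2.0]]
umvue = smcv_umvue_unmatched([1.0, -1.0], samples)
```

For these toy data $\text{MSE}=1$ and $K=8/\pi$, so the estimate reduces to $1/\sqrt{\pi}$, which gives a quick hand check of the code.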
The confidence interval of SMCV can be made using the following non-central t-distribution:
$$T={\frac {\sum _{i=1}^{t}c_{i}{\bar {Y}}_{i}}{\sqrt {\sum _{i=1}^{t}{\text{MSE}}\,c_{i}^{2}/n_{i}}}}\sim {\text{noncentral }}t(N-t,b\lambda )$$
where
$$b={\sqrt {\frac {\sum _{i=1}^{t}c_{i}^{2}}{\sum _{i=1}^{t}c_{i}^{2}/n_{i}}}}.$$
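The ingredients of this interval are the observed statistic $T$, the degrees of freedom $N-t$, and the multiplier $b$ that links the noncentrality parameter to $\lambda$. The following minimal sketch computes just those three quantities using the standard library; the function name `contrast_t_statistic` and the toy data are hypothetical. Turning them into interval limits additionally requires a noncentral t CDF (for example from a statistics library such as SciPy's `scipy.stats.nct`), which is not shown here.

```python
import math
from statistics import mean, variance

def contrast_t_statistic(coefs, samples):
    """Return (T, df, b), where T ~ noncentral t(df, b * lambda)
    under equal variances and normality."""
    sizes = [len(y) for y in samples]
    df = sum(sizes) - len(samples)                 # N - t
    mse = sum((n - 1) * variance(y) for n, y in zip(sizes, samples)) / df
    num = sum(c * mean(y) for c, y in zip(coefs, samples))
    # Standard error of the estimated contrast mean.
    se = math.sqrt(mse * sum(c ** 2 / n for c, n in zip(coefs, sizes)))
    b = math.sqrt(sum(c ** 2 for c in coefs)
                  / sum(c ** 2 / n for c, n in zip(coefs, sizes)))
    return num / se, df, b

# Hypothetical toy data: two groups of three observations each.
samples = [[1.0, 2.0, 3.0], [0.0, 1.0, 2.0]]
t_stat, df, b = contrast_t_statistic([1.0, -1.0], samples)
```

When all groups have the same size $n$, the expression for $b$ simplifies to $\sqrt{n}$, which is what the toy data give with $n=3$.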
Matched samples
In matched contrast analysis, assume that there are $n$ independent samples $\left(Y_{1j},Y_{2j},\cdots ,Y_{tj}\right)$ from $t$ groups ($G_{i}$'s), where $i=1,2,\cdots ,t;\ j=1,2,\cdots ,n$. Then the $j$th observed value of a contrast $V=\sum _{i=1}^{t}c_{i}G_{i}$ is $v_{j}=\sum _{i=1}^{t}c_{i}Y_{ij}$.
Let ${\bar {V}}$ and $s_{V}^{2}$ be the sample mean and sample variance of the contrast variable $V$, respectively. Under normality assumptions, the UMVUE estimate of SMCV is
$${\hat {\lambda }}_{\text{UMVUE}}={\sqrt {\frac {K}{n-1}}}\,{\frac {\bar {V}}{s_{V}}}$$
where
$$K={\frac {2\left(\Gamma \left({\frac {n-1}{2}}\right)\right)^{2}}{\left(\Gamma \left({\frac {n-2}{2}}\right)\right)^{2}}}.$$
A confidence interval for SMCV can be made using the following non-central t-distribution:
$$T={\frac {\bar {V}}{s_{V}/{\sqrt {n}}}}\sim {\text{noncentral }}t\left(n-1,{\sqrt {n}}\lambda \right).$$
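In the matched case everything is computed from the per-unit contrast values $v_j$, so no covariance between groups has to be estimated. The following minimal sketch shows that workflow; the function name `matched_smcv`, the after-minus-before coefficients and the toy measurements are hypothetical, chosen only for illustration.

```python
import math
from statistics import mean, stdev

def matched_smcv(coefs, rows):
    """rows[j] holds (Y_1j, ..., Y_tj) for matched unit j.

    Returns (UMVUE of SMCV, T), assuming the contrast values are normal.
    """
    # One contrast value per matched unit: v_j = sum_i c_i * Y_ij
    v = [sum(c * y for c, y in zip(coefs, row)) for row in rows]
    n = len(v)
    if n < 3:
        raise ValueError("need n >= 3 so that Gamma((n - 2) / 2) is finite")
    v_bar, s_v = mean(v), stdev(v)                 # stdev uses the n - 1 denominator
    # K = 2 * Gamma((n - 1) / 2)^2 / Gamma((n - 2) / 2)^2, on the log scale
    k = 2.0 * math.exp(2.0 * (math.lgamma((n - 1) / 2) - math.lgamma((n - 2) / 2)))
    umvue = math.sqrt(k / (n - 1)) * v_bar / s_v
    t_stat = v_bar / (s_v / math.sqrt(n))          # ~ noncentral t(n - 1, sqrt(n) * lambda)
    return umvue, t_stat

# Hypothetical before/after measurements on five matched units.
rows = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0), (5.0, 10.0)]
umvue, t_stat = matched_smcv([-1.0, 1.0], rows)    # contrast = after - before
```

Here the contrast values are 1, 2, 3, 4, 5, so $\bar{V}=3$ and $s_V^2=2.5$, which makes the result easy to verify by hand.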
See also
Dual-flashlight plot