- Source: Deficiency (statistics)
In statistics, the deficiency is a measure to compare a statistical model with another statistical model. The concept was introduced in the 1960s by the french mathematician Lucien Le Cam, who used it to prove an approximative version of the Blackwell–Sherman–Stein theorem. Closely related is the Le Cam distance, a pseudometric for the maximum deficiency between two statistical models. If the deficiency of a model
E
{\displaystyle {\mathcal {E}}}
in relation to
F
{\displaystyle {\mathcal {F}}}
is zero, then one says
E
{\displaystyle {\mathcal {E}}}
is better or more informative or stronger than
F
{\displaystyle {\mathcal {F}}}
.
Introduction
Le Cam defined the statistical model more abstract than a probability space with a family of probability measures. He also didn't use the term "statistical model" and instead used the term "experiment". In his publication from 1964 he introduced the statistical experiment to a parameter set
Θ
{\displaystyle \Theta }
as a triple
(
X
,
E
,
(
P
θ
)
θ
∈
Θ
)
{\displaystyle (X,E,(P_{\theta })_{\theta \in \Theta })}
consisting of a set
X
{\displaystyle X}
, a vector lattice
E
{\displaystyle E}
with unit
I
{\displaystyle I}
and a family of normalized positive functionals
(
P
θ
)
θ
∈
Θ
{\displaystyle (P_{\theta })_{\theta \in \Theta }}
on
E
{\displaystyle E}
. In his book from 1986 he omitted
E
{\displaystyle E}
and
X
{\displaystyle X}
.
This article follows his definition from 1986 and uses his terminology to emphasize the generalization.
Formulation
= Basic concepts
=Let
Θ
{\displaystyle \Theta }
be a parameter space. Given an abstract L1-space
(
L
,
‖
⋅
‖
)
{\displaystyle (L,\|\cdot \|)}
(i.e. a Banach lattice such that for elements
x
,
y
≥
0
{\displaystyle x,y\geq 0}
also
‖
x
+
y
‖
=
‖
x
‖
+
‖
y
‖
{\displaystyle \|x+y\|=\|x\|+\|y\|}
holds) consisting of lineare positive functionals
{
P
θ
:
θ
∈
Θ
}
{\displaystyle \{P_{\theta }:\theta \in \Theta \}}
. An experiment
E
{\displaystyle {\mathcal {E}}}
is a map
E
:
Θ
→
L
{\displaystyle {\mathcal {E}}:\Theta \to L}
of the form
θ
↦
P
θ
{\displaystyle \theta \mapsto P_{\theta }}
, such that
‖
P
θ
‖
=
1
{\displaystyle \|P_{\theta }\|=1}
.
L
{\displaystyle L}
is the band induced by
{
P
θ
:
θ
∈
Θ
}
{\displaystyle \{P_{\theta }:\theta \in \Theta \}}
and therefore we use the notation
L
(
E
)
{\displaystyle L({\mathcal {E}})}
. For a
μ
∈
L
(
E
)
{\displaystyle \mu \in L({\mathcal {E}})}
denote the
μ
+
=
μ
∨
0
=
max
(
μ
,
0
)
{\displaystyle \mu ^{+}=\mu \vee 0=\max(\mu ,0)}
. The topological dual
M
{\displaystyle M}
of an L-space with the conjugated norm
‖
u
‖
M
=
sup
{
|
⟨
u
,
μ
⟩
|
;
‖
μ
‖
L
≤
1
}
{\displaystyle \|u\|_{M}=\sup\{|\langle u,\mu \rangle |;\|\mu \|_{L}\leq 1\}}
is called an abstract M-space. It's also a lattice with unit defined through
I
μ
=
‖
μ
+
‖
L
−
‖
μ
−
‖
L
{\displaystyle I\mu =\|\mu ^{+}\|_{L}-\|\mu ^{-}\|_{L}}
for
μ
∈
L
{\displaystyle \mu \in L}
.
Let
L
(
A
)
{\displaystyle L(A)}
and
L
(
B
)
{\displaystyle L(B)}
be two L-space of two experiments
A
{\displaystyle A}
and
B
{\displaystyle B}
, then one calls a positive, norm-preserving linear map, i.e.
‖
T
μ
+
‖
=
‖
μ
+
‖
{\displaystyle \|T\mu ^{+}\|=\|\mu ^{+}\|}
for all
μ
∈
L
(
A
)
{\displaystyle \mu \in L(A)}
, a transition. The adjoint of a transitions is a positive linear map from the dual space
M
B
{\displaystyle M_{B}}
of
L
(
B
)
{\displaystyle L(B)}
into the dual space
M
A
{\displaystyle M_{A}}
of
L
(
A
)
{\displaystyle L(A)}
, such that the unit of
M
A
{\displaystyle M_{A}}
is the image of the unit of
M
B
{\displaystyle M_{B}}
ist.
= Deficiency
=Let
Θ
{\displaystyle \Theta }
be a parameter space and
E
:
θ
→
P
θ
{\displaystyle {\mathcal {E}}:\theta \to P_{\theta }}
and
F
:
θ
→
Q
θ
{\displaystyle {\mathcal {F}}:\theta \to Q_{\theta }}
be two experiments indexed by
Θ
{\displaystyle \Theta }
. Le
L
(
E
)
{\displaystyle L({\mathcal {E}})}
and
L
(
F
)
{\displaystyle L({\mathcal {F}})}
denote the corresponding L-spaces and let
T
{\displaystyle {\mathcal {T}}}
be the set of all transitions from
L
(
E
)
{\displaystyle L({\mathcal {E}})}
to
L
(
F
)
{\displaystyle L({\mathcal {F}})}
.
The deficiency
δ
(
E
,
F
)
{\displaystyle \delta ({\mathcal {E}},{\mathcal {F}})}
of
E
{\displaystyle {\mathcal {E}}}
in relation to
F
{\displaystyle {\mathcal {F}}}
is the number defined in terms of inf sup:
δ
(
E
,
F
)
:=
inf
T
∈
T
sup
θ
∈
Θ
1
2
‖
Q
θ
−
T
P
θ
‖
TV
,
{\displaystyle \delta ({\mathcal {E}},{\mathcal {F}}):=\inf \limits _{T\in {\mathcal {T}}}\sup \limits _{\theta \in \Theta }{\tfrac {1}{2}}\|Q_{\theta }-TP_{\theta }\|_{\text{TV}},}
where
‖
⋅
‖
TV
{\displaystyle \|\cdot \|_{\text{TV}}}
denoted the total variation norm
‖
μ
‖
TV
=
μ
+
+
μ
−
{\displaystyle \|\mu \|_{\text{TV}}=\mu ^{+}+\mu ^{-}}
. The factor
1
2
{\displaystyle {\tfrac {1}{2}}}
is just for computational purposes and is sometimes omitted.
= Le Cam distance
=The Le Cam distance is the following pseudometric
Δ
(
E
,
F
)
:=
max
(
δ
(
E
,
F
)
,
δ
(
F
,
E
)
)
.
{\displaystyle \Delta ({\mathcal {E}},{\mathcal {F}}):=\operatorname {max} \left(\delta ({\mathcal {E}},{\mathcal {F}}),\delta ({\mathcal {F}},{\mathcal {E}})\right).}
This induces an equivalence relation and when
Δ
(
E
,
F
)
=
0
{\displaystyle \Delta ({\mathcal {E}},{\mathcal {F}})=0}
, then one says
E
{\displaystyle {\mathcal {E}}}
and
F
{\displaystyle {\mathcal {F}}}
are equivalent. The equivalent class
C
E
{\displaystyle C_{\mathcal {E}}}
of
E
{\displaystyle {\mathcal {E}}}
is also called the type of
E
{\displaystyle {\mathcal {E}}}
.
Often one is interested in families of experiments
(
E
n
)
n
{\displaystyle ({\mathcal {E}}_{n})_{n}}
with
{
P
n
,
θ
:
θ
∈
Θ
n
}
{\displaystyle \{P_{n,\theta }\colon \theta \in \Theta _{n}\}}
and
(
F
n
)
n
{\displaystyle ({\mathcal {F}}_{n})_{n}}
with
{
Q
n
,
θ
:
θ
∈
Θ
n
}
{\displaystyle \{Q_{n,\theta }\colon \theta \in \Theta _{n}\}}
. If
Δ
(
E
n
,
F
n
)
=
0
{\displaystyle \Delta ({\mathcal {E}}_{n},{\mathcal {F}}_{n})=0}
as
n
→
∞
{\displaystyle n\to \infty }
, then one says
(
E
n
)
{\displaystyle ({\mathcal {E}}_{n})}
and
(
F
n
)
{\displaystyle ({\mathcal {F}}_{n})}
are asymptotically equivalent.
Let
Θ
{\displaystyle \Theta }
be a parameter space and
E
(
Θ
)
{\displaystyle E(\Theta )}
be the set of all types that are induced by
Θ
{\displaystyle \Theta }
, then the Le Cam distance
Δ
{\displaystyle \Delta }
is complete with respect to
E
(
Θ
)
{\displaystyle E(\Theta )}
. The condition
δ
(
E
,
F
)
=
0
{\displaystyle \delta ({\mathcal {E}},{\mathcal {F}})=0}
induces a partial order on
E
(
Θ
)
{\displaystyle E(\Theta )}
, one says
E
{\displaystyle {\mathcal {E}}}
is better or more informative or stronger than
F
{\displaystyle {\mathcal {F}}}
.
References
Bibliography
Le Cam, Lucien (1986). Asymptotic methods in statistical decision theory. Springer Series in Statistics. Springer, New York. doi:10.1007/978-1-4612-4946-7.
Le Cam, Lucien (1964). "Sufficiency and Approximate Sufficiency". Annals of Mathematical Statistics. 35 (4). Institute of Mathematical Statistics: 1419–1455. doi:10.1214/aoms/1177700372.
Torgersen, Erik (1991). Comparison of Statistical Experiments. Cambridge University Press, United Kingdom. doi:10.1017/CBO9780511666353.
Kata Kunci Pencarian:
- AIDS
- Generasi Z
- HIV dan lelaki seks lelaki
- Sianokobalamin
- Kanker
- Selenium
- Kegemukan
- Molibdenum
- Garam beriodin
- Deficiency (statistics)
- Deficiency
- Vitamin D deficiency in Australia
- Malnutrition
- Retinol
- Attention deficit hyperactivity disorder
- HIV/AIDS
- Folate
- Haemophilia A
- Confidence interval