- Source: Convergent cross mapping
Convergent cross mapping (CCM) is a statistical test for a cause-and-effect relationship between two variables that, like the Granger causality test, seeks to resolve the problem that correlation does not imply causation. While Granger causality is best suited for purely stochastic systems where the influences of the causal variables are separable (independent of each other), CCM is based on the theory of dynamical systems and can be applied to systems where causal variables have synergistic effects. As such, CCM is specifically aimed to identify linkage between variables that can appear uncorrelated with each other.
Theory
In the event one has access to system variables as time series observations, Takens' embedding theorem can be applied. Takens' theorem generically proves that the state space of a dynamical system can be reconstructed from a single observed time series of the system,
X
{\displaystyle X}
. This reconstructed or shadow manifold
M
X
{\displaystyle M_{X}}
is diffeomorphic to the true manifold,
M
{\displaystyle M}
, preserving instrinsic state space properties of
M
{\displaystyle M}
in
M
X
{\displaystyle M_{X}}
.
Convergent Cross Mapping (CCM) leverages a corollary to the Generalized Takens Theorem that it should be possible to cross predict or cross map between variables observed from the same system. Suppose that in some dynamical system involving variables
X
{\displaystyle X}
and
Y
{\displaystyle Y}
,
X
{\displaystyle X}
causes
Y
{\displaystyle Y}
. Since
X
{\displaystyle X}
and
Y
{\displaystyle Y}
belong to the same dynamical system, their reconstructions via embeddings
M
X
{\displaystyle M_{X}}
and
M
Y
{\displaystyle M_{Y}}
, also map to the same system.
The causal variable
X
{\displaystyle X}
leaves a signature on the affected variable
Y
{\displaystyle Y}
, and consequently, the reconstructed states based on
Y
{\displaystyle Y}
can be used to cross predict values of
X
{\displaystyle X}
. CCM leverages this property to infer causality by predicting
X
{\displaystyle X}
using the
M
Y
{\displaystyle M_{Y}}
library of points (or vice-versa for the other direction of causality), while assessing improvements in cross map predictability as larger and larger random samplings of
M
Y
{\displaystyle M_{Y}}
are used. If the prediction skill of
X
{\displaystyle X}
increases and saturates as the entire
M
Y
{\displaystyle M_{Y}}
is used, this provides evidence that
X
{\displaystyle X}
is causally influencing
Y
{\displaystyle Y}
.
Cross mapping is generally asymmetric. If
X
{\displaystyle X}
forces
Y
{\displaystyle Y}
unidirectionally, variable
Y
{\displaystyle Y}
will contain information about
X
{\displaystyle X}
, but not vice versa. Consequently, the state of
X
{\displaystyle X}
can be predicted from
M
Y
{\displaystyle M_{Y}}
, but
Y
{\displaystyle Y}
will not be predictable from
M
X
{\displaystyle M_{X}}
.
Algorithm
The basic steps of convergent cross mapping for a variable
X
{\displaystyle X}
of length
N
{\displaystyle N}
against variable
Y
{\displaystyle Y}
are:
If needed, create the state space manifold
M
Y
{\displaystyle M_{Y}}
from
Y
{\displaystyle Y}
Define a sequence of library subset sizes
L
{\displaystyle L}
ranging from a small fraction of
N
{\displaystyle N}
to close to
N
{\displaystyle N}
.
Define a number of ensembles
N
E
{\displaystyle N_{E}}
to evaluate at each library size.
At each library subset size
L
i
{\displaystyle L_{i}}
:
For
N
E
{\displaystyle N_{E}}
ensembles:
Randomly select
L
i
{\displaystyle L_{i}}
state space vectors from
M
Y
{\displaystyle M_{Y}}
Estimate
X
^
{\displaystyle {\hat {X}}}
from the random subset of
M
Y
{\displaystyle M_{Y}}
using the Simplex state space prediction
Compute the correlation
ρ
{\displaystyle \rho }
between
X
^
{\displaystyle {\hat {X}}}
and
X
{\displaystyle X}
Compute the mean correlation
ρ
¯
{\displaystyle {\bar {\rho }}}
over the
N
E
{\displaystyle N_{E}}
ensembles at
L
i
{\displaystyle L_{i}}
The spectrum of
ρ
¯
{\displaystyle {\bar {\rho }}}
versus
L
{\displaystyle L}
must exhibit convergence.
Assess significance. One technique is to compare
ρ
¯
{\displaystyle {\bar {\rho }}}
to
ρ
S
¯
{\displaystyle {\bar {\rho _{S}}}}
computed from
S
{\displaystyle S}
random realizations (surrogates) of
X
{\displaystyle X}
.
Applications
CCM is used to detect if two variables belong to the same dynamical system, for example, can past ocean surface temperatures be estimated from the population data over time of sardines or if there is a causal relationship between cosmic rays and global temperatures. As for the latter it was hypothesised that cosmic rays may impact cloud formation, therefore cloudiness, therefore global temperatures.
Extensions
Extensions to CCM include:
Extended Convergent Cross Mapping
Convergent Cross Sorting
See also
Empirical dynamic modeling
System dynamics
Complex dynamics
References
Further reading
Chang, CW., Ushio, M. & Hsieh, Ch. (2017). "Empirical dynamic modeling for beginners". Ecol Res. 32 (6): 785–796. doi:10.1007/s11284-017-1469-9. hdl:2433/235326.{{cite journal}}: CS1 maint: multiple names: authors list (link)
Stephan B Munch, Antoine Brias, George Sugihara, Tanya L Rogers (2020). "Frequently asked questions about nonlinear dynamics and empirical dynamic modelling". ICES Journal of Marine Science. 77 (4): 1463–1479. doi:10.1093/icesjms/fsz209.{{cite journal}}: CS1 maint: multiple names: authors list (link)
External links
Animations:
State Space Reconstruction: Time Series and Dynamic Systems on YouTube
State Space Reconstruction: Takens' Theorem and Shadow Manifolds on YouTube
State Space Reconstruction: Convergent Cross Mapping on YouTube