- Source: Multifidelity simulation
Multifidelity (or multi-fidelity) methods leverage both low- and high-fidelity data in order to maximize the accuracy of model estimates while minimizing the cost associated with parametrization. They have been successfully used in impedance cardiography, wing-design optimization, robotic learning, and computational biomechanics, and have more recently been extended to human-in-the-loop systems, such as aerospace and transportation. They include model-based methods, where a generative model is available or can be learned, as well as model-free methods, which include regression-based approaches such as stacked regression. A more general class of regression-based multifidelity methods comprises Bayesian approaches, e.g., Bayesian linear regression, Gaussian mixture models, Gaussian processes, auto-regressive Gaussian processes, and Bayesian polynomial chaos expansions.
The approach used depends on the domain and the properties of the data available, and is similar to the concept of metasynthesis proposed by Judea Pearl.
Data fidelity spectrum
The fidelity of data can vary along a spectrum between low- and high-fidelity. The next sections provide examples of data across the fidelity spectrum, while defining the benefits and limitations of each type of data.
Low-fidelity data (LoFi)
Low-fidelity data (LoFi) includes any data produced by a person or stochastic process that deviates from the real-world system of interest. For example, LoFi data can be produced by models of a physical system that use approximations to simulate the system, rather than modeling the system exhaustively.
Moreover, in human-in-the-loop (HITL) situations the goal may be to predict the impact of technology on expert behavior within the real-world operational context. Machine learning can be used to train statistical models that predict expert behavior, provided that an adequate amount of high-fidelity (i.e., real-world) data are available or can be produced.
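As a rough sketch of this idea, a standard supervised learner can stand in for such a statistical model of expert behavior. Everything below is a synthetic stand-in chosen for illustration, not data or an API from any real HITL study:

import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Hypothetical HITL dataset: each row is an operational state (context
# features) and each label is the expert's chosen action in that state.
# All values here are synthetic stand-ins for real high-fidelity data.
rng = np.random.default_rng(1)
states = rng.normal(size=(200, 4))
actions = (states[:, 0] + states[:, 1] > 0).astype(int)

# Train a statistical model of expert behavior on the HiFi data.
model = RandomForestClassifier(n_estimators=100).fit(states, actions)

# Predict the expert's likely action in an unseen operational state.
new_state = rng.normal(size=(1, 4))
print(model.predict(new_state))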
LoFi benefits and limitations
In situations where there is not an adequate amount of high-fidelity data available to train the model, low-fidelity data can sometimes be used instead. For example, low-fidelity data can be acquired by using a distributed simulation platform, such as X-Plane, and having novice participants operate in scenarios that approximate the real-world context. The benefit of using low-fidelity data is that they are relatively inexpensive to acquire, so it is possible to elicit larger amounts of data. However, the limitation is that low-fidelity data may not be useful for predicting real-world expert (i.e., high-fidelity) performance, due to differences between the low-fidelity simulation platform and the real-world context, or between novice and expert performance (e.g., due to training).
High-fidelity data (HiFi)
High-fidelity data (HiFi) includes data produced by a person or stochastic process that closely matches the operational context of interest. For example, in wing-design optimization, high-fidelity data come from physics-based simulation models whose results closely match the behavior of the wing in a comparable real-world setting. In HITL situations, HiFi data would be produced by an operational expert acting in the technological and situational context of interest.
HiFi benefits and limitations
An obvious benefit of utilizing high-fidelity data is that the estimates produced by the model should generalize well to the real-world context. However, these data are expensive in terms of both time and money, which limits the amount of data that can be obtained. The limited amount of data available can significantly impair the ability of the model to produce valid estimates.
Multifidelity methods (MfM)
Multifidelity methods attempt to leverage the strengths of each data source while overcoming its limitations. Although small to medium differences between low- and high-fidelity data can sometimes be overcome by multifidelity models, large differences (e.g., in the KL divergence between novice and expert action distributions) can be problematic, leading to decreased predictive performance compared to models that rely exclusively on high-fidelity data.
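To make the fidelity-gap measure concrete, the sketch below estimates the KL divergence between hypothetical expert and novice action distributions over a shared discrete action space; the probability values are made up purely for illustration:

import numpy as np
from scipy.stats import entropy

# Hypothetical empirical action distributions over a shared discrete
# action space: one estimated from novice (low-fidelity) trials, one
# from expert (high-fidelity) trials. The numbers are illustrative.
novice_actions = np.array([0.50, 0.30, 0.15, 0.05])
expert_actions = np.array([0.20, 0.25, 0.35, 0.20])

# KL divergence D(expert || novice): how poorly the novice distribution
# approximates the expert one. A large value signals a fidelity gap
# that a multifidelity model may not be able to bridge.
kl_nats = entropy(expert_actions, novice_actions)
print(f"KL(expert || novice) = {kl_nats:.3f} nats")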
Multifidelity models enable low-fidelity data to be collected on different technology concepts to evaluate the risk associated with each concept before actually deploying the system.
Bayesian auto-regressive Gaussian processes
In an auto-regressive model of $N_t$ Gaussian processes (GP), each level of output fidelity $t$, where a higher $t$ denotes a higher fidelity, is modeled as a GP $z_t(x)$, which can be expressed in terms of the previous level's GP $z_{t-1}(x)$, a proportionality constant $\rho_{t-1}$, and a "difference GP" $\delta_t(x)$ as follows:
$z_1(x) = \delta_1(x)$
$z_t(x) = \rho_{t-1}\,z_{t-1}(x) + \delta_t(x)$
The scaling constant $\rho_{t-1}$ quantifies the correlation between levels $t$ and $t-1$, and can in general depend on $x$.
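A minimal two-level sketch of this recursion (assuming a hand-picked, constant $\rho$ and toy low- and high-fidelity functions; in practice $\rho$ and the kernel hyperparameters are estimated jointly from the data) might look as follows:

import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

# Toy low- and high-fidelity versions of the same function; both
# functions are assumptions chosen purely for illustration.
def f_lo(x):
    return 0.5 * np.sin(8 * x) + 0.2 * x   # cheap, biased approximation

def f_hi(x):
    return np.sin(8 * x) + x               # expensive "ground truth"

rng = np.random.default_rng(0)
X_lo = rng.uniform(0.0, 1.0, (40, 1))      # plentiful low-fidelity inputs
X_hi = rng.uniform(0.0, 1.0, (6, 1))       # scarce high-fidelity inputs

# Level 1: fit z_1(x) = delta_1(x) to the low-fidelity data.
gp_lo = GaussianProcessRegressor(kernel=RBF()).fit(X_lo, f_lo(X_lo).ravel())

# Level 2: with a fixed scaling constant rho (hand-picked here), fit the
# difference GP delta_2(x) to the high-fidelity residuals.
rho = 2.0
residuals = f_hi(X_hi).ravel() - rho * gp_lo.predict(X_hi)
gp_delta = GaussianProcessRegressor(kernel=RBF()).fit(X_hi, residuals)

# Multifidelity prediction z_2(x) = rho * z_1(x) + delta_2(x).
X_test = np.linspace(0.0, 1.0, 5).reshape(-1, 1)
z_2 = rho * gp_lo.predict(X_test) + gp_delta.predict(X_test)
print(z_2)

Here the GP fitted on the plentiful low-fidelity data plays the role of $z_1(x)$, and the residual GP fitted on the scarce high-fidelity points plays the role of the difference GP $\delta_2(x)$.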
Under the assumption that all information about a level is contained in the data corresponding to the same pivot point $x$ at level $t$ as well as $t-1$, semi-analytical first and second moments are feasible. Formally, this assumption is
$\operatorname{Cov}\!\big(z_t(x),\,z_{t-1}(x') \mid z_{t-1}(x)\big) = 0$
That is, given data at $x$ on level $t-1$, there is no further information about level $t$ to be extracted from data at another point $x'$ on level $t-1$.
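One useful consequence, assuming the difference GP $\delta_t$ is independent of $z_{t-1}$ (a standard assumption in this auto-regressive construction, though not stated explicitly above), is that the covariance of the level-$t$ process decomposes recursively:
$\operatorname{Cov}\!\big(z_t(x),\,z_t(x')\big) = \rho_{t-1}^{2}\,\operatorname{Cov}\!\big(z_{t-1}(x),\,z_{t-1}(x')\big) + k_{\delta_t}(x,x')$
where $k_{\delta_t}$ denotes the covariance kernel of $\delta_t$: each fidelity level contributes a scaled copy of the previous level's covariance plus its own difference-kernel term.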